Hi Gang, Answers inline. On 3/16/10 9:58 AM, "Gang Luo" <lgpub...@yahoo.com.cn> wrote:
Hi all, what is the life length of the distributed cache files? Localized cache file will be removed, if the file is not used by any job and localized disk space on the machine goes higher than configured local.cache.size(by default, 10 GB). Will hadoop redistributed the same file to the same node twice if it is being used by two jobs? No, It will be localized only once. Both the jobs will use the same localized file. If the file gets modified on DFS, then it will be localized once again. Thanks Amareshwari