[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Mac Yang updated MAPREDUCE-2459: -------------------------------- Status: Patch Available (was: Open) > Cache HAR filesystem metadata > ----------------------------- > > Key: MAPREDUCE-2459 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: harchive > Reporter: Mac Yang > Assignee: Mac Yang > Fix For: 0.23.0 > > Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch > > > Each HAR file system has two index files that contains information on how > files are stored in the part files. During the block location calculation, > these indexes are reread for every file in the archive. Caching the indexes > and the status of the part files will greatly reduce the number of name node > operations during the job setup time. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira