[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036890#comment-13036890 ]
Hudson commented on MAPREDUCE-2459: ----------------------------------- Integrated in Hadoop-Mapreduce-trunk-Commit #690 (See [https://builds.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/690/]) MAPREDUCE-2459. Cache HAR filesystem metadata. (Mac Yang via mahadev) mahadev : http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1125428 Files : * /hadoop/mapreduce/trunk/CHANGES.txt * /hadoop/mapreduce/trunk/src/tools/org/apache/hadoop/fs/HarFileSystem.java > Cache HAR filesystem metadata > ----------------------------- > > Key: MAPREDUCE-2459 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: harchive > Reporter: Mac Yang > Assignee: Mac Yang > Fix For: 0.23.0 > > Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch > > > Each HAR file system has two index files that contains information on how > files are stored in the part files. During the block location calculation, > these indexes are reread for every file in the archive. Caching the indexes > and the status of the part files will greatly reduce the number of name node > operations during the job setup time. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira