[ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13036890#comment-13036890
 ] 

Hudson commented on MAPREDUCE-2459:
-----------------------------------

Integrated in Hadoop-Mapreduce-trunk-Commit #690 (See 
[https://builds.apache.org/hudson/job/Hadoop-Mapreduce-trunk-Commit/690/])
    MAPREDUCE-2459. Cache HAR filesystem metadata. (Mac Yang via mahadev)

mahadev : 
http://svn.apache.org/viewcvs.cgi/?root=Apache-SVN&view=rev&rev=1125428
Files : 
* /hadoop/mapreduce/trunk/CHANGES.txt
* /hadoop/mapreduce/trunk/src/tools/org/apache/hadoop/fs/HarFileSystem.java


> Cache HAR filesystem metadata
> -----------------------------
>
>                 Key: MAPREDUCE-2459
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
>             Project: Hadoop Map/Reduce
>          Issue Type: Improvement
>          Components: harchive
>            Reporter: Mac Yang
>            Assignee: Mac Yang
>             Fix For: 0.23.0
>
>         Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch
>
>
> Each HAR file system has two index files that contains information on how 
> files are stored in the part files. During the block location calculation, 
> these indexes are reread for every file in the archive. Caching the indexes 
> and the status of the part files will greatly reduce the number of name node 
> operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to