[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mahadev konar updated MAPREDUCE-2459:

    Resolution: Fixed
    Hadoop Flags: [Reviewed]
    Status: Resolved (was: Patch Available)

I just committed this to trunk. Thanks mac!

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch

Each HAR file system has two index files that contain information on how files are stored in the part files. During block location calculation, these indexes are reread for every file in the archive. Caching the indexes and the status of the part files will greatly reduce the number of name node operations during job setup.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira
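To illustrate the idea in the issue description, here is a minimal sketch of a per-archive metadata cache. It is not the code from the attached patches: the class `HarMetadataCacheSketch`, its nested `ArchiveMetadata` type, the loader function, and the timestamp-based invalidation are all hypothetical names and assumptions chosen for illustration, and the sketch deliberately avoids the Hadoop API so it can run on its own.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.AtomicInteger;
import java.util.function.Function;

/** Hypothetical illustration only; not the HarFileSystem change from the patch. */
final class HarMetadataCacheSketch {

    /** Simplified stand-in for the parsed contents of an archive's index files. */
    static final class ArchiveMetadata {
        final long indexTimestamp;            // index modification time when it was parsed
        final Map<String, String> fileToPart; // archived file path -> part file holding it

        ArchiveMetadata(long indexTimestamp, Map<String, String> fileToPart) {
            this.indexTimestamp = indexTimestamp;
            this.fileToPart = fileToPart;
        }
    }

    private final Map<String, ArchiveMetadata> cache = new ConcurrentHashMap<>();
    // Stand-in for the expensive step: opening and parsing the index files.
    private final Function<String, Map<String, String>> indexLoader;
    private final AtomicInteger loads = new AtomicInteger();

    HarMetadataCacheSketch(Function<String, Map<String, String>> indexLoader) {
        this.indexLoader = indexLoader;
    }

    /**
     * Returns the cached metadata for an archive, parsing the index only when
     * nothing is cached or the supplied timestamp differs from the cached one.
     * Assumption: the caller already knows the current index timestamp.
     * Two threads racing on a miss may both load; the result is still correct.
     */
    ArchiveMetadata get(String archiveUri, long currentIndexTimestamp) {
        ArchiveMetadata cached = cache.get(archiveUri);
        if (cached != null && cached.indexTimestamp == currentIndexTimestamp) {
            return cached; // hit: no index reread for this file
        }
        loads.incrementAndGet();
        ArchiveMetadata fresh =
            new ArchiveMetadata(currentIndexTimestamp, indexLoader.apply(archiveUri));
        cache.put(archiveUri, fresh);
        return fresh;
    }

    /** Number of times the index had to be parsed. */
    int loadCount() {
        return loads.get();
    }
}
```

With a cache of this shape, looking up block locations for every file in one archive parses the index once rather than once per file; a changed timestamp forces a single reload.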
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mac Yang updated MAPREDUCE-2459:

    Status: Open (was: Patch Available)

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mac Yang updated MAPREDUCE-2459:

    Status: Patch Available (was: Open)

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mac Yang updated MAPREDUCE-2459:

    Attachment: MAPREDUCE-2459.2.patch

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mahadev konar updated MAPREDUCE-2459:

    Affects Version/s: 0.23.0

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mahadev konar updated MAPREDUCE-2459:

    Affects Version/s: (was: 0.23.0)
    Fix Version/s: 0.23.0

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Fix For: 0.23.0
    Attachments: MAPREDUCE-2459.1.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mac Yang updated MAPREDUCE-2459:

    Attachment: MAPREDUCE-2459.1.patch

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Attachments: MAPREDUCE-2459.1.patch
[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata
[ https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Mac Yang updated MAPREDUCE-2459:

    Status: Patch Available (was: Open)

Cache HAR filesystem metadata

    Key: MAPREDUCE-2459
    URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
    Project: Hadoop Map/Reduce
    Issue Type: Improvement
    Components: harchive
    Reporter: Mac Yang
    Assignee: Mac Yang
    Attachments: MAPREDUCE-2459.1.patch