[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-20 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2459:
-

  Resolution: Fixed
Hadoop Flags: [Reviewed]
  Status: Resolved  (was: Patch Available)

I just committed this to trunk. Thanks mac!

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-13 Thread Mac Yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mac Yang updated MAPREDUCE-2459:


Status: Open  (was: Patch Available)

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-13 Thread Mac Yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mac Yang updated MAPREDUCE-2459:


Status: Patch Available  (was: Open)

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-13 Thread Mac Yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mac Yang updated MAPREDUCE-2459:


Attachment: MAPREDUCE-2459.2.patch

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch, MAPREDUCE-2459.2.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-12 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2459:
-

Affects Version/s: 0.23.0

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-05-12 Thread Mahadev konar (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated MAPREDUCE-2459:
-

Affects Version/s: (was: 0.23.0)
Fix Version/s: 0.23.0

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Fix For: 0.23.0

 Attachments: MAPREDUCE-2459.1.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-04-29 Thread Mac Yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mac Yang updated MAPREDUCE-2459:


Attachment: MAPREDUCE-2459.1.patch

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Attachments: MAPREDUCE-2459.1.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (MAPREDUCE-2459) Cache HAR filesystem metadata

2011-04-29 Thread Mac Yang (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAPREDUCE-2459?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mac Yang updated MAPREDUCE-2459:


Status: Patch Available  (was: Open)

 Cache HAR filesystem metadata
 -

 Key: MAPREDUCE-2459
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-2459
 Project: Hadoop Map/Reduce
  Issue Type: Improvement
  Components: harchive
Reporter: Mac Yang
Assignee: Mac Yang
 Attachments: MAPREDUCE-2459.1.patch


 Each HAR file system has two index files that contains information on how 
 files are stored in the part files. During the block location calculation, 
 these indexes are reread for every file in the archive. Caching the indexes 
 and the status of the part files will greatly reduce the number of name node 
 operations during the job setup time.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira