[ 
https://issues.apache.org/jira/browse/HADOOP-6467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Mahadev konar updated HADOOP-6467:
----------------------------------

    Attachment: Archives_performance.docx

An updated doc that does a much better job of explaining the current 
architecture of hadoop archives and also points out the performance drawback 
and the solution for which this jira has been created. comments are welcome.

> Performance improvement for liststatus on directories in hadoop archives.
> -------------------------------------------------------------------------
>
>                 Key: HADOOP-6467
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6467
>             Project: Hadoop Common
>          Issue Type: Improvement
>            Reporter: Mahadev konar
>            Assignee: Mahadev konar
>             Fix For: 0.22.0
>
>         Attachments: Archives_performance.docx, Archives_performance.docx
>
>
> A liststatus call on a directory in hadoop archives leads to ( 2* number of 
> files in directory) open calls to the namenode. This is very sub optimal and 
> needs to be fixed to make it performant enough to be used on a daily basis. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to