[ 
https://issues.apache.org/jira/browse/HADOOP-6870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12893320#action_12893320
 ] 

Dmytro Molkov commented on HADOOP-6870:
---------------------------------------

I looked over the patch and overall looks good to me.
The only question I had is the hash/equals of the LocatedFileStatus. Do you 
ever see a need to have two instances of LocatedFileStatus that differ only in 
block locations be not equal? Because right now the equality is the same as the 
FileStatus equality and doesn't take block locations into account, right?

> Add FileSystem#listLocatedStatus to list a directory's content together with 
> each file's block locations
> --------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-6870
>                 URL: https://issues.apache.org/jira/browse/HADOOP-6870
>             Project: Hadoop Common
>          Issue Type: New Feature
>          Components: fs
>    Affects Versions: 0.22.0
>            Reporter: Hairong Kuang
>            Assignee: Hairong Kuang
>             Fix For: 0.22.0
>
>         Attachments: listFiles.patch, listFiles1.patch, listFiles2.patch, 
> listFiles3.patch, listFiles4.patch
>
>
> This jira implements the new FileSystem API as proposed in HDFS-202. The new 
> API aims to eliminate individual "getFileBlockLocations" calls to NN for each 
> file in the input directory of a job. Instead, a file's block locations are 
> returned together with FileStatus when listing a directory, thus improving 
> getSplits performance.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to