[ 
https://issues.apache.org/jira/browse/HADOOP-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559299#action_12559299
 ] 

Doug Cutting commented on HADOOP-2566:
--------------------------------------

A few comments:
- should stat2paths be a public method on FileSystem?  I'd prefer it were 
either private or perhaps on FileUtil.
- globPaths() isn't deprecated.  Do we think we'll keep this, or should it be 
deprecated?  It is handy in some cases, but, on the other hand, we'd like to 
force folks to examine their uses of it, since in most cases performance will 
become abysmal once the FileStatus cache is removed, and we don't want to 
surprise folks with that.  Thoughts?


> need FileSystem#globStatus method
> ---------------------------------
>
>                 Key: HADOOP-2566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2566
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Doug Cutting
>            Assignee: Hairong Kuang
>             Fix For: 0.16.0
>
>         Attachments: globStatus.patch
>
>
> To remove the cache of FileStatus in DFSPath (HADOOP-2565) without hurting 
> performance, we must use file enumeration APIs that return FileStatus[] 
> rather than Path[].  Currently we have FileSystem#globPaths(), but that 
> method should be deprecated and replaced with a FileSystem#globStatus().
> We need to deprecate FileSystem#globPaths() in 0.16 in order to remove the 
> cache in 0.17.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to