[ 
https://issues.apache.org/jira/browse/HADOOP-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12559293#action_12559293
 ] 

Raghu Angadi commented on HADOOP-2566:
--------------------------------------

> I'm not sure I completely understand the distinction. In one case are you 
> passing a path without any meta characters but that does not exist, and in 
> the other one with metacharacters but that matches no files?

Yes. For example, the following two commands write different messages to stderr:
- {{bin/hadoop fs -cat '/tmp/nonexistent*' /tmp/exists}}
- {{bin/hadoop fs -cat '/tmp/nonexistent' /tmp/exists}}

This is the current behavior.

> If the distinction is important then perhaps the non-existing file case 
> should return null, while the non-matching expression case should return an 
> empty array.
Sounds good. globPaths() can use this to keep the current behavior unchanged.
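
To make the proposed convention concrete, here is a rough sketch of how a caller could tell the two cases apart. It is not the Hadoop implementation: the GlobSketch class, the in-memory FILES set, the String[] return type (standing in for FileStatus[]), the support for only '*' and '?', and the message texts in explain() are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;
import java.util.regex.Pattern;

public class GlobSketch {
    // Hypothetical in-memory stand-in for the file system contents.
    private static final Set<String> FILES =
        new TreeSet<>(Arrays.asList("/tmp/exists", "/tmp/other"));

    // Only '*' and '?' are treated as metacharacters in this sketch.
    static boolean hasGlobChars(String path) {
        return path.indexOf('*') >= 0 || path.indexOf('?') >= 0;
    }

    // Returns null for a literal path that does not exist, and an empty
    // array for a pattern that matches nothing (the convention proposed above).
    static String[] globStatus(String pattern) {
        if (!hasGlobChars(pattern)) {
            return FILES.contains(pattern) ? new String[] { pattern } : null;
        }
        // Translate the glob into a regex one character at a time.
        StringBuilder regex = new StringBuilder();
        for (char c : pattern.toCharArray()) {
            if (c == '*') {
                regex.append("[^/]*");
            } else if (c == '?') {
                regex.append("[^/]");
            } else {
                regex.append(Pattern.quote(String.valueOf(c)));
            }
        }
        Pattern compiled = Pattern.compile(regex.toString());
        List<String> matches = new ArrayList<>();
        for (String file : FILES) {
            if (compiled.matcher(file).matches()) {
                matches.add(file);
            }
        }
        return matches.toArray(new String[0]);
    }

    // A caller can report the two failure cases differently.
    // The message texts are made up for this sketch.
    static String explain(String pattern) {
        String[] result = globStatus(pattern);
        if (result == null) {
            return "no such file: " + pattern;
        }
        if (result.length == 0) {
            return "pattern matched nothing: " + pattern;
        }
        return result.length + " match(es) for " + pattern;
    }
}
```

With this shape, a wrapper such as globPaths() could branch on null versus an empty array to produce whichever result callers currently expect in each case.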

> need FileSystem#globStatus method
> ---------------------------------
>
>                 Key: HADOOP-2566
>                 URL: https://issues.apache.org/jira/browse/HADOOP-2566
>             Project: Hadoop
>          Issue Type: Improvement
>          Components: fs
>            Reporter: Doug Cutting
>            Assignee: Hairong Kuang
>             Fix For: 0.16.0
>
>
> To remove the cache of FileStatus in DFSPath (HADOOP-2565) without hurting 
> performance, we must use file enumeration APIs that return FileStatus[] 
> rather than Path[].  Currently we have FileSystem#globPaths(), but that 
> method should be deprecated and replaced with a FileSystem#globStatus().
> We need to deprecate FileSystem#globPaths() in 0.16 in order to remove the 
> cache in 0.17.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.