Cheolsoo Park created PIG-2856:
----------------------------------
Summary: AvroStorage doesn't load files in the directories when a
glob pattern matches both files and directories.
Key: PIG-2856
URL: https://issues.apache.org/jira/browse/PIG-2856
Project: Pig
Issue Type: Bug
Components: piggybank
Affects Versions: 0.10.0
Reporter: Cheolsoo Park
Assignee: Cheolsoo Park
This is a regression from PIG-2492.
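When a glob pattern such as '*' matches both files and directories,
AvroStorage does not load the files inside the matched directories. The bug
is in getAllSubDirs(): for each matched directory it calls listStatus() on the
original glob path instead of on the directory itself. It can be fixed as
follows: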
{code}
static boolean getAllSubDirs(Path path, Job job, Set<Path> paths)
    ...
    FileStatus[] matchedFiles = fs.globStatus(path, PATH_FILTER);
    ...
    for (FileStatus file : matchedFiles) {
        if (file.isDir()) {
-           for (FileStatus sub : fs.listStatus(path)) {
+           for (FileStatus sub : fs.listStatus(file.getPath())) {
                getAllSubDirs(sub.getPath(), job, paths);
            }
        }
    }
{code}
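To illustrate the intended behavior outside of Pig, here is a minimal sketch
that uses java.nio.file as a stand-in for the Hadoop FileSystem API. It is not
the AvroStorage code: the class GlobExpansionSketch and the methods expandGlob()
and collectFiles() are hypothetical names, and the sketch assumes a local
directory tree without symlink cycles. The point it shows is the same as the
patch above: when a glob match is a directory, list that matched directory
itself, not the original glob path.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical stand-in for the glob expansion; not the piggybank implementation.
final class GlobExpansionSketch {

    // Adds 'matched' to 'paths' if it is a file; otherwise recurses into it.
    static void collectFiles(Path matched, Set<Path> paths) throws IOException {
        if (Files.isDirectory(matched)) {
            // List the matched directory itself (the analogue of
            // fs.listStatus(file.getPath()) in the patch).
            try (DirectoryStream<Path> children = Files.newDirectoryStream(matched)) {
                for (Path child : children) {
                    collectFiles(child, paths);
                }
            }
        } else {
            paths.add(matched);
        }
    }

    // Expands 'glob' inside 'dir' and returns every file reachable from the matches.
    static Set<Path> expandGlob(Path dir, String glob) throws IOException {
        Set<Path> paths = new TreeSet<>();
        try (DirectoryStream<Path> matches = Files.newDirectoryStream(dir, glob)) {
            for (Path match : matches) {
                collectFiles(match, paths);
            }
        }
        return paths;
    }
}
```

Calling expandGlob(dir, "*") on a directory that holds both a file and a
subdirectory should return the top-level file as well as the files under the
subdirectory, which is the behavior this issue expects from AvroStorage.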