FileInputFormat to support multi-level/recursive directory listing
------------------------------------------------------------------
                 Key: MAPREDUCE-1501
                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1501
             Project: Hadoop Map/Reduce
          Issue Type: Improvement
            Reporter: Zheng Shao

As we have seen multiple times on the mailing list, users want to be able to get all files out of a multi-level directory structure:

4/1/2008: http://mail-archives.apache.org/mod_mbox/hadoop-core-user/200804.mbox/%3ce75c02ef0804011433x144813e6x2450da7883de3...@mail.gmail.com%3e
2/3/2009: http://mail-archives.apache.org/mod_mbox/hadoop-core-user/200902.mbox/%3c7f80089c-3e7f-4330-90ba-6f1c5b0b0...@nist.gov%3e
6/2/2009: http://mail-archives.apache.org/mod_mbox/hadoop-common-user/200906.mbox/%3c4a258a16.8050...@darose.net%3e

One solution our users have adopted is to write a new FileInputFormat, but that means every existing FileInputFormat subclass has to be changed in order to support this feature.

We can easily provide a JobConf option (defaulting to false) that makes {{FileInputFormat.listStatus(...)}} recurse into the directory structure.
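To make the proposal concrete, here is a minimal sketch of the listing logic with an opt-in recursive flag. It is not Hadoop code: it uses java.nio.file instead of Hadoop's FileSystem so that it runs standalone, and the class name, the method signature, and the DEFAULT_RECURSIVE constant are all hypothetical stand-ins for whatever the JobConf option ends up being. Skipping subdirectories when the flag is off is also an assumption of this sketch, not a statement about current behavior.

```java
import java.io.IOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

class RecursiveListingSketch {

    // Hypothetical stand-in for the proposed JobConf option. It defaults to
    // false so that existing jobs keep their single-level behavior.
    static final boolean DEFAULT_RECURSIVE = false;

    /** Lists the files under each input directory, descending only if recursive is true. */
    static List<Path> listStatus(List<Path> inputDirs, boolean recursive) throws IOException {
        List<Path> result = new ArrayList<>();
        for (Path dir : inputDirs) {
            addInputPaths(dir, recursive, result);
        }
        Collections.sort(result); // deterministic order for the example
        return result;
    }

    private static void addInputPaths(Path dir, boolean recursive, List<Path> result)
            throws IOException {
        try (DirectoryStream<Path> children = Files.newDirectoryStream(dir)) {
            for (Path child : children) {
                if (Files.isDirectory(child)) {
                    // Assumption of this sketch: subdirectories are skipped
                    // unless the recursive flag is set.
                    if (recursive) {
                        addInputPaths(child, true, result);
                    }
                } else {
                    result.add(child);
                }
            }
        }
    }

    public static void main(String[] args) throws IOException {
        // Build a small two-level input tree in a temporary directory.
        Path root = Files.createTempDirectory("input");
        Files.createDirectories(root.resolve("2009/06"));
        Files.createFile(root.resolve("top.txt"));
        Files.createFile(root.resolve("2009/06/part-00000"));

        List<Path> roots = Collections.singletonList(root);
        System.out.println("flat:      " + listStatus(roots, DEFAULT_RECURSIVE));
        System.out.println("recursive: " + listStatus(roots, true));
    }
}
```

Because the flag is checked in the shared listing routine rather than in a new input format, subclasses that inherit the listing would pick up the behavior without being changed, which is the point of the proposal.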