Huy, I think this question probably belongs in the Hadoop mailing list over the Pig mailing list. However, I think you're looking for http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/fs/FileStatus.html A FileStatus object can be acquired from a FileSystem object by calling the .getFileStatus(Path path) method.
http://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/fs/FileSystem.html https://hadoop.apache.org/docs/r1.0.4/api/org/apache/hadoop/fs/Path.html Hope this helps. On Tue, Jul 23, 2013 at 2:05 PM, Huy Pham <[email protected]> wrote: > Hi All, > Do any of you have or can refer me to some sample Java code that get > the tree structure of a HDFS directory, similar to the file system? > For example: I have a HDFS dir, called /data, inside data, there is > /data/valid and /data/invalid, and so on, so I would need to be able to get > the whole tree structure of that and know which is is a dir, which one is a > file. Both program and HDFS are LOCAL. > In other words, what I look for is something similar to File class in > Java, which has isDirectory() and list() to list all the children (files > and dirs) of a dir. Found something in stackoverflow but it does not work. > Thanks > Huy > > >
