Re: Check compression codec of an HDFS file

2013-12-05 Thread alex bohr
The SequenceFile.Reader will work perfectly! (I should have seen that.) As always, thanks Harsh.

On Thu, Dec 5, 2013 at 2:22 AM, Harsh J wrote:
> If you're looking for file header/contents based inspection, you could
> download the file and run the Linux utility 'file' on the file, and it
> sho
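
For readers wondering why SequenceFile.Reader can answer the question: a SequenceFile records its codec in the file header, so the reader can report it without consulting the filename. The sketch below is only an illustration using the plain JDK, not Hadoop's own code. It assumes the header layout written by recent SequenceFile versions (the bytes 'S', 'E', 'Q', a version byte, the key and value class names, two boolean flags, then the codec class name when compression is on) and that every class name is short enough for a single-byte length prefix. The class name `SeqHeaderPeek` and its methods are hypothetical. On a real cluster, opening the file with SequenceFile.Reader and asking it for the codec is the supported route.

```java
import java.io.DataInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper: peeks at a SequenceFile header with the plain JDK.
class SeqHeaderPeek {

    // Reads a length-prefixed UTF-8 string. Assumption: the length fits in
    // one non-negative byte, which holds for typical class names.
    static String readShortString(DataInputStream in) throws IOException {
        int len = in.readByte();
        if (len < 0) {
            throw new IOException("multi-byte length prefix not handled in this sketch");
        }
        byte[] buf = new byte[len];
        in.readFully(buf);
        return new String(buf, StandardCharsets.UTF_8);
    }

    // Returns the codec class name from the header, or null if the header
    // says the file is not compressed.
    static String codecClassName(InputStream raw) throws IOException {
        DataInputStream in = new DataInputStream(raw);
        byte[] magic = new byte[4];
        in.readFully(magic);
        if (magic[0] != 'S' || magic[1] != 'E' || magic[2] != 'Q') {
            throw new IOException("not a SequenceFile");
        }
        if (magic[3] < 5) {
            // Assumption: older header versions do not store a codec name.
            throw new IOException("header version too old for this sketch");
        }
        readShortString(in);                    // key class name (ignored)
        readShortString(in);                    // value class name (ignored)
        boolean compressed = in.readBoolean();  // record or block compression on?
        in.readBoolean();                       // block-compression flag (ignored)
        return compressed ? readShortString(in) : null;
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java SeqHeaderPeek <local-copy-of-file>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(codecClassName(in));
        }
    }
}
```

Run against a local copy of the file, this prints the fully qualified codec class name, or `null` when the header reports no compression.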

Re: Check compression codec of an HDFS file

2013-12-05 Thread Harsh J
If you're looking for file header/contents based inspection, you could download the file and run the Linux utility 'file' on the file, and it should tell you the format. I don't know about Snappy (AFAIK, we don't have Snappy frame/container format support in Hadoop yet, although upstream Snappy
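
The header-based inspection described above can also be approximated in a few lines of Java. This is a minimal sketch under stated assumptions, not what the 'file' utility actually does: it checks only for the gzip magic bytes (0x1F 0x8B) and the 'SEQ' prefix of a SequenceFile, and the class name `MagicSniffer` is hypothetical. In line with the caveat about Snappy, an "unknown" result does not mean the file is uncompressed, since a compressed stream without a container format has no signature to look for.

```java
import java.io.IOException;
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.Arrays;

// Hypothetical helper: guesses a format from the leading bytes of a file.
class MagicSniffer {

    static String guessFormat(byte[] head) {
        // gzip streams start with the two bytes 0x1F 0x8B.
        if (head.length >= 2 && (head[0] & 0xFF) == 0x1F && (head[1] & 0xFF) == 0x8B) {
            return "gzip";
        }
        // Hadoop SequenceFiles start with the ASCII bytes 'S', 'E', 'Q'.
        if (head.length >= 3 && head[0] == 'S' && head[1] == 'E' && head[2] == 'Q') {
            return "sequencefile";
        }
        // No recognised signature; says nothing about whether data is compressed.
        return "unknown";
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java MagicSniffer <local-copy-of-file>");
            return;
        }
        byte[] head = new byte[4];
        int n;
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            // A single read is enough for a few bytes of a local file;
            // an empty file yields -1, which is clamped to 0 here.
            n = Math.max(0, in.read(head));
        }
        System.out.println(guessFormat(Arrays.copyOf(head, n)));
    }
}
```

It is meant to be run on a downloaded copy of the file, in the same way as the 'file' utility.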

Check compression codec of an HDFS file

2013-12-04 Thread alex bohr
What's the best way to check the compression codec that an HDFS file was written with? We use both Gzip and Snappy compression so I want a way to determine how a specific file is compressed. The closest I found is the *getCodec