Re: Sequence File usage queries

2011-03-07 Thread David Rosenstrauch
On 02/23/2011 07:24 PM, Mapred Learn wrote: Thanks ! In this case, how can we print the metadata associated with the data (sequence files), if user accessing this data wants to know it: i) Is there any hadoop command that can do it ? ii) Or we will have to provide some interface to the user to s

Re: Sequence File usage queries

2011-02-23 Thread Ted Yu
I didn't find SequenceFile metadata viewer. You need to write some code for #2 below. On Wed, Feb 23, 2011 at 4:24 PM, Mapred Learn wrote: > Thanks ! > > In this case, how can we print the metadata associated with the data > (sequence files), if user accessing this data wants to know it: > i) Is

Re: Sequence File usage queries

2011-02-23 Thread Mapred Learn
Thanks ! In this case, how can we print the metadata associated with the data (sequence files), if user accessing this data wants to know it: i) Is there any hadoop command that can do it ? ii) Or we will have to provide some interface to the user to see the metadata ? -JJ On Sat, Feb 19, 2011 a

Re: Sequence File usage queries

2011-02-19 Thread Ted Yu
Option 2 is better. Please see this in SequenceFile: public static Writer createWriter(FileSystem fs, Configuration conf, Path name, Class keyClass, Class valClass, int bufferSize, short replication, long blockSize, CompressionType compressio

Sequence File usage queries

2011-02-17 Thread Mapred Learn
Hi, I have a use case to upload some tera-bytes of text files as sequences files on HDFS. These text files have several layouts ranging from 32 to 62 columns (metadata). What would be a good way to upload these files along with their metadata: i) creating a key, value class per text file layout