Avro schema

2013-08-01 Thread Lior Schachter
Hi all, When writing Avro schema to the a data file, what will be the expected behavior if the file is used as M/R input. How does the second/third/... splits get the schema (the schema is always written to the first split) ? Thanks, Lior

Re: Mapper not called

2013-08-01 Thread Harsh J
I've often found the issue behind such an observance to be that the input files lack an .avro extension. Is that true in your case? Can you retry after a rename if yes? On Wed, Jul 31, 2013 at 1:02 AM, Anna Lahoud annalah...@gmail.com wrote: I am following directions on

Re: Avro schema

2013-08-01 Thread Harsh J
We read it from the top of the file at start (just the schema bytes) and then initialize the reader. On Thu, Aug 1, 2013 at 8:32 PM, Lior Schachter lior...@gmail.com wrote: Hi all, When writing Avro schema to the a data file, what will be the expected behavior if the file is used as M/R

Re: Avro schema

2013-08-01 Thread Harsh J
Yes, we seek to 0 and we read the header then seek back to the split offset. On Aug 1, 2013 11:16 PM, Lior Schachter lior...@gmail.com wrote: Hi Harsh, So for each split you first read the header of the file directly from HDFS ? Thanks, Lior On Thu, Aug 1, 2013 at 7:36 PM, Harsh J

Avro fIle Header

2013-08-01 Thread Hans Uhlig
Is there a good way to read the schema out of an AVRO data file. I want to write a few generic tools for handling avro files but I dont want to depend on external schema files.