Hi all,
When writing Avro schema to the a data file, what will be the expected
behavior if the file is used as M/R input. How does the second/third/...
splits get the schema (the schema is always written to the first split) ?
Thanks,
Lior
I've often found the issue behind such an observance to be that the
input files lack an .avro extension. Is that true in your case? Can
you retry after a rename if yes?
On Wed, Jul 31, 2013 at 1:02 AM, Anna Lahoud annalah...@gmail.com wrote:
I am following directions on
We read it from the top of the file at start (just the schema bytes)
and then initialize the reader.
On Thu, Aug 1, 2013 at 8:32 PM, Lior Schachter lior...@gmail.com wrote:
Hi all,
When writing Avro schema to the a data file, what will be the expected
behavior if the file is used as M/R
Yes, we seek to 0 and we read the header then seek back to the split offset.
On Aug 1, 2013 11:16 PM, Lior Schachter lior...@gmail.com wrote:
Hi Harsh,
So for each split you first read the header of the file directly from HDFS
?
Thanks,
Lior
On Thu, Aug 1, 2013 at 7:36 PM, Harsh J
Is there a good way to read the schema out of an AVRO data file. I want to
write a few generic tools for handling avro files but I dont want to depend
on external schema files.