Hey Harsh, thanks. I can confirm that the first one works. Let me try the second one.
Tomas On Sun, Feb 16, 2014 at 8:07 AM, Harsh J <ha...@cloudera.com> wrote: > Hi, > > For (1) I believe you could do a "Schema.parse meta['avro.schema']" to > obtain the schema as an object from the meta entry of the file. > > For (2), as defined in the spec at > http://avro.apache.org/docs/current/spec.html#Object+Container+Files, > since the schema is stored only in the header of the file, using a > simple initialised reader will be efficient in reading just that. The > file's data blocks are read only upon enumerating over the reader. > > On Sun, Feb 16, 2014 at 4:52 AM, Tomas Svarovsky > <svarovsky.to...@gmail.com> wrote: > > Hey, > > > > I wanted to ask couple of questions. > > > > 1) Let's assume I have 2 avro files. I would like to grab schemas of > both. > > Compare them and decide what to do. The only way I found to get to the > > schema in a reader is through > > > > dr = Avro::DataFile::Reader.new(file, Avro::IO::DatumReader.new) > > dr.meta > > > > and that is still stringified JSON. Is this the only way or even is this > use > > case something supported or should I do it differently? > > > > 2) Also is ti possible to read just the schema? Sometimes it is useful to > > look at a file without actually reading the whole file let's say from s3. > > > > Regards Tomas > > > > -- > Harsh J >