Out of curiosity, is there a way to write output from a Crunch pipeline
into an Avro-format file? It seems that if you call
collection.write(To.avroFile(path)), you end up writing just JSON. It can
certainly be read into an Avro object, but it seems like it would be more
efficient to write binary data to the file, so no parsing has to happen.

Have I missed an API, or is this a missing feature?

Thanks,
Natty
