Moving this to the user@avro lists. Please use the right lists for the best answers and the right people.
I'd pick Avro out of the two - it is very well designed for typed data and has a very good implementation of the serializer/deserializer, aside of the schema advantages. FWIW, Avro has a tojson CLI tool to dump Avro binary format out as JSON structures, which would be of help if you seek readability and/or integration with apps/systems that already depend on JSON. On Sun, Aug 12, 2012 at 10:41 PM, Mohit Anchlia <mohitanch...@gmail.com> wrote: > We get data in Json format. I was initially thinking of simply storing Json > in hdfs for processing. I see there is Avro that does the similar thing but > most likely stores it in more optimized format. I wanted to get users > opinion on which one is better. -- Harsh J