Re: Papers discussing Apache Avro

2022-01-14 Thread Ryan Skraba
This is really cool news -- it's always really interesting to see
benchmark studies and the trade-offs we make while choosing different
formats.  Thanks for sharing!

I'd love to see links to some curated articles and papers on the
website!  I created AVRO-3308 if you don't object :D

Ryan

On Fri, Jan 14, 2022 at 10:49 AM Martin Grigorov  wrote:
>
> Hi Juan,
>
> Thank you for sharing your work with us!
>
> It comes right in time for me!
> I am working on the interop tests for the new Rust module and it seems there 
> is some problem to read the .avro files generated by Java. I may need to dive 
> in the binary diffs.
>
> Regards,
> Martin
>
> On Thu, Jan 13, 2022 at 11:14 PM Juan Cruz Viotti  wrote:
>>
>> Hey there!
>>
>> As part of my MSc dissertation at University of Oxford, I wrote and
>> published two papers covering the characteristics of various binary
>> serialization formats, including Apache Avro and performing a
>> space-efficiency benchmark, respectively.
>>
>> Sharing them here in case anybody finds them interesting! The first
>> paper explains how Apache Avro works including an annotated hexadecimal
>> example and the second compares Apache Avro to various other popular
>> serialization formats.
>>
>> - A Survey of JSON-compatible Binary Serialization Specifications:
>>   https://arxiv.org/abs/2201.02089
>> - A Benchmark of JSON-compatible Binary Serialization Specifications:
>>   https://arxiv.org/abs/2201.03051
>>
>> The benchmark study has proved Apache Avro to be one of the most
>> space-efficient formats considered.
>>
>> All the best!
>>
>> --
>> Juan Cruz Viotti
>> Technical Lead @ Postman.com
>> https://www.jviotti.com


Re: Papers discussing Apache Avro

2022-01-14 Thread Martin Grigorov
Hi Juan,

Thank you for sharing your work with us!

It comes right in time for me!
I am working on the interop tests for the new Rust module and it seems
there is some problem to read the .avro files generated by Java. I may need
to dive in the binary diffs.

Regards,
Martin

On Thu, Jan 13, 2022 at 11:14 PM Juan Cruz Viotti  wrote:

> Hey there!
>
> As part of my MSc dissertation at University of Oxford, I wrote and
> published two papers covering the characteristics of various binary
> serialization formats, including Apache Avro and performing a
> space-efficiency benchmark, respectively.
>
> Sharing them here in case anybody finds them interesting! The first
> paper explains how Apache Avro works including an annotated hexadecimal
> example and the second compares Apache Avro to various other popular
> serialization formats.
>
> - A Survey of JSON-compatible Binary Serialization Specifications:
>   https://arxiv.org/abs/2201.02089
> - A Benchmark of JSON-compatible Binary Serialization Specifications:
>   https://arxiv.org/abs/2201.03051
>
> The benchmark study has proved Apache Avro to be one of the most
> space-efficient formats considered.
>
> All the best!
>
> --
> Juan Cruz Viotti
> Technical Lead @ Postman.com
> https://www.jviotti.com
>


Papers discussing Apache Avro

2022-01-13 Thread Juan Cruz Viotti
Hey there!

As part of my MSc dissertation at University of Oxford, I wrote and
published two papers covering the characteristics of various binary
serialization formats, including Apache Avro and performing a
space-efficiency benchmark, respectively.

Sharing them here in case anybody finds them interesting! The first
paper explains how Apache Avro works including an annotated hexadecimal
example and the second compares Apache Avro to various other popular
serialization formats.

- A Survey of JSON-compatible Binary Serialization Specifications:
  https://arxiv.org/abs/2201.02089
- A Benchmark of JSON-compatible Binary Serialization Specifications:
  https://arxiv.org/abs/2201.03051

The benchmark study has proved Apache Avro to be one of the most
space-efficient formats considered.

All the best!

-- 
Juan Cruz Viotti
Technical Lead @ Postman.com
https://www.jviotti.com