Hello Gang,

I've recently had some space to look at Avro again recently (I enjoy
contributing to something that has such a wide industry impact).

In thinking about the block format of Avro, it currently stores Metadata
about the number of records in each block. I'm performing a thought
exercise of replacing the count field with a map and allowing for a more
generic set of metadata. In particular, would want to add better scan
support: Bloom filters, min, max values.

Making this backwards compatible looks hard at first, but does anyone in
the community see value here?


Thanks.

Reply via email to