[ 
https://issues.apache.org/jira/browse/AVRO-806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13058098#comment-13058098
 ] 

Doug Cutting commented on AVRO-806:
-----------------------------------

> I think we should make unions columnar as well.

That would be nice, but I'd rather we have something useful sooner than 
something perfect later.  We can extend it later in a backward-compatible 
manner.  It would not be forward compatible, but that might be acceptable as 
long as there's only a single implementation (Java).

> If we want to avoid decompressing columns that are not accessed [ ... ]

I think the advantage of a columnar format is to avoid touching data that's not 
needed, and avoiding decompression is consistent with that.

> add a column-major codec for data files
> ---------------------------------------
>
>                 Key: AVRO-806
>                 URL: https://issues.apache.org/jira/browse/AVRO-806
>             Project: Avro
>          Issue Type: New Feature
>          Components: java, spec
>            Reporter: Doug Cutting
>            Assignee: Doug Cutting
>         Attachments: AVRO-806-v2.patch, AVRO-806.patch, avro-file-columnar.pdf
>
>
> Define a codec that, when a data file's schema is a record schema, writes 
> blocks within the file in column-major order.  This would permit better 
> compression and also permit efficient skipping of fields that are not of 
> interest.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to