[ https://issues.apache.org/jira/browse/HIVE-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Krishna Kumar updated HIVE-2600: -------------------------------- Attachment: HIVE-2600.v1.patch Added logging to report every column's total lengths. > Enable/Add type-specific compression for rcfile > ----------------------------------------------- > > Key: HIVE-2600 > URL: https://issues.apache.org/jira/browse/HIVE-2600 > Project: Hive > Issue Type: Sub-task > Components: Query Processor, Serializers/Deserializers > Reporter: Krishna Kumar > Assignee: Krishna Kumar > Priority: Minor > Attachments: HIVE-2600.v0.patch, HIVE-2600.v1.patch > > > Enable schema-aware compression codecs which can perform type-specific > compression on a per-column basis. I see this as in three-parts > 1. Add interfaces for the rcfile to communicate column information to the > codec > 2. Add an "uber compressor" which can perform column-specific compression on > a per-block basis. Initially, this can be config driven, but we can go for a > dynamic implementation later. > 3. A bunch of type-specific compressors > This jira is for the first part of the effort. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira