[ 
https://issues.apache.org/jira/browse/HIVE-4123?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13734231#comment-13734231
 ] 

Prasanth J commented on HIVE-4123:
----------------------------------

Thanks for the review, Owen.

I have addressed the following issues with this patch:
- Date type handled for new encoding
- Better encoding check added by overriding checkEncoding() for valid types
- Created factories for reader and writer creation
- Indentation fix
- DIRECT_V2 encoding can be turned on or off with the 
"hive.exec.orc.write.format" configuration parameter. If the value is "0.11", 
the old RLE encoding is used; if the parameter is undefined or has any other 
value, the new RLE encoding is used.
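
For illustration only, the selection rule in the last item could be sketched as below. This is not the code in the patch: the class, enum, and method names are hypothetical, and a plain Map stands in for the Hive configuration object. Only the parameter name and the "0.11" value come from the description above.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch of the rule described above; not the patch's code.
class OrcWriteFormatSketch {
    // Parameter name as given in the comment.
    static final String WRITE_FORMAT_KEY = "hive.exec.orc.write.format";

    // Illustrative labels for the two integer encodings.
    enum IntegerEncoding { OLD_RLE, DIRECT_V2 }

    // "0.11" selects the old RLE encoding; an undefined parameter or any
    // other value selects the new one.
    static IntegerEncoding chooseEncoding(Map<String, String> conf) {
        String value = conf.get(WRITE_FORMAT_KEY);
        if ("0.11".equals(value)) {
            return IntegerEncoding.OLD_RLE;
        }
        return IntegerEncoding.DIRECT_V2;
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println(chooseEncoding(conf)); // prints DIRECT_V2
        conf.put(WRITE_FORMAT_KEY, "0.11");
        System.out.println(chooseEncoding(conf)); // prints OLD_RLE
    }
}
```

Comparing with "0.11".equals(value) keeps the undefined case (a null value) on the new-encoding path without a separate null check.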

Also, this patch affects the HIVE-4324 patch, so I generated this new patch 
on top of HIVE-4324.
                
> The RLE encoding for ORC can be improved
> ----------------------------------------
>
>                 Key: HIVE-4123
>                 URL: https://issues.apache.org/jira/browse/HIVE-4123
>             Project: Hive
>          Issue Type: New Feature
>          Components: File Formats
>    Affects Versions: 0.12.0
>            Reporter: Owen O'Malley
>            Assignee: Prasanth J
>              Labels: orcfile
>             Fix For: 0.12.0
>
>         Attachments: HIVE-4123.1.git.patch.txt, HIVE-4123.2.git.patch.txt, 
> HIVE-4123.3.patch.txt, HIVE-4123.4.patch.txt, HIVE-4123.5.txt, 
> HIVE-4123.6.txt, ORC-Compression-Ratio-Comparison.xlsx
>
>
> The run length encoding of integers can be improved:
> * tighter bit packing
> * allow delta encoding
> * allow longer runs
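
To show why delta encoding and tighter bit packing from the issue description above work well together, here is a minimal sketch. It is not the ORC on-disk layout or the patch's implementation: the class and method names are hypothetical, and it assumes non-negative values in non-decreasing order so that deltas need no sign handling.

```java
// Hypothetical sketch: compares the bit width needed for raw values with
// the bit width needed for their deltas. Assumes non-negative, sorted input.
class DeltaBitWidthSketch {
    // Bits needed to represent a non-negative value (at least 1).
    static int bitsNeeded(long value) {
        return Math.max(1, 64 - Long.numberOfLeadingZeros(value));
    }

    // Differences between adjacent values; length is values.length - 1.
    static long[] deltas(long[] values) {
        long[] out = new long[Math.max(0, values.length - 1)];
        for (int i = 1; i < values.length; i++) {
            out[i - 1] = values[i] - values[i - 1];
        }
        return out;
    }

    // Widest bit width required by any element.
    static int maxBits(long[] values) {
        int bits = 1;
        for (long v : values) {
            bits = Math.max(bits, bitsNeeded(v));
        }
        return bits;
    }

    public static void main(String[] args) {
        long[] values = {1000, 1003, 1007, 1012};
        System.out.println(maxBits(values));         // prints 10
        System.out.println(maxBits(deltas(values))); // prints 3
    }
}
```

In this example each raw value needs 10 bits, while each delta fits in 3 bits once the first value is stored as a base.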

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators.
For more information on JIRA, see: http://www.atlassian.com/software/jira
