Owen O'Malley created HIVE-4244:
-----------------------------------

             Summary: Make string dictionaries adaptive in ORC
                 Key: HIVE-4244
                 URL: https://issues.apache.org/jira/browse/HIVE-4244
             Project: Hive
          Issue Type: Bug
          Components: Serializers/Deserializers
            Reporter: Owen O'Malley
            Assignee: Owen O'Malley


The ORC writer should adaptively switch between dictionary and direct encoding
for string columns. I propose looking at the first 100,000 values in each
column and deciding whether the dictionary is loaded heavily enough (that is,
whether values repeat often enough) to justify dictionary encoding.
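
A minimal sketch of the proposed check, assuming "loading" is measured as the
ratio of distinct values to sampled values. The class EncodingChooser, the
method choose, and the 0.8 threshold are hypothetical names and values chosen
for illustration; they are not taken from the ORC writer. Only the 100,000
sample size comes from the proposal above.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

/** Hypothetical illustration only; not the ORC writer's actual code. */
final class EncodingChooser {
    enum Encoding { DICTIONARY, DIRECT }

    /** Sample size taken from the proposal in this issue. */
    static final int SAMPLE_SIZE = 100_000;

    /** Assumed cutoff: above this distinct/total ratio, the dictionary is not worth it. */
    static final double MAX_DISTINCT_RATIO = 0.8;

    /**
     * Looks at up to sampleSize leading values of a column and picks an encoding.
     * A low distinct/total ratio means values repeat a lot, so a dictionary pays off.
     */
    static Encoding choose(List<String> values, int sampleSize, double maxDistinctRatio) {
        Set<String> distinct = new HashSet<>();
        int seen = 0;
        for (String value : values) {
            if (seen == sampleSize) {
                break; // only the first sampleSize values are inspected
            }
            distinct.add(value);
            seen++;
        }
        if (seen == 0) {
            // Assumption: with no data there is nothing to deduplicate, so use direct.
            return Encoding.DIRECT;
        }
        double ratio = (double) distinct.size() / seen;
        return ratio <= maxDistinctRatio ? Encoding.DICTIONARY : Encoding.DIRECT;
    }
}
```

With these assumed settings, a column that repeats a handful of strings would
get Encoding.DICTIONARY, while a column of mostly unique strings would get
Encoding.DIRECT. A real writer would also have to buffer the sampled values
until the decision is made, which this sketch does not show.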

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira
