[ 
https://issues.apache.org/jira/browse/ORC-373?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16498823#comment-16498823
 ] 

ASF GitHub Bot commented on ORC-373:
------------------------------------

GitHub user prasanthj opened a pull request:

    https://github.com/apache/orc/pull/276

    ORC-373: Option to disable dictionary encoding

    Disables dictionary encoding when the string tree writer is initialized.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/prasanthj/orc master

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/orc/pull/276.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #276
    
----
commit 90d83a11f1411df52712c267ea4949c4271c9828
Author: Prasanth Jayachandran <prasanthj@...>
Date:   2018-06-02T02:24:22Z

    ORC-373: Option to disable dictionary encoding

----


> Option to disable dictionary encoding 
> --------------------------------------
>
>                 Key: ORC-373
>                 URL: https://issues.apache.org/jira/browse/ORC-373
>             Project: ORC
>          Issue Type: Bug
>    Affects Versions: 1.5.2
>            Reporter: Prasanth Jayachandran
>            Assignee: Prasanth Jayachandran
>            Priority: Major
>
> Currently the dictionary check happens after the first row group entry is 
> created. Even when row indexes are disabled, rows go into the red-black tree 
> first and are only flushed to the direct stream when the stripe is written.
> If the dictionary threshold is set to <= 0.0 (dictionary disabled), we should 
> write directly to the stream instead of the red-black tree. This is useful for 
> Hive streaming ingest, where delta files explicitly disable dictionaries.
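
The behaviour requested above can be illustrated with a minimal Java sketch. This is not ORC's actual string tree writer: the class StringColumnSketch, its methods, and the ratio check at flush time are hypothetical names and simplifications chosen for illustration. A java.util.TreeMap (a red-black tree) stands in for the writer's dictionary. The point it shows is that the direct-versus-dictionary decision is made once, at construction, so a threshold <= 0.0 means values never enter the tree.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.TreeMap;

/** Illustrative sketch only; a hypothetical stand-in, not ORC's writer. */
final class StringColumnSketch {
    private final double threshold;
    // Decided once at initialization, as the issue proposes.
    private final boolean useDictionary;
    // TreeMap is a red-black tree; it stands in for the dictionary.
    private final TreeMap<String, Integer> dictionary = new TreeMap<>();
    private final List<String> directStream = new ArrayList<>();
    private int bufferedRows = 0;

    StringColumnSketch(double dictionaryKeyThreshold) {
        this.threshold = dictionaryKeyThreshold;
        this.useDictionary = dictionaryKeyThreshold > 0.0;
    }

    void write(String value) {
        if (useDictionary) {
            // Buffer in the tree until the stripe is flushed.
            dictionary.merge(value, 1, Integer::sum);
            bufferedRows++;
        } else {
            // Threshold <= 0.0: bypass the tree entirely.
            directStream.add(value);
        }
    }

    /** Returns the encoding this simplified model would pick for the stripe. */
    String flushStripe() {
        if (!useDictionary) {
            return "DIRECT";
        }
        // Assumed rule for this sketch: keep the dictionary only when the
        // share of distinct keys is at or below the threshold.
        double ratio = bufferedRows == 0 ? 0.0 : (double) dictionary.size() / bufferedRows;
        String encoding = ratio <= threshold ? "DICTIONARY" : "DIRECT";
        dictionary.clear();
        bufferedRows = 0;
        return encoding;
    }

    int treeSize() { return dictionary.size(); }

    int directCount() { return directStream.size(); }
}
```

With new StringColumnSketch(0.0), every write() goes straight to the direct list and treeSize() stays at 0. With a positive threshold, values accumulate in the tree until flushStripe() is called.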



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
