[jira] [Work logged] (HIVE-26809) Upgrade ORC to 1.8.1

ASF GitHub Bot (Jira) Tue, 17 Jan 2023 09:46:04 -0800


     [ 
https://issues.apache.org/jira/browse/HIVE-26809?focusedWorklogId=839725&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-839725
 ]


ASF GitHub Bot logged work on HIVE-26809:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 17/Jan/23 17:44
            Start Date: 17/Jan/23 17:44
    Worklog Time Spent: 10m 
      Work Description: difin commented on code in PR #3833:
URL: https://github.com/apache/hive/pull/3833#discussion_r1072298943


##########
ql/src/java/org/apache/hadoop/hive/ql/io/orc/encoded/EncodedTreeReaderFactory.java:
##########
@@ -224,7 +237,252 @@ private static void skipCompressedIndex(boolean isCompressed, PositionProvider i
     index.getNext();
   }
 
-  protected static class StringStreamReader extends StringTreeReader
+  public static class StringDictionaryTreeReaderHive extends TreeReader {

Review Comment:
   Hi @ayushtkn, I agree with you: this is not an ideal approach. Before 
implementing it I tried to adapt Hive, but I could not find a way to adapt 
Hive to the ORC-1060 changes, because those changes are inside the internal 
implementation of ORC's StringDictionaryTreeReader class. 
   
   I also agree that this approach will backfire in the future, when we try to 
upgrade and the changes in ORC depend on the ones we ditched. However, Hive 
already depends heavily on internal ORC APIs, because it implements its own 
column readers on top of ORC, and upgrading to a different ORC version often 
requires adaptations in Hive.





Issue Time Tracking
-------------------

    Worklog Id:     (was: 839725)
    Time Spent: 5h 20m  (was: 5h 10m)

> Upgrade ORC to 1.8.1
> --------------------
>
>                 Key: HIVE-26809
>                 URL: https://issues.apache.org/jira/browse/HIVE-26809
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 4.0.0
>            Reporter: Dmitriy Fingerman
>            Assignee: Dmitriy Fingerman
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 5h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
