[ 
https://issues.apache.org/jira/browse/HIVE-26809?focusedWorklogId=831580&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-831580
 ]

ASF GitHub Bot logged work on HIVE-26809:
-----------------------------------------

                Author: ASF GitHub Bot
            Created on: 06/Dec/22 22:28
            Start Date: 06/Dec/22 22:28
    Worklog Time Spent: 10m 
      Work Description: difin commented on PR #3833:
URL: https://github.com/apache/hive/pull/3833#issuecomment-1340096817

   > Hello @difin. Thank you for the patch. This looks like a good idea to try to complete before GA of Hive 4.0.
   > 
   > I see Apache ORC has just released version 1.8.1. Can we use that, so Hive gets on the latest release?
   > 
   > There are currently numerous test failures in CI, like this one:
   > 
   > http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-3833/1/tests
   > 
   > I noticed a lot of `ArrayIndexOutOfBoundsException`, like this:
   > 
   > ```
   > Caused by: java.lang.ArrayIndexOutOfBoundsException
   >    at java.lang.System.arraycopy(Native Method)
   >    at org.apache.orc.impl.TreeReaderFactory$StringDictionaryTreeReader.readDictionaryStream(TreeReaderFactory.java:2242)
   >    at org.apache.orc.impl.TreeReaderFactory$StringDictionaryTreeReader.nextVector(TreeReaderFactory.java:2283)
   >    at org.apache.orc.impl.TreeReaderFactory$StringTreeReader.nextVector(TreeReaderFactory.java:1963)
   >    at org.apache.hadoop.hive.ql.io.orc.encoded.EncodedTreeReaderFactory$StringStreamReader.nextVector(EncodedTreeReaderFactory.java:313)
   >    at org.apache.hadoop.hive.llap.io.decode.OrcEncodedDataConsumer.decodeBatch(OrcEncodedDataConsumer.java:196)
   >    at org.apache.hadoop.hive.llap.io.decode.OrcEncodedDataConsumer.decodeBatch(OrcEncodedDataConsumer.java:66)
   >    at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer.consumeData(EncodedDataConsumer.java:122)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.sendEcbToConsumer(SerDeEncodedDataReader.java:1687)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.processOneSlice(SerDeEncodedDataReader.java:1059)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.processOneFileSplit(SerDeEncodedDataReader.java:908)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.readFileWithCache(SerDeEncodedDataReader.java:859)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.performDataRead(SerDeEncodedDataReader.java:731)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader$5.run(SerDeEncodedDataReader.java:278)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader$5.run(SerDeEncodedDataReader.java:275)
   >    at java.security.AccessController.doPrivileged(Native Method)
   >    at javax.security.auth.Subject.doAs(Subject.java:422)
   >    at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1878)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.callInternal(SerDeEncodedDataReader.java:275)
   >    at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.callInternal(SerDeEncodedDataReader.java:115)
   >    at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36)
   >    at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer$CpuRecordingCallable.call(EncodedDataConsumer.java:88)
   >    at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer$CpuRecordingCallable.call(EncodedDataConsumer.java:73)
   >    ... 5 more
   > ```
   > 
   > Can you please investigate?
   
   Hi @cnauroth, thank you for your comments. I am investigating the CI errors and will upgrade to ORC 1.8.1 too.




Issue Time Tracking
-------------------

    Worklog Id:     (was: 831580)
    Time Spent: 40m  (was: 0.5h)

> Upgrade ORC to 1.8.0
> --------------------
>
>                 Key: HIVE-26809
>                 URL: https://issues.apache.org/jira/browse/HIVE-26809
>             Project: Hive
>          Issue Type: Improvement
>    Affects Versions: 4.0.0
>            Reporter: Dmitriy Fingerman
>            Assignee: Dmitriy Fingerman
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
