[ https://issues.apache.org/jira/browse/HIVE-26809?focusedWorklogId=831580&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-831580 ]
ASF GitHub Bot logged work on HIVE-26809: ----------------------------------------- Author: ASF GitHub Bot Created on: 06/Dec/22 22:28 Start Date: 06/Dec/22 22:28 Worklog Time Spent: 10m Work Description: difin commented on PR #3833: URL: https://github.com/apache/hive/pull/3833#issuecomment-1340096817 > Hello @difin . Thank you for the patch. This looks like a good idea to try to complete before GA of Hive 4.0. > > I see Apache ORC has just released version 1.8.1. Can we use that, so Hive gets on the latest release? > > There are currently numerous test failures in CI, like this one: > > http://ci.hive.apache.org/blue/organizations/jenkins/hive-precommit/detail/PR-3833/1/tests > > I noticed a lot of `ArrayIndexOutOfBoundsException`, like this: > > ``` > Caused by: java.lang.ArrayIndexOutOfBoundsException > at java.lang.System.arraycopy(Native Method) > at org.apache.orc.impl.TreeReaderFactory$StringDictionaryTreeReader.readDictionaryStream(TreeReaderFactory.java:2242) > at org.apache.orc.impl.TreeReaderFactory$StringDictionaryTreeReader.nextVector(TreeReaderFactory.java:2283) > at org.apache.orc.impl.TreeReaderFactory$StringTreeReader.nextVector(TreeReaderFactory.java:1963) > at org.apache.hadoop.hive.ql.io.orc.encoded.EncodedTreeReaderFactory$StringStreamReader.nextVector(EncodedTreeReaderFactory.java:313) > at org.apache.hadoop.hive.llap.io.decode.OrcEncodedDataConsumer.decodeBatch(OrcEncodedDataConsumer.java:196) > at org.apache.hadoop.hive.llap.io.decode.OrcEncodedDataConsumer.decodeBatch(OrcEncodedDataConsumer.java:66) > at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer.consumeData(EncodedDataConsumer.java:122) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.sendEcbToConsumer(SerDeEncodedDataReader.java:1687) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.processOneSlice(SerDeEncodedDataReader.java:1059) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.processOneFileSplit(SerDeEncodedDataReader.java:908) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.readFileWithCache(SerDeEncodedDataReader.java:859) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.performDataRead(SerDeEncodedDataReader.java:731) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader$5.run(SerDeEncodedDataReader.java:278) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader$5.run(SerDeEncodedDataReader.java:275) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1878) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.callInternal(SerDeEncodedDataReader.java:275) > at org.apache.hadoop.hive.llap.io.encoded.SerDeEncodedDataReader.callInternal(SerDeEncodedDataReader.java:115) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer$CpuRecordingCallable.call(EncodedDataConsumer.java:88) > at org.apache.hadoop.hive.llap.io.decode.EncodedDataConsumer$CpuRecordingCallable.call(EncodedDataConsumer.java:73) > ... 5 more > ``` > > Can you please investigate? Hi @cnauroth, thank you for your comments. I am investigating the CI errors and will upgrade to ORC 1.8.1 too. Issue Time Tracking ------------------- Worklog Id: (was: 831580) Time Spent: 40m (was: 0.5h) > Upgrade ORC to 1.8.0 > -------------------- > > Key: HIVE-26809 > URL: https://issues.apache.org/jira/browse/HIVE-26809 > Project: Hive > Issue Type: Improvement > Affects Versions: 4.0.0 > Reporter: Dmitriy Fingerman > Assignee: Dmitriy Fingerman > Priority: Major > Labels: pull-request-available > Time Spent: 40m > Remaining Estimate: 0h > -- This message was sent by Atlassian Jira (v8.20.10#820010)