[ https://issues.apache.org/jira/browse/HIVE-16761?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16054749#comment-16054749 ]
Sergey Shelukhin edited comment on HIVE-16761 at 6/19/17 9:16 PM:
------------------------------------------------------------------

After fixing HIVE-16915, the error changes to
{noformat}
java.lang.RuntimeException: java.io.IOException: java.io.IOException: java.io.IOException: cannot find dir = hdfs://.../apps/hive/warehouse/customer_accounts_orc_200/000048_0 in pathToPartitionInfo: [hdfs://.../apps/hive/warehouse/transactions_raw_orc_200/year=2016/quarter=3, hdfs://.../apps/hive/warehouse/transactions_raw_orc_200/year=2016/quarter=4, hdfs://.../apps/hive/warehouse/transactions_raw_orc_200/year=2017/quarter=2, hdfs://.../apps/hive/warehouse/transactions_raw_orc_200/year=2017/quarter=3]
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.initNextRecordReader(TezGroupedSplitsInputFormat.java:206)
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.<init>(TezGroupedSplitsInputFormat.java:145)
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat.getRecordReader(TezGroupedSplitsInputFormat.java:111)
    at org.apache.tez.mapreduce.lib.MRReaderMapred.setupOldRecordReader(MRReaderMapred.java:157)
    at org.apache.tez.mapreduce.lib.MRReaderMapred.<init>(MRReaderMapred.java:76)
    at org.apache.tez.mapreduce.input.MultiMRInput.initFromEvent(MultiMRInput.java:196)
    at org.apache.tez.mapreduce.input.MultiMRInput.handleEvents(MultiMRInput.java:154)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.handleEvent(LogicalIOProcessorRuntimeTask.java:715)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.access$600(LogicalIOProcessorRuntimeTask.java:105)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask$1.runInternal(LogicalIOProcessorRuntimeTask.java:792)
    at org.apache.tez.common.RunnableWithNdc.run(RunnableWithNdc.java:35)
    at java.lang.Thread.run(Thread.java:745)
{noformat}


was (Author: sershe):
After fixing HIVE-16915, the error changes to
{noformat}
java.lang.RuntimeException: java.io.IOException: java.io.IOException: java.io.IOException: cannot find dir = hdfs://cn105-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/customer_accounts_orc_200/000048_0 in pathToPartitionInfo: [hdfs://cn105-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/transactions_raw_orc_200/year=2016/quarter=3, hdfs://cn105-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/transactions_raw_orc_200/year=2016/quarter=4, hdfs://cn105-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/transactions_raw_orc_200/year=2017/quarter=2, hdfs://cn105-10.l42scl.hortonworks.com:8020/apps/hive/warehouse/transactions_raw_orc_200/year=2017/quarter=3]
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.initNextRecordReader(TezGroupedSplitsInputFormat.java:206)
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat$TezGroupedSplitsRecordReader.<init>(TezGroupedSplitsInputFormat.java:145)
    at org.apache.hadoop.mapred.split.TezGroupedSplitsInputFormat.getRecordReader(TezGroupedSplitsInputFormat.java:111)
    at org.apache.tez.mapreduce.lib.MRReaderMapred.setupOldRecordReader(MRReaderMapred.java:157)
    at org.apache.tez.mapreduce.lib.MRReaderMapred.<init>(MRReaderMapred.java:76)
    at org.apache.tez.mapreduce.input.MultiMRInput.initFromEvent(MultiMRInput.java:196)
    at org.apache.tez.mapreduce.input.MultiMRInput.handleEvents(MultiMRInput.java:154)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.handleEvent(LogicalIOProcessorRuntimeTask.java:715)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.access$600(LogicalIOProcessorRuntimeTask.java:105)
    at org.apache.tez.runtime.LogicalIOProcessorRuntimeTask$1.runInternal(LogicalIOProcessorRuntimeTask.java:792)
    at org.apache.tez.common.RunnableWithNdc.run(RunnableWithNdc.java:35)
    at java.lang.Thread.run(Thread.java:745)
{noformat}

> LLAP IO: SMB joins fail elevator
> ---------------------------------
>
>                 Key: HIVE-16761
>                 URL: https://issues.apache.org/jira/browse/HIVE-16761
>             Project: Hive
>          Issue Type: Bug
>            Reporter: Gopal V
>            Assignee: Sergey Shelukhin
>
> {code}
> Caused by: java.io.IOException: java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector
>     at org.apache.hadoop.hive.ql.io.BatchToRowReader.next(BatchToRowReader.java:153)
>     at org.apache.hadoop.hive.ql.io.BatchToRowReader.next(BatchToRowReader.java:78)
>     at org.apache.hadoop.hive.ql.io.HiveContextAwareRecordReader.doNext(HiveContextAwareRecordReader.java:360)
>     ... 26 more
> Caused by: java.lang.ClassCastException: org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector
>     at org.apache.hadoop.hive.ql.io.BatchToRowReader.nextString(BatchToRowReader.java:334)
>     at org.apache.hadoop.hive.ql.io.BatchToRowReader.nextValue(BatchToRowReader.java:602)
>     at org.apache.hadoop.hive.ql.io.BatchToRowReader.next(BatchToRowReader.java:149)
>     ... 28 more
> {code}
> {code}
> set hive.enforce.sortmergebucketmapjoin=false;
> set hive.optimize.bucketmapjoin=true;
> set hive.optimize.bucketmapjoin.sortedmerge=true;
> set hive.auto.convert.sortmerge.join=true;
> set hive.auto.convert.join=true;
> set hive.auto.convert.join.noconditionaltask.size=500;
> select year,quarter,count(*) from transactions_raw_orc_200 a join customer_accounts_orc_200 b on a.account_id=b.account_id group by year,quarter;
> {code}

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)