[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15406888#comment-15406888 ] Prasanth Jayachandran commented on HIVE-14355: -- Committed to branch-2.1 as well. > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Fix For: 2.2.0, 2.1.1 > > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch, > HIVE-14355.2.java-only.patch, HIVE-14355.2.patch, > HIVE-14355.3.java-only.patch, HIVE-14355.3.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15400463#comment-15400463 ] Hive QA commented on HIVE-14355: Here are the results of testing the latest attachment: https://issues.apache.org/jira/secure/attachment/12820897/HIVE-14355.3.patch {color:red}ERROR:{color} -1 due to no test(s) being added or modified. {color:red}ERROR:{color} -1 due to 9 failed/errored test(s), 10418 tests executed *Failed tests:* {noformat} TestMsgBusConnection - did not produce a TEST-*.xml file org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_avro_nullable_union org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_list_bucket_dml_12 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_list_bucket_dml_13 org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_stats_list_bucket org.apache.hadoop.hive.cli.TestCliDriver.testCliDriver_subquery_multiinsert org.apache.hadoop.hive.cli.TestNegativeCliDriver.testNegativeCliDriver_avro_non_nullable_union org.apache.hadoop.hive.metastore.TestHiveMetaStoreTxns.stringifyValidTxns org.apache.hadoop.hive.metastore.TestHiveMetaStoreTxns.testTxnRange {noformat} Test results: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/692/testReport Console output: https://builds.apache.org/job/PreCommit-HIVE-MASTER-Build/692/console Test logs: http://ec2-204-236-174-241.us-west-1.compute.amazonaws.com/logs/PreCommit-HIVE-MASTER-Build-692/ Messages: {noformat} Executing org.apache.hive.ptest.execution.TestCheckPhase Executing org.apache.hive.ptest.execution.PrepPhase Executing org.apache.hive.ptest.execution.ExecutionPhase Executing org.apache.hive.ptest.execution.ReportingPhase Tests exited with: TestsFailedException: 9 tests failed {noformat} This message is automatically generated. ATTACHMENT ID: 12820897 - PreCommit-HIVE-MASTER-Build > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch, > HIVE-14355.2.java-only.patch, HIVE-14355.2.patch, > HIVE-14355.3.java-only.patch, HIVE-14355.3.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266)
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15399874#comment-15399874 ] Sergey Shelukhin commented on HIVE-14355: - +1 > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch, > HIVE-14355.2.java-only.patch, HIVE-14355.2.patch, > HIVE-14355.3.java-only.patch, HIVE-14355.3.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.OperatorUtils: Setting
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15399840#comment-15399840 ] Prasanth Jayachandran commented on HIVE-14355: -- [~sershe] Can you take a look again plz? > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch, > HIVE-14355.2.java-only.patch, HIVE-14355.2.patch, > HIVE-14355.3.java-only.patch, HIVE-14355.3.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO [TezTaskRunner >
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15398304#comment-15398304 ] Prasanth Jayachandran commented on HIVE-14355: -- Addressed [~sershe]'s review comments. > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch, > HIVE-14355.2.java-only.patch, HIVE-14355.2.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.OperatorUtils: Setting output >
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15398046#comment-15398046 ] Sergey Shelukhin commented on HIVE-14355: - Some comments around code structure > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.OperatorUtils: Setting output > collector: RS[4] --> Reducer 2 > 2016-07-27T01:52:43,329 ERROR
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396866#comment-15396866 ] Prasanth Jayachandran commented on HIVE-14355: -- [~sershe] can you plz take a look at the changes? To summarize - LlapInputFormat disables llap io if it finds unsafe schema evolution - OrcEncodedDataConsumer has to known the file schema which it gets from OrcEncodedDataReader and reader schema from job conf. Both are required to create schema evolution object. - This check can happen only at runtime because we don't know the file schema at compile time > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) >
[jira] [Commented] (HIVE-14355) Schema evolution for ORC in llap is broken for int to string conversion
[ https://issues.apache.org/jira/browse/HIVE-14355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15396864#comment-15396864 ] Prasanth Jayachandran commented on HIVE-14355: -- This patch also depends on HIVE-14310 changes. > Schema evolution for ORC in llap is broken for int to string conversion > --- > > Key: HIVE-14355 > URL: https://issues.apache.org/jira/browse/HIVE-14355 > Project: Hive > Issue Type: Bug >Affects Versions: 2.1.0, 2.2.0 >Reporter: Prasanth Jayachandran >Assignee: Prasanth Jayachandran >Priority: Critical > Attachments: HIVE-14355-java-only.patch, HIVE-14355.1.patch > > > When schema is evolved from any integer type to string then following > exceptions are thrown in LLAP (Works fine in Tez). I guess this should happen > even for other conversions. > {code} > hive> create table orc_integer(b bigint) stored as orc; > hive> insert into orc_integer values(100); > hive> select count(*) from orc_integer where b=100; > OK > 1 > hive> alter table orc_integer change column b b string; > hive> select count(*) from orc_integer where b=100; > // FAIL with following exception > {code} > {code:title=When vectorization is enabled} > 2016-07-27T01:48:05,611 INFO [TezTaskRunner ()] > vector.VectorReduceSinkOperator: RECORDS_OUT_INTERMEDIATE_Map_1:0, > 2016-07-27T01:48:05,611 ERROR [TezTaskRunner ()] tez.TezProcessor: > java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: > Hive Runtime Error while processing row > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:95) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.pushRecord(MapRecordSource.java:70) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordProcessor.run(MapRecordProcessor.java:393) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(TezProcessor.java:185) > at > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(TezProcessor.java:168) > at > org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run(LogicalIOProcessorRuntimeTask.java:370) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:73) > at > org.apache.tez.runtime.task.TaskRunner2Callable$1.run(TaskRunner2Callable.java:61) > at java.security.AccessController.doPrivileged(Native Method) > at javax.security.auth.Subject.doAs(Subject.java:422) > at > org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:61) > at > org.apache.tez.runtime.task.TaskRunner2Callable.callInternal(TaskRunner2Callable.java:37) > at org.apache.tez.common.CallableWithNdc.call(CallableWithNdc.java:36) > at > org.apache.hadoop.hive.llap.daemon.impl.StatsRecordingThreadPool$WrappedCallable.call(StatsRecordingThreadPool.java:110) > at java.util.concurrent.FutureTask.run(FutureTask.java:266) > at > java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) > at > java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) > at java.lang.Thread.run(Thread.java:745) > Caused by: org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime > Error while processing row > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:866) > at > org.apache.hadoop.hive.ql.exec.tez.MapRecordSource.processRow(MapRecordSource.java:86) > ... 18 more > Caused by: java.lang.ClassCastException: > org.apache.hadoop.hive.ql.exec.vector.LongColumnVector cannot be cast to > org.apache.hadoop.hive.ql.exec.vector.BytesColumnVector > at > org.apache.hadoop.hive.ql.exec.vector.expressions.gen.FilterStringGroupColEqualStringGroupScalarBase.evaluate(FilterStringGroupColEqualStringGroupScalarBase.java:42) > at > org.apache.hadoop.hive.ql.exec.vector.VectorFilterOperator.process(VectorFilterOperator.java:110) > at org.apache.hadoop.hive.ql.exec.Operator.forward(Operator.java:879) > at > org.apache.hadoop.hive.ql.exec.TableScanOperator.process(TableScanOperator.java:130) > at > org.apache.hadoop.hive.ql.exec.vector.VectorMapOperator.process(VectorMapOperator.java:774) > ... 19 more > {code} > {code:title=When vectorization is disabled} > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.ReduceSinkOperator: Using tag = -1 > 2016-07-27T01:52:43,328 INFO [TezTaskRunner > (1469608604787_0002_26_00_00_0)] exec.OperatorUtils: Setting output > collector: RS[4] --> Reducer 2 >