Chun Chang created DRILL-2986: --------------------------------- Summary: IOBException query multiple files that contain schema changes between files Key: DRILL-2986 URL: https://issues.apache.org/jira/browse/DRILL-2986 Project: Apache Drill Issue Type: Bug Components: Execution - Data Types Affects Versions: 1.0.0 Reporter: Chun Chang Assignee: Daniel Barclay (Drill)
{code} 0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from sys.version; +------------+----------------+-------------+-------------+------------+ | commit_id | commit_message | commit_time | build_email | build_time | +------------+----------------+-------------+-------------+------------+ | 31e51832db216ca16525af83abd445b812c569c4 | DRILL-2963: Fix NestedLoopJoinBatch when left batch is empty | 06.05.2015 @ 14:21:57 EDT | Unknown | 06.05.2015 @ 18:04:15 EDT | +------------+----------------+-------------+-------------+------------+ {code} The following query (Advanced/Passing/complextype/json/complex313.q) read from four files in a dir. Between files, there is schema changes. {code} 0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`; +------------+------------+------------+------------+------------+ | dir0 | dir1 | dir2 | id | oooa | +------------+------------+------------+------------+------------+ | testdata | complex_type | json | 2 | {"oa":{"oab":{"oabc":[{"rowId":2},{"rowValue1":2,"rowValue2":2}]}}} | | testdata | complex_type | json | 2 | null | java.lang.RuntimeException: java.sql.SQLException: Failure while executing query. at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514) at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148) at sqlline.SqlLine.print(SqlLine.java:1809) at sqlline.SqlLine$Commands.execute(SqlLine.java:3766) at sqlline.SqlLine$Commands.sql(SqlLine.java:3663) at sqlline.SqlLine.dispatch(SqlLine.java:889) at sqlline.SqlLine.begin(SqlLine.java:763) at sqlline.SqlLine.start(SqlLine.java:498) at sqlline.SqlLine.main(SqlLine.java:460) {code} drill log: {code} 2015-05-07 15:01:11,570 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO o.a.d.e.s.schedule.BlockMapBuilder - Get block maps: Executed 4 out of 4 using 4 threads. Time: 2ms total, 1.126568ms avg, 1ms max. 2015-05-07 15:01:11,577 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO o.a.drill.exec.work.foreman.Foreman - State change requested. PENDING --> RUNNING 2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from AWAITING_ALLOCATION --> RUNNING for 2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.e.w.f.AbstractStatusReporter - State changed for 2ab41f58-4a68-9384-2310-1853a36405a1:0:0. New state: RUNNING 2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16] 2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.c.e.DrillRuntimeException - User Error Occurred org.apache.drill.common.exceptions.UserException: DATA_READ ERROR: index: 0, length: 4 (expected: range(0, 0)) Line 11 Column 48 Field rowValue1 [Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 ] at org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:465) ~[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:512) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeDataSwitch(JsonReader.java:242) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeToVector(JsonReader.java:180) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.write(JsonReader.java:146) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.store.easy.json.JSONRecordReader.next(JSONRecordReader.java:194) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:175) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:83) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.innerNext(ScreenCreator.java:80) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:73) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:199) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:193) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at java.security.AccessController.doPrivileged(Native Method) [na:1.7.0_45] at javax.security.auth.Subject.doAs(Subject.java:415) [na:1.7.0_45] at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1469) [hadoop-common-2.4.1-mapr-1408.jar:na] at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:193) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_45] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_45] at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45] Caused by: java.lang.IndexOutOfBoundsException: index: 0, length: 4 (expected: range(0, 0)) at io.netty.buffer.DrillBuf.checkIndexD(DrillBuf.java:189) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final] at io.netty.buffer.DrillBuf.chk(DrillBuf.java:211) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final] at io.netty.buffer.DrillBuf.getInt(DrillBuf.java:491) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final] at org.apache.drill.exec.vector.UInt4Vector$Accessor.get(UInt4Vector.java:300) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.EmptyValuePopulator.populate(EmptyValuePopulator.java:46) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.RepeatedMapVector$Mutator.setValueCount(RepeatedMapVector.java:534) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.impl.RepeatedMapWriter.start(RepeatedMapWriter.java:169) ~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:278) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] ... 25 common frames omitted 2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from RUNNING --> FAILED for 2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from FAILED --> FINISHED for 2015-05-07 15:01:11,636 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.work.foreman.Foreman - State change requested. RUNNING --> FAILED org.apache.drill.common.exceptions.UserRemoteException: DATA_READ ERROR: index: 0, length: 4 (expected: range(0, 0)) File /drill/testdata/complex_type/json/jira1962a.json Record 1 Line 11 Column 48 Field rowValue1 Line 11 Column 48 Field rowValue1 Fragment 0:0 [Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 on qa-node119.qa.lab:31010] at org.apache.drill.exec.work.foreman.QueryManager$1.statusUpdate(QueryManager.java:409) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:389) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:90) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:86) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:266) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232) [drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38) [drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) [na:1.7.0_45] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) [na:1.7.0_45] at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45] 2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0: State change requested from FAILED --> CANCELLATION_REQUESTED for 2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN o.a.d.e.w.fragment.FragmentExecutor - Ignoring unexpected state transition FAILED => CANCELLATION_REQUESTED. 2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.work.foreman.Foreman - foreman cleaning up. 2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO o.a.drill.exec.work.foreman.Foreman - State change requested. FAILED --> COMPLETED 2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN o.a.drill.exec.work.foreman.Foreman - Dropping request to move to COMPLETED state as query is already at FAILED state (which is terminal). {code} plan: {code} 0: jdbc:drill:schema=dfs.drillTestDirComplexP> explain plan for select * from dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`; +------------+------------+ | text | json | +------------+------------+ | 00-00 Screen 00-01 Scan(groupscan=[EasyGroupScan [selectionRoot=/drill, numFiles=4, columns=[`*`], files=[maprfs:/drill/testdata/complex_type/json/jira1894.json, maprfs:/drill/testdata/complex_type/json/jira1893.json, maprfs:/drill/testdata/complex_type/json/jira1962a.json, maprfs:/drill/testdata/complex_type/json/jira1962b.json]]]) {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)