Chun Chang created DRILL-2986:
---------------------------------
Summary: IOBException query multiple files that contain schema
changes between files
Key: DRILL-2986
URL: https://issues.apache.org/jira/browse/DRILL-2986
Project: Apache Drill
Issue Type: Bug
Components: Execution - Data Types
Affects Versions: 1.0.0
Reporter: Chun Chang
Assignee: Daniel Barclay (Drill)
{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from sys.version;
+------------+----------------+-------------+-------------+------------+
| commit_id | commit_message | commit_time | build_email | build_time |
+------------+----------------+-------------+-------------+------------+
| 31e51832db216ca16525af83abd445b812c569c4 | DRILL-2963: Fix
NestedLoopJoinBatch when left batch is empty | 06.05.2015 @ 14:21:57 EDT |
Unknown | 06.05.2015 @ 18:04:15 EDT |
+------------+----------------+-------------+-------------+------------+
{code}
The following query (Advanced/Passing/complextype/json/complex313.q) reads from
four files in a directory. The schema changes between files.
{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> select * from
dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`;
+------------+------------+------------+------------+------------+
| dir0 | dir1 | dir2 | id | oooa |
+------------+------------+------------+------------+------------+
| testdata | complex_type | json | 2 |
{"oa":{"oab":{"oabc":[{"rowId":2},{"rowValue1":2,"rowValue2":2}]}}} |
| testdata | complex_type | json | 2 | null |
java.lang.RuntimeException: java.sql.SQLException: Failure while executing
query.
at sqlline.SqlLine$IncrementalRows.hasNext(SqlLine.java:2514)
at sqlline.SqlLine$TableOutputFormat.print(SqlLine.java:2148)
at sqlline.SqlLine.print(SqlLine.java:1809)
at sqlline.SqlLine$Commands.execute(SqlLine.java:3766)
at sqlline.SqlLine$Commands.sql(SqlLine.java:3663)
at sqlline.SqlLine.dispatch(SqlLine.java:889)
at sqlline.SqlLine.begin(SqlLine.java:763)
at sqlline.SqlLine.start(SqlLine.java:498)
at sqlline.SqlLine.main(SqlLine.java:460)
{code}
Drill log:
{code}
2015-05-07 15:01:11,570 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO
o.a.d.e.s.schedule.BlockMapBuilder - Get block maps: Executed 4 out of 4 using
4 threads. Time: 2ms total, 1.126568ms avg, 1ms max.
2015-05-07 15:01:11,577 [2ab41f58-4a68-9384-2310-1853a36405a1:foreman] INFO
o.a.drill.exec.work.foreman.Foreman - State change requested. PENDING -->
RUNNING
2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0:
State change requested from AWAITING_ALLOCATION --> RUNNING for
2015-05-07 15:01:11,593 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.e.w.f.AbstractStatusReporter - State changed for
2ab41f58-4a68-9384-2310-1853a36405a1:0:0. New state: RUNNING
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,595 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,615 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.vector.UInt4Vector - Realloc vector null. [8] -> [16]
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.c.e.DrillRuntimeException - User Error Occurred
org.apache.drill.common.exceptions.UserException: DATA_READ ERROR: index: 0,
length: 4 (expected: range(0, 0))
Line 11
Column 48
Field rowValue1
[Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 ]
at
org.apache.drill.common.exceptions.UserException$Builder.build(UserException.java:465)
~[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:512)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:305)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:309)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeDataSwitch(JsonReader.java:242)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeToVector(JsonReader.java:180)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.write(JsonReader.java:146)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.store.easy.json.JSONRecordReader.next(JSONRecordReader.java:194)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.physical.impl.ScanBatch.next(ScanBatch.java:175)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.physical.impl.validate.IteratorValidatorBatchIterator.next(IteratorValidatorBatchIterator.java:118)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:83)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.physical.impl.ScreenCreator$ScreenRoot.innerNext(ScreenCreator.java:80)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.physical.impl.BaseRootExec.next(BaseRootExec.java:73)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:199)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.FragmentExecutor$1.run(FragmentExecutor.java:193)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at java.security.AccessController.doPrivileged(Native Method)
[na:1.7.0_45]
at javax.security.auth.Subject.doAs(Subject.java:415) [na:1.7.0_45]
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1469)
[hadoop-common-2.4.1-mapr-1408.jar:na]
at
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:193)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
[na:1.7.0_45]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
Caused by: java.lang.IndexOutOfBoundsException: index: 0, length: 4 (expected:
range(0, 0))
at io.netty.buffer.DrillBuf.checkIndexD(DrillBuf.java:189)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
at io.netty.buffer.DrillBuf.chk(DrillBuf.java:211)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
at io.netty.buffer.DrillBuf.getInt(DrillBuf.java:491)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:4.0.27.Final]
at
org.apache.drill.exec.vector.UInt4Vector$Accessor.get(UInt4Vector.java:300)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.EmptyValuePopulator.populate(EmptyValuePopulator.java:46)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.RepeatedMapVector$Mutator.setValueCount(RepeatedMapVector.java:534)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.impl.RepeatedMapWriter.start(RepeatedMapWriter.java:169)
~[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:278)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.vector.complex.fn.JsonReader.writeData(JsonReader.java:470)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
... 25 common frames omitted
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0:
State change requested from RUNNING --> FAILED for
2015-05-07 15:01:11,635 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0:
State change requested from FAILED --> FINISHED for
2015-05-07 15:01:11,636 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.work.foreman.Foreman - State change requested. RUNNING -->
FAILED
org.apache.drill.common.exceptions.UserRemoteException: DATA_READ ERROR: index:
0, length: 4 (expected: range(0, 0))
File /drill/testdata/complex_type/json/jira1962a.json
Record 1
Line 11
Column 48
Field rowValue1
Line 11
Column 48
Field rowValue1
Fragment 0:0
[Error Id: 5cd7d26d-38a8-45dd-b7ae-4825cae19c37 on qa-node119.qa.lab:31010]
at
org.apache.drill.exec.work.foreman.QueryManager$1.statusUpdate(QueryManager.java:409)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:389)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:90)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:86)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.FragmentExecutor.sendFinalState(FragmentExecutor.java:266)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:232)
[drill-java-exec-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
[drill-common-1.0.0-SNAPSHOT-rebuffed.jar:1.0.0-SNAPSHOT]
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
[na:1.7.0_45]
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
[na:1.7.0_45]
at java.lang.Thread.run(Thread.java:744) [na:1.7.0_45]
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.d.e.w.fragment.FragmentExecutor - 2ab41f58-4a68-9384-2310-1853a36405a1:0:0:
State change requested from FAILED --> CANCELLATION_REQUESTED for
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN
o.a.d.e.w.fragment.FragmentExecutor - Ignoring unexpected state transition
FAILED => CANCELLATION_REQUESTED.
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.work.foreman.Foreman - foreman cleaning up.
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] INFO
o.a.drill.exec.work.foreman.Foreman - State change requested. FAILED -->
COMPLETED
2015-05-07 15:01:11,647 [2ab41f58-4a68-9384-2310-1853a36405a1:frag:0:0] WARN
o.a.drill.exec.work.foreman.Foreman - Dropping request to move to COMPLETED
state as query is already at FAILED state (which is terminal).
{code}
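Note on reading the error text: the root cause in the log above is an IndexOutOfBoundsException raised from DrillBuf.getInt, with the message "index: 0, length: 4 (expected: range(0, 0))". That message has the shape of a bounds check for a 4-byte read at offset 0 against a buffer whose capacity is 0. The sketch below is only an illustration of that check and is not Drill's or Netty's code; the TinyBuf class, its methods, and describe_read are hypothetical names introduced here.

```python
import struct


class TinyBuf:
    """Hypothetical stand-in for a bounds-checked byte buffer (not DrillBuf)."""

    def __init__(self, capacity):
        # Zero-filled backing store of the requested capacity.
        self.data = bytearray(capacity)

    def check_index(self, index, length):
        # Reject any read that would run past the end of the buffer.
        capacity = len(self.data)
        if index < 0 or length < 0 or index + length > capacity:
            raise IndexError(
                f"index: {index}, length: {length} "
                f"(expected: range(0, {capacity}))"
            )

    def get_int(self, index):
        # A 4-byte integer read, so the requested length is always 4.
        self.check_index(index, 4)
        return struct.unpack_from("<i", self.data, index)[0]


def describe_read(buf, index):
    """Return the value read, or the bounds-check message if the read fails."""
    try:
        return str(buf.get_int(index))
    except IndexError as exc:
        return str(exc)


if __name__ == "__main__":
    print(describe_read(TinyBuf(0), 0))   # read from a zero-capacity buffer
    print(describe_read(TinyBuf(16), 0))  # read from an allocated buffer
```

Under this reading, the failing call in the trace (UInt4Vector$Accessor.get, reached from EmptyValuePopulator.populate) tried to read a 4-byte value from a buffer with no allocated capacity. Why that buffer was empty is not established by the log.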
plan:
{code}
0: jdbc:drill:schema=dfs.drillTestDirComplexP> explain plan for select * from
dfs.`/drill/test*/comp[a-l]ex_type/json/jira1*.json`;
+------------+------------+
| text | json |
+------------+------------+
| 00-00 Screen
00-01 Scan(groupscan=[EasyGroupScan [selectionRoot=/drill, numFiles=4,
columns=[`*`], files=[maprfs:/drill/testdata/complex_type/json/jira1894.json,
maprfs:/drill/testdata/complex_type/json/jira1893.json,
maprfs:/drill/testdata/complex_type/json/jira1962a.json,
maprfs:/drill/testdata/complex_type/json/jira1962b.json]]])
{code}
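Note on the file selection: the plan above lists four files for the glob in the query. As a rough cross-check, the sketch below matches the same pattern against those four paths (scheme prefix removed) using Python's fnmatch. This is an approximation only: fnmatch is not the glob implementation Drill uses, and its "*" can also match "/" characters. The OTHER path is a hypothetical non-matching example and does not come from the report.

```python
from fnmatch import fnmatchcase

# Glob pattern copied from the query in this report.
PATTERN = "/drill/test*/comp[a-l]ex_type/json/jira1*.json"

# Paths copied from the plan output, with the "maprfs:" prefix removed.
PLAN_FILES = [
    "/drill/testdata/complex_type/json/jira1894.json",
    "/drill/testdata/complex_type/json/jira1893.json",
    "/drill/testdata/complex_type/json/jira1962a.json",
    "/drill/testdata/complex_type/json/jira1962b.json",
]

# Hypothetical path, used only to show a name the pattern rejects.
OTHER = "/drill/testdata/complex_type/json/jira2000.json"


def matching(paths, pattern=PATTERN):
    """Return the paths that match the glob pattern, in input order."""
    return [p for p in paths if fnmatchcase(p, pattern)]


if __name__ == "__main__":
    for path in matching(PLAN_FILES + [OTHER]):
        print(path)
```

All four plan files match the pattern under fnmatch, which is consistent with numFiles=4 in the plan.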