[ 
https://issues.apache.org/jira/browse/DRILL-2608?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14384497#comment-14384497
 ] 

Khurram Faraaz commented on DRILL-2608:
---------------------------------------

Physical plan for the failing query

00-00    Screen : rowType = RecordType(ANY key): rowcount = 343.0, cumulative 
cost = {1122.3 rows, 1122.3 cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 12812
00-01      UnionAll(all=[true]) : rowType = RecordType(ANY key): rowcount = 
343.0, cumulative cost = {1088.0 rows, 1088.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12811
00-03        UnionAll(all=[true]) : rowType = RecordType(ANY key): rowcount = 
138.0, cumulative cost = {540.0 rows, 540.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12809
00-05          UnionAll(all=[true]) : rowType = RecordType(ANY key): rowcount = 
105.0, cumulative cost = {369.0 rows, 369.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12807
00-07            UnionAll(all=[true]) : rowType = RecordType(ANY key): rowcount 
= 101.0, cumulative cost = {260.0 rows, 260.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12805
00-09              UnionAll(all=[true]) : rowType = RecordType(ANY key): 
rowcount = 58.0, cumulative cost = {116.0 rows, 116.0 cpu, 0.0 io, 0.0 network, 
0.0 memory}, id = 12803
00-11                Scan(groupscan=[EasyGroupScan 
[selectionRoot=/tmp/charData.json, numFiles=1, columns=[`key`], 
files=[maprfs:/tmp/charData.json]]]) : rowType = RecordType(ANY key): rowcount 
= 18.0, cumulative cost = {18.0 rows, 18.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12801
00-10                Scan(groupscan=[EasyGroupScan 
[selectionRoot=/tmp/dateData.json, numFiles=1, columns=[`key`], 
files=[maprfs:/tmp/dateData.json]]]) : rowType = RecordType(ANY key): rowcount 
= 40.0, cumulative cost = {40.0 rows, 40.0 cpu, 0.0 io, 0.0 network, 0.0 
memory}, id = 12802
00-08              Scan(groupscan=[EasyGroupScan 
[selectionRoot=/tmp/doubleData.json, numFiles=1, columns=[`key`], 
files=[maprfs:/tmp/doubleData.json]]]) : rowType = RecordType(ANY key): 
rowcount = 43.0, cumulative cost = {43.0 rows, 43.0 cpu, 0.0 io, 0.0 network, 
0.0 memory}, id = 12804
00-06            Scan(groupscan=[EasyGroupScan 
[selectionRoot=/tmp/intData.json, numFiles=1, columns=[`key`], 
files=[maprfs:/tmp/intData.json]]]) : rowType = RecordType(ANY key): rowcount = 
4.0, cumulative cost = {4.0 rows, 4.0 cpu, 0.0 io, 0.0 network, 0.0 memory}, id 
= 12806
00-04          Scan(groupscan=[EasyGroupScan 
[selectionRoot=/tmp/timeStmpData.json, numFiles=1, columns=[`key`], 
files=[maprfs:/tmp/timeStmpData.json]]]) : rowType = RecordType(ANY key): 
rowcount = 33.0, cumulative cost = {33.0 rows, 33.0 cpu, 0.0 io, 0.0 network, 
0.0 memory}, id = 12808
00-02        Scan(groupscan=[EasyGroupScan [selectionRoot=/tmp/vrChrData.json, 
numFiles=1, columns=[`key`], files=[maprfs:/tmp/vrChrData.json]]]) : rowType = 
RecordType(ANY key): rowcount = 205.0, cumulative cost = {205.0 rows, 205.0 
cpu, 0.0 io, 0.0 network, 0.0 memory}, id = 12810

> Union all query fails when json.all_text_mode=false
> ---------------------------------------------------
>
>                 Key: DRILL-2608
>                 URL: https://issues.apache.org/jira/browse/DRILL-2608
>             Project: Apache Drill
>          Issue Type: Bug
>          Components: Query Planning & Optimization
>    Affects Versions: 0.9.0
>         Environment: | 9d92b8e319f2d46e8659d903d355450e15946533 | DRILL-2580: 
> Exit early from HashJoinBatch if build side is empty | 26.03.2015 @ 16:13:53 
> EDT | Unknown     | 26.03.2015 @ 16:53:21 EDT |
>            Reporter: Khurram Faraaz
>            Assignee: Sean Hsuan-Yi Chu
>
> Union all query over JSON data file fails when store.json.all_text_mode is 
> set to false, and same query returns correct results when 
> store.json.all_text_mode is set to true. Each JSON data file had only one 
> type of object {"key":<value>}, and the values in each of the JSON data files 
> were of same datatype. Test was executed on a 4 node cluster.
> {code}
> 0: jdbc:drill:> select key from `charData.json` union all select key from 
> `dateData.json` union all select key from `doubleData.json` union all select 
> key from `intData.json` union all select key from `timeStmpData.json` union 
> all select key from `vrChrData.json`;
> Query failed: RemoteRpcException: Failure while running fragment., For input 
> string: "itzVxYBb" [ f1f81073-161c-4f24-89e5-37379413b01b on 
> centos-04.qa.lab:31010 ]
> [ f1f81073-161c-4f24-89e5-37379413b01b on centos-04.qa.lab:31010 ]
> Error: exception while executing query: Failure while executing query. 
> (state=,code=0)
> {code}
> Then I set alter session set `store.json.all_text_mode`=true;
> After setting son.all_text_mode to true, union all query returned correct 
> results.
> {code}
> 0: jdbc:drill:> select key from `charData.json` union all select key from 
> `dateData.json` union all select key from `doubleData.json` union all select 
> key from `intData.json` union all select key from `timeStmpData.json` union 
> all select key from `vrChrData.json`;
> ...
> +------------+
> 7,194 rows selected (0.462 seconds)
> {code}
> Resetting it back to false gives the same Exception
> {code}
> 0: jdbc:drill:> alter session set `store.json.all_text_mode`=false;
> +------------+------------+
> |     ok     |  summary   |
> +------------+------------+
> | true       | store.json.all_text_mode updated. |
> +------------+------------+
> 1 row selected (0.049 seconds)
> 0: jdbc:drill:> select key from `charData.json` union all select key from 
> `dateData.json` union all select key from `doubleData.json` union all select 
> key from `intData.json` union all select key from `timeStmpData.json` union 
> all select key from `vrChrData.json`;
> Query failed: RemoteRpcException: Failure while running fragment., For input 
> string: "itzVxYBb" [ 412eda0e-cc22-43ae-b763-5e40a0326551 on 
> centos-04.qa.lab:31010 ]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> Error: exception while executing query: Failure while executing query. 
> (state=,code=0)
> {code}
> Stack trace from drillbit.log
> {code}
> 2015-03-27 18:30:56,620 [2aea5e1e-88b9-3e4e-07b5-d7e46b29756f:frag:0:0] ERROR 
> o.a.drill.exec.work.foreman.Foreman - Error 
> b9cb90bd-7d89-4061-8595-4c5ad983f3f3: RemoteRpcException: Failure while 
> running fragment., For input string: "itzVxYBb" [ 
> 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> org.apache.drill.exec.rpc.RemoteRpcException: Failure while running 
> fragment., For input string: "itzVxYBb" [ 
> 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
> [ 412eda0e-cc22-43ae-b763-5e40a0326551 on centos-04.qa.lab:31010 ]
>         at 
> org.apache.drill.exec.work.foreman.QueryManager.statusUpdate(QueryManager.java:163)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.exec.work.foreman.QueryManager$RootStatusReporter.statusChange(QueryManager.java:281)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:114)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.exec.work.fragment.AbstractStatusReporter.fail(AbstractStatusReporter.java:110)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.exec.work.fragment.FragmentExecutor.internalFail(FragmentExecutor.java:230)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.exec.work.fragment.FragmentExecutor.run(FragmentExecutor.java:182)
>  [drill-java-exec-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> org.apache.drill.common.SelfCleaningRunnable.run(SelfCleaningRunnable.java:38)
>  [drill-common-0.9.0-SNAPSHOT-rebuffed.jar:0.9.0-SNAPSHOT]
>         at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
>  [na:1.7.0_75]
>         at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
>  [na:1.7.0_75]
>         at java.lang.Thread.run(Thread.java:745) [na:1.7.0_75]
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to