[ https://issues.apache.org/jira/browse/TEZ-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14184744#comment-14184744 ]
Mostafa Mokhtar commented on TEZ-1702: -------------------------------------- [~rajesh.balamohan] Yarn won't let me dump the logs. You can find them here though http://cn106-10.l42scl.hortonworks.com:19888/jobhistory/logs/cn107-10.l42scl.hortonworks.com:45454/container_1414029100044_0150_01_000001/container_1414029100044_0150_01_000001/mmokhtar > Hive : With Auto reduce parallelism enabled TPC-DS query 31 gets stuck in > Reducer 12 > ------------------------------------------------------------------------------------- > > Key: TEZ-1702 > URL: https://issues.apache.org/jira/browse/TEZ-1702 > Project: Apache Tez > Issue Type: Bug > Affects Versions: 0.5.2 > Reporter: Mostafa Mokhtar > Priority: Critical > Fix For: 0.5.2 > > Attachments: Logs for container_1414029100044_0150_01_000001.zip > > > Issue found in branch-0.5 , with latest commit as > {code} > commit 2e65de88af709d30207403fea881b697a4853dd6 > Author: Bikas Saha <bi...@apache.org> > Date: Tue Oct 21 14:59:56 2014 -0700 > {code} > Running TPC-DS Query 31 with Auto reduce parallelism enabled the query gets > stuck in Reducer 12 > Call Stack for stuck thread > {code} > Thread 14575: (state = BLOCKED) > - sun.misc.Unsafe.park(boolean, long) @bci=0 (Interpreted frame) > - java.util.concurrent.locks.LockSupport.park(java.lang.Object) @bci=14, > line=186 (Interpreted frame) > - > java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject.await() > @bci=42, line=2043 (Interpreted frame) > - java.util.concurrent.LinkedBlockingQueue.take() @bci=29, line=442 > (Interpreted frame) > - > org.apache.tez.runtime.library.shuffle.common.impl.ShuffleManager.getNextInput() > @bci=67, line=663 (Interpreted frame) > - > org.apache.tez.runtime.library.common.readers.UnorderedKVReader.moveToNextInput() > @bci=26, line=176 (Interpreted frame) > - org.apache.tez.runtime.library.common.readers.UnorderedKVReader.next() > @bci=30, line=117 (Interpreted frame) > - > org.apache.hadoop.hive.ql.exec.tez.HashTableLoader.load(org.apache.hadoop.hive.ql.exec.persistence.MapJoinTableContainer[], > org.apache.hadoop.hive.ql.exec.persistence.MapJoinTableContainerSerDe[]) > @bci=259, line=112 (Compiled frame) > - org.apache.hadoop.hive.ql.exec.MapJoinOperator.loadHashTable() @bci=86, > line=190 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.MapJoinOperator.processOp(java.lang.Object, > int) @bci=12, line=244 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.Operator.forward(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=63, > line=815 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.SelectOperator.processOp(java.lang.Object, > int) @bci=121, line=84 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.Operator.forward(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=63, > line=815 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.GroupByOperator.forward(java.lang.Object[], > org.apache.hadoop.hive.ql.udf.generic.GenericUDAFEvaluator$AggregationBuffer[]) > @bci=97, line=1072 (Interpreted frame) > - > org.apache.hadoop.hive.ql.exec.GroupByOperator.processAggr(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector, > org.apache.hadoop.hive.ql.exec.KeyWrapper) @bci=71, line=881 (Interpreted > frame) > - > org.apache.hadoop.hive.ql.exec.GroupByOperator.processKey(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=34, > line=741 (Interpreted frame) > > > 222,0-1 79% > - > org.apache.hadoop.hive.ql.exec.GroupByOperator.processAggr(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector, > org.apache.hadoop.hive.ql.exec.KeyWrapper) @bci=71, line=881 (Interpreted > frame) > - > org.apache.hadoop.hive.ql.exec.GroupByOperator.processKey(java.lang.Object, > org.apache.hadoop.hive.serde2.objectinspector.ObjectInspector) @bci=34, > line=741 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.GroupByOperator.processOp(java.lang.Object, > int) @bci=457, line=809 (Interpreted frame) > - > org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.processKeyValues(java.lang.Iterable, > byte) @bci=174, line=308 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.tez.ReduceRecordSource.pushRecord() > @bci=218, line=252 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.tez.ReduceRecordProcessor.run() @bci=155, > line=168 (Interpreted frame) > - > org.apache.hadoop.hive.ql.exec.tez.TezProcessor.initializeAndRunProcessor(java.util.Map, > java.util.Map) @bci=224, line=163 (Interpreted frame) > - org.apache.hadoop.hive.ql.exec.tez.TezProcessor.run(java.util.Map, > java.util.Map) @bci=86, line=138 (Interpreted frame) > - org.apache.tez.runtime.LogicalIOProcessorRuntimeTask.run() @bci=76, > line=324 (Interpreted frame) > - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run() > @bci=119, line=180 (Interpreted frame) > - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable$1.run() > @bci=1, line=172 (Interpreted frame) > - > java.security.AccessController.doPrivileged(java.security.PrivilegedExceptionAction, > java.security.AccessControlContext) @bci=0 (Interpreted frame) > - javax.security.auth.Subject.doAs(javax.security.auth.Subject, > java.security.PrivilegedExceptionAction) @bci=42, line=415 (Interpreted frame) > - > org.apache.hadoop.security.UserGroupInformation.doAs(java.security.PrivilegedExceptionAction) > @bci=14, line=1548 (Interpreted frame) > - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call() > @bci=15, line=172 (Interpreted frame) > - org.apache.tez.runtime.task.TezTaskRunner$TaskRunnerCallable.call() > @bci=1, line=167 (Interpreted frame) > - java.util.concurrent.FutureTask.run() @bci=42, line=262 (Interpreted frame) > - > java.util.concurrent.ThreadPoolExecutor.runWorker(java.util.concurrent.ThreadPoolExecutor$Worker) > @bci=95, line=1145 (Interpreted frame) > - java.util.concurrent.ThreadPoolExecutor$Worker.run() @bci=5, line=615 > (Interpreted frame) > - java.lang.Thread.run() @bci=11, line=744 (Interpreted frame) > {code} > Plan > {code} > STAGE DEPENDENCIES: > Stage-1 is a root stage > Stage-0 depends on stages: Stage-1 > STAGE PLANS: > Stage: Stage-1 > Tez > Edges: > Map 1 <- Map 10 (BROADCAST_EDGE), Map 9 (BROADCAST_EDGE) > Map 11 <- Map 4 (BROADCAST_EDGE), Map 7 (BROADCAST_EDGE) > Map 14 <- Map 22 (BROADCAST_EDGE), Map 8 (BROADCAST_EDGE) > Map 16 <- Map 25 (BROADCAST_EDGE), Map 6 (BROADCAST_EDGE) > Map 18 <- Map 24 (BROADCAST_EDGE), Map 5 (BROADCAST_EDGE) > Map 20 <- Map 23 (BROADCAST_EDGE), Map 3 (BROADCAST_EDGE) > Reducer 12 <- Map 11 (SIMPLE_EDGE), Reducer 15 (BROADCAST_EDGE), > Reducer 17 (BROADCAST_EDGE), Reducer 19 (BROADCAST_EDGE), Reducer 2 > (BROADCAST_EDGE), Reducer 21 (BROADCAST_EDGE) > Reducer 13 <- Reducer 12 (SIMPLE_EDGE) > Reducer 15 <- Map 14 (SIMPLE_EDGE) > Reducer 17 <- Map 16 (SIMPLE_EDGE) > Reducer 19 <- Map 18 (SIMPLE_EDGE) > Reducer 2 <- Map 1 (SIMPLE_EDGE) > Reducer 21 <- Map 20 (SIMPLE_EDGE) > DagName: mmokhtar_20141026170505_6a5353ab-9f03-4d19-8ed3-ef15fbb35294:1 > Vertices: > Map 1 > Map Operator Tree: > TableScan > alias: store_sales > filterExpr: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 550076554 Data size: 47370018896 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ss_addr_sk (type: int), ss_ext_sales_price > (type: float), ss_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 9 > Statistics: Num rows: 600476053 Data size: 4803808424 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col7 > input vertices: > 1 Map 10 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col7 (type: string), 1 (type: int), > 1998 (type: int), _col1 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: int), > _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 4712 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), _col1 > (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 589 Data size: 4712 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 10 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 11 > Map Operator Tree: > TableScan > alias: store_sales > filterExpr: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 550076554 Data size: 47370018896 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ss_addr_sk (type: int), ss_ext_sales_price > (type: float), ss_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 4 > Statistics: Num rows: 600476032 Data size: 4803808256 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col7 > input vertices: > 1 Map 7 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col7 (type: string), 2 (type: int), > 1998 (type: int), _col1 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: int), > _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 4712 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), _col1 > (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 589 Data size: 4712 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 14 > Map Operator Tree: > TableScan > alias: web_sales > filterExpr: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143966864 Data size: 19001610332 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ws_bill_addr_sk (type: int), > ws_ext_sales_price (type: float), ws_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 8 > Statistics: Num rows: 600476032 Data size: 4803808256 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int), _col1 (type: > float), 1998 (type: int), 1 (type: int) > outputColumnNames: _col0, _col1, _col4, _col5 > Statistics: Num rows: 600476032 Data size: > 9607616512 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} {_col4} {_col5} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col3, _col6, _col7 > input vertices: > 0 Map 22 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col1 (type: string), _col7 (type: > int), _col6 (type: int), _col3 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: > int), _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 584 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), > _col1 (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 73 Data size: 584 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 16 > Map Operator Tree: > TableScan > alias: web_sales > filterExpr: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143966864 Data size: 19001610332 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ws_bill_addr_sk (type: int), > ws_ext_sales_price (type: float), ws_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 25 > Statistics: Num rows: 600476032 Data size: 4803808256 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int), _col1 (type: > float), 1998 (type: int), 3 (type: int) > outputColumnNames: _col0, _col1, _col4, _col5 > Statistics: Num rows: 600476032 Data size: > 9607616512 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} {_col4} {_col5} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col3, _col6, _col7 > input vertices: > 0 Map 6 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col1 (type: string), _col7 (type: > int), _col6 (type: int), _col3 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: > int), _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 584 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), > _col1 (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 73 Data size: 584 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 18 > Map Operator Tree: > TableScan > alias: store_sales > filterExpr: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 550076554 Data size: 47370018896 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ss_addr_sk is not null (type: boolean) > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ss_addr_sk (type: int), ss_ext_sales_price > (type: float), ss_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 537118738 Data size: 6344204488 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 5 > Statistics: Num rows: 600476032 Data size: 4803808256 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col7 > input vertices: > 1 Map 24 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col7 (type: string), 3 (type: int), > 1998 (type: int), _col1 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 58846651136 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: int), > _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 4712 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), _col1 > (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 589 Data size: 4712 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 20 > Map Operator Tree: > TableScan > alias: web_sales > filterExpr: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143966864 Data size: 19001610332 > Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: ws_bill_addr_sk is not null (type: boolean) > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ws_bill_addr_sk (type: int), > ws_ext_sales_price (type: float), ws_sold_date_sk (type: int) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 143949044 Data size: 1727244892 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 > keys: > 0 _col2 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col0, _col1 > input vertices: > 1 Map 3 > Statistics: Num rows: 600476032 Data size: 4803808256 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int), _col1 (type: > float), 1998 (type: int), 2 (type: int) > outputColumnNames: _col0, _col1, _col4, _col5 > Statistics: Num rows: 600476032 Data size: > 9607616512 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col1} {_col4} {_col5} > keys: > 0 _col0 (type: int) > 1 _col0 (type: int) > outputColumnNames: _col1, _col3, _col6, _col7 > input vertices: > 0 Map 23 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col1 (type: string), _col7 (type: > int), _col6 (type: int), _col3 (type: float) > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 600476032 Data size: > 7205712384 Basic stats: COMPLETE Column stats: COMPLETE > Group By Operator > aggregations: sum(_col3) > keys: _col0 (type: string), _col1 (type: > int), _col2 (type: int) > mode: hash > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 584 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string), > _col1 (type: int), _col2 (type: int) > sort order: +++ > Map-reduce partition columns: _col0 (type: > string), _col1 (type: int), _col2 (type: int) > Statistics: Num rows: 73 Data size: 584 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col3 (type: double) > Execution mode: vectorized > Map 22 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 23 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 24 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 25 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 3) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 3) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: web_sales > Partition key expr: ws_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ws_sold_date_sk > Target Vertex: Map 16 > Execution mode: vectorized > Map 3 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 2) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 2) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: web_sales > Partition key expr: ws_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ws_sold_date_sk > Target Vertex: Map 20 > Execution mode: vectorized > Map 4 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 2) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 2) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: store_sales > Partition key expr: ss_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ss_sold_date_sk > Target Vertex: Map 11 > Execution mode: vectorized > Map 5 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 3) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 3) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: store_sales > Partition key expr: ss_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ss_sold_date_sk > Target Vertex: Map 18 > Execution mode: vectorized > Map 6 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 7 > Map Operator Tree: > TableScan > alias: customer_address > filterExpr: (ca_address_sk is not null and ca_county is not > null) (type: boolean) > Statistics: Num rows: 800000 Data size: 811903688 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (ca_address_sk is not null and ca_county is > not null) (type: boolean) > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: ca_address_sk (type: int), ca_county > (type: string) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 800000 Data size: 81600000 Basic > stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 800000 Data size: 81600000 > Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: string) > Execution mode: vectorized > Map 8 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 1) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 1) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: web_sales > Partition key expr: ws_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ws_sold_date_sk > Target Vertex: Map 14 > Execution mode: vectorized > Map 9 > Map Operator Tree: > TableScan > alias: date_dim > filterExpr: (((d_qoy = 1) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 73049 Data size: 81741831 Basic > stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (((d_qoy = 1) and (d_year = 1998)) and > d_date_sk is not null) (type: boolean) > Statistics: Num rows: 635 Data size: 7620 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: d_date_sk (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 2540 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: int) > sort order: + > Map-reduce partition columns: _col0 (type: int) > Statistics: Num rows: 635 Data size: 2540 Basic > stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: int) > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Group By Operator > keys: _col0 (type: int) > mode: hash > outputColumnNames: _col0 > Statistics: Num rows: 635 Data size: 0 Basic stats: > PARTIAL Column stats: COMPLETE > Dynamic Partitioning Event Operator > Target Input: store_sales > Partition key expr: ss_sold_date_sk > Statistics: Num rows: 635 Data size: 0 Basic > stats: PARTIAL Column stats: COMPLETE > Target column: ss_sold_date_sk > Target Vertex: Map 1 > Execution mode: vectorized > Reducer 12 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col3 (type: double) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col0} {_col1} > 1 {_col1} > keys: > 0 _col0 (type: string) > 1 _col0 (type: string) > outputColumnNames: _col0, _col1, _col3 > input vertices: > 1 Reducer 19 > Statistics: Num rows: 600476032 Data size: 9607616512 > Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col1 (type: > double), _col3 (type: double) > outputColumnNames: _col0, _col1, _col3 > Statistics: Num rows: 600476032 Data size: 9607616512 > Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > Inner Join 0 to 2 > Inner Join 2 to 3 > condition expressions: > 0 {_col0} {_col1} > 1 {_col1} > 2 {_col0} {_col1} {_col2} > 3 {_col1} {_col3} > keys: > 0 _col0 (type: string) > 1 _col0 (type: string) > 2 _col0 (type: string) > 3 _col0 (type: string) > outputColumnNames: _col0, _col1, _col3, _col4, _col5, > _col6, _col8, _col10 > input vertices: > 0 Reducer 15 > 1 Reducer 21 > 2 Reducer 2 > Statistics: Num rows: 600476032 Data size: > 19215233024 Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (CASE WHEN ((_col1 > 0.0)) THEN ((_col3 > / _col1)) ELSE (null) END > CASE WHEN ((_col6 > 0.0)) THEN ((_col8 / _col6)) > ELSE (null) END) (type: boolean) > Statistics: Num rows: 200158677 Data size: > 6405077664 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col4 (type: string), _col5 (type: > int), _col3 (type: double), _col6 (type: double), _col8 (type: double), > _col10 (type: double), _col0 (type: string), _col1 (type: double) > outputColumnNames: _col0, _col1, _col10, _col2, > _col4, _col6, _col7, _col8 > Statistics: Num rows: 200158677 Data size: > 6405077664 Basic stats: COMPLETE Column stats: COMPLETE > Map Join Operator > condition map: > Inner Join 0 to 1 > condition expressions: > 0 {_col1} > 1 {_col0} {_col1} {_col2} {_col4} {_col6} > {_col8} {_col10} > keys: > 0 _col0 (type: string) > 1 _col7 (type: string) > outputColumnNames: _col1, _col2, _col3, _col4, > _col6, _col8, _col10, _col12 > input vertices: > 0 Reducer 17 > Statistics: Num rows: 600476032 Data size: > 19215233024 Basic stats: COMPLETE Column stats: COMPLETE > Filter Operator > predicate: (CASE WHEN ((_col12 > 0.0)) THEN > ((_col1 / _col12)) ELSE (null) END > CASE WHEN ((_col6 > 0.0)) THEN ((_col8 / > _col6)) ELSE (null) END) (type: boolean) > Statistics: Num rows: 200158677 Data size: > 6405077664 Basic stats: COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col2 (type: string), _col3 > (type: int), (_col12 / _col10) (type: double), (_col6 / _col4) (type: > double), (_col1 / _col12) (type: double), (_col8 / _col6) (type: double) > outputColumnNames: _col0, _col1, _col2, > _col3, _col4, _col5 > Statistics: Num rows: 200158677 Data size: > 8006347080 Basic stats: COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col2 (type: double) > sort order: + > Statistics: Num rows: 200158677 Data > size: 8006347080 Basic stats: COMPLETE Column stats: COMPLETE > value expressions: _col0 (type: string), > _col1 (type: int), _col3 (type: double), _col4 (type: double), _col5 (type: > double) > Reducer 13 > Reduce Operator Tree: > Select Operator > expressions: VALUE._col0 (type: string), VALUE._col1 (type: > int), KEY.reducesinkkey0 (type: double), VALUE._col2 (type: double), > VALUE._col3 (type: double), VALUE._col4 (type: double) > outputColumnNames: _col0, _col1, _col2, _col3, _col4, _col5 > Statistics: Num rows: 200158677 Data size: 8006347080 Basic > stats: COMPLETE Column stats: COMPLETE > File Output Operator > compressed: false > Statistics: Num rows: 200158677 Data size: 8006347080 Basic > stats: COMPLETE Column stats: COMPLETE > table: > input format: org.apache.hadoop.mapred.TextInputFormat > output format: > org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat > serde: > org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe > Execution mode: vectorized > Reducer 15 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col3 (type: double) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string) > sort order: + > Map-reduce partition columns: _col0 (type: string) > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double) > Execution mode: vectorized > Reducer 17 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col3 (type: double) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string) > sort order: + > Map-reduce partition columns: _col0 (type: string) > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double) > Execution mode: vectorized > Reducer 19 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col3 (type: double) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string) > sort order: + > Map-reduce partition columns: _col0 (type: string) > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double) > Execution mode: vectorized > Reducer 2 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col2 (type: int), _col3 > (type: double) > outputColumnNames: _col0, _col1, _col2 > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string) > sort order: + > Map-reduce partition columns: _col0 (type: string) > Statistics: Num rows: 589 Data size: 9424 Basic stats: > COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: int), _col2 (type: double) > Execution mode: vectorized > Reducer 21 > Reduce Operator Tree: > Group By Operator > aggregations: sum(VALUE._col0) > keys: KEY._col0 (type: string), KEY._col1 (type: int), > KEY._col2 (type: int) > mode: mergepartial > outputColumnNames: _col0, _col1, _col2, _col3 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Select Operator > expressions: _col0 (type: string), _col3 (type: double) > outputColumnNames: _col0, _col1 > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > Reduce Output Operator > key expressions: _col0 (type: string) > sort order: + > Map-reduce partition columns: _col0 (type: string) > Statistics: Num rows: 73 Data size: 1168 Basic stats: > COMPLETE Column stats: COMPLETE > value expressions: _col1 (type: double) > Execution mode: vectorized > Stage: Stage-0 > Fetch Operator > limit: -1 > Processor Tree: > ListSink > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)