Re: Result of the TPC-DS benchmark on Hive master branch

Sungwoo Park Tue, 17 Nov 2020 16:32:11 -0800

>
>  > 1. With hive.optimize.shared.work.dppunion=true, query 2 and 59 fail.
> Please see the attachment for stack traces.
>
> Even thru the exception seem to be a reoccurance of the previous issue -
> existing checks + HIVE-24360 should have restricted all incorrect cases.
> I built in some debug stuff while I made these patches - and it would help
> a lot to get a peek into those; but they need to be enabled by
> hand/etc...while I polish that a
> bit more - could you please share an EXPLAIN FORMATTED about one of the
> queries failing because of that patch?
>


Please see the attachment for the result of EXPLAIN on query 12. (EXPLAIN
FORMATTED seems to have some problem.)  Hive tries to create two broadcast
edges from Reducer 8 to Map 6, thus raising an exception.

 > 2. Query 14 fails in both cases, and it seems like another bug. Note
> that when hive.cbo.enable is set to true when running query 14.
>
> I think you will find some cbo exception in the hive logs - explaining why
> it resorts to the non-cbo path.
>

Indeed it raises RuntimeException:

20/11/17 13:04:22 ERROR parse.CalcitePlanner: CBO failed, skipping CBO.
java.lang.RuntimeException: equivalence mapping violation
  at
org.apache.hadoop.hive.ql.plan.mapper.PlanMapper.link(PlanMapper.java:220)

 Please see the attachment for the full stack trace.


>  > 3. For some queries, the number of rows is different between the two
> experiments. In most cases, it seems to be rounding errors, but the
> difference is rather large for
> some queries (e.g., query 29 and 58). Please see the attachment for the
> result.
>
> that's very odd - I've recently fixed a bug in swo which may have caused
> issues like this(HIVE-24365); I would recommend to compare the result with
> the whole thing off
> (hive.optimize.shared.work=false).
> If you could isolate and reproduce this in a qtest I could also dig into
> it.


For now, let me report the result of testing HIVE-24366. Please see the
attachment for the result.

HIVE-24366 (e9f72e654750de208227d46a22e983413b080c6c, Thu Nov 12)

TEZ-4238 (22fec6c0ecc7ebe6f6f28800935cc6f69794dad5, Thu Oct 8)
guava.version=19.0 in pom.xml
hadoop.version=3.1.0 in pom.xml

TPC-DS 100GB ORC

hive.execution.engine=tez
hive.execution.mode=container, Tez containers are not reused across queries.
hive.cbo.enable=true
hive.query.reexecution.stats.persist.scope=metastore (default value)

1) hive.optimize.shared.work = false
2) hive.optimize.shared.work = true, hive.optimize.shared.work.dppunion =
true
3)  hive.optimize.shared.work = true, hive.optimize.shared.work.dppunion =
false

For each case, the first column reports the execution time and the second
column reports the number of rows. If the number of rows is 1, it also
reports the sum of all values in the result.

Cheers,

--- Sungwoo

1.

=== Query 2

EXPLAIN with wscs as
 (select sold_date_sk
        ,sales_price
  from (select ws_sold_date_sk sold_date_sk
              ,ws_ext_sales_price sales_price
        from web_sales) x
        union all
       (select cs_sold_date_sk sold_date_sk
              ,cs_ext_sales_price sales_price
        from catalog_sales)),
 wswscs as
 (select d_week_seq,
        sum(case when (d_day_name='Sunday') then sales_price else null end) 
sun_sales,
        sum(case when (d_day_name='Monday') then sales_price else null end) 
mon_sales,
        sum(case when (d_day_name='Tuesday') then sales_price else  null end) 
tue_sales,
        sum(case when (d_day_name='Wednesday') then sales_price else null end) 
wed_sales,
        sum(case when (d_day_name='Thursday') then sales_price else null end) 
thu_sales,
        sum(case when (d_day_name='Friday') then sales_price else null end) 
fri_sales,
        sum(case when (d_day_name='Saturday') then sales_price else null end) 
sat_sales
 from wscs
     ,date_dim
 where d_date_sk = sold_date_sk
 group by d_week_seq)
 select d_week_seq1
       ,round(sun_sales1/sun_sales2,2)
       ,round(mon_sales1/mon_sales2,2)
       ,round(tue_sales1/tue_sales2,2)
       ,round(wed_sales1/wed_sales2,2)
       ,round(thu_sales1/thu_sales2,2)
       ,round(fri_sales1/fri_sales2,2)
       ,round(sat_sales1/sat_sales2,2)
 from
 (select wswscs.d_week_seq d_week_seq1
        ,sun_sales sun_sales1
        ,mon_sales mon_sales1
        ,tue_sales tue_sales1
        ,wed_sales wed_sales1
        ,thu_sales thu_sales1
        ,fri_sales fri_sales1
        ,sat_sales sat_sales1
  from wswscs,date_dim
  where date_dim.d_week_seq = wswscs.d_week_seq and
        d_year = 2001) y,
 (select wswscs.d_week_seq d_week_seq2
        ,sun_sales sun_sales2
        ,mon_sales mon_sales2
        ,tue_sales tue_sales2
        ,wed_sales wed_sales2
        ,thu_sales thu_sales2
        ,fri_sales fri_sales2
        ,sat_sales sat_sales2
  from wswscs
      ,date_dim
  where date_dim.d_week_seq = wswscs.d_week_seq and
        d_year = 2001+1) z
 where d_week_seq1=d_week_seq2-53
 order by d_week_seq1;

=== Output

+----------------------------------------------------+
|                      Explain                       |
+----------------------------------------------------+
| Plan optimized by CBO.                             |
|                                                    |
| Vertex dependency in root stage                    |
| Map 1 <- Map 6 (BROADCAST_EDGE), Union 2 (CONTAINS) |
| Map 10 <- Map 6 (BROADCAST_EDGE), Union 11 (CONTAINS) |
| Map 13 <- Map 6 (BROADCAST_EDGE), Union 11 (CONTAINS) |
| Map 5 <- Map 6 (BROADCAST_EDGE), Union 2 (CONTAINS) |
| Map 6 <- Reducer 8 (BROADCAST_EDGE), Reducer 9 (BROADCAST_EDGE), Reducer 9 
(BROADCAST_EDGE) |
| Reducer 12 <- Map 7 (BROADCAST_EDGE), Union 11 (SIMPLE_EDGE) |
| Reducer 3 <- Map 7 (BROADCAST_EDGE), Reducer 12 (BROADCAST_EDGE), Union 2 
(SIMPLE_EDGE) |
| Reducer 4 <- Reducer 3 (SIMPLE_EDGE)               |
| Reducer 8 <- Map 7 (CUSTOM_SIMPLE_EDGE)            |
| Reducer 9 <- Map 7 (CUSTOM_SIMPLE_EDGE)            |
|                                                    |
| Stage-0                                            |
|   Fetch Operator                                   |
|     limit:-1                                       |
|     Stage-1                                        |
|       Reducer 4 vectorized                         |
|       File Output Operator [FS_253]                |
|         Select Operator [SEL_252] (rows=11224 width=60) |
|           
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|         <-Reducer 3 [SIMPLE_EDGE] vectorized       |
|           SHUFFLE [RS_251]                         |
|             Select Operator [SEL_250] (rows=11224 width=60) |
|               
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|               Map Join Operator [MAPJOIN_249] (rows=11224 width=116) |
|                 Conds:MAPJOIN_248.(_col0 - 
53)=RS_246._col0(Inner),Output:["_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col9","_col10","_col11","_col12","_col13","_col14","_col15","_col16"]
 |
|               <-Reducer 12 [BROADCAST_EDGE] vectorized |
|                 BROADCAST [RS_246]                 |
|                   PartitionCols:_col0              |
|                   Map Join Operator [MAPJOIN_245] (rows=367 width=60) |
|                     
Conds:GBY_244._col0=RS_208._col0(Inner),Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"]
 |
|                   <-Map 7 [BROADCAST_EDGE] vectorized |
|                     PARTITION_ONLY_SHUFFLE [RS_208] |
|                       PartitionCols:_col0          |
|                       Select Operator [SEL_205] (rows=367 width=4) |
|                         Output:["_col0"]           |
|                         Filter Operator [FIL_203] (rows=367 width=8) |
|                           predicate:((d_year = 2001) and d_week_seq is not 
null) |
|                           TableScan [TS_20] (rows=73049 width=8) |
|                             tpcds_bin_partitioned_orc_100@date_dim,date_dim, 
ACID table,Tbl:COMPLETE,Col:COMPLETE,Output:["d_week_seq","d_year"] |
|                   <-Group By Operator [GBY_244] (rows=11297 width=60) |
|                       
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(VALUE._col0)","sum(VALUE._col1)","sum(VALUE._col2)","sum(VALUE._col3)","sum(VALUE._col4)","sum(VALUE._col5)","sum(VALUE._col6)"],keys:KEY._col0
 |
|                     <-Union 11 [SIMPLE_EDGE]       |
|                       <-Map 10 [CONTAINS] vectorized |
|                         Reduce Output Operator [RS_263] |
|                           PartitionCols:_col0      |
|                           Group By Operator [GBY_262] (rows=1457313 width=60) 
|
|                             
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(_col1)","sum(_col2)","sum(_col3)","sum(_col4)","sum(_col5)","sum(_col6)","sum(_col7)"],keys:_col0
 |
|                             Select Operator [SEL_261] (rows=215260799 
width=39) |
|                               
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|                               Map Join Operator [MAPJOIN_260] (rows=215260799 
width=39) |
|                                 
Conds:SEL_259._col0=RS_227._col0(Inner),Output:["_col1","_col3","_col4","_col5","_col6","_col7","_col8","_col9","_col10"]
 |
|                               <-Map 6 [BROADCAST_EDGE] vectorized |
|                                 BROADCAST [RS_227] |
|                                   PartitionCols:_col0 |
|                                   Select Operator [SEL_222] (rows=73049 
width=40) |
|                                     
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col8"]
 |
|                                     Filter Operator [FIL_220] (rows=73049 
width=103) |
|                                       predicate:(d_date_sk is not null and 
d_week_seq is not null and d_week_seq BETWEEN 
DynamicValue(RS_47_date_dim_d_week_seq_min) AND 
DynamicValue(RS_47_date_dim_d_week_seq_max) and in_bloom_filter(d_week_seq, 
DynamicValue(RS_47_date_dim_d_week_seq_bloom_filter))) |
|                                       TableScan [TS_8] (rows=73049 width=103) 
|
|                                         
tpcds_bin_partitioned_orc_100@date_dim,date_dim, ACID 
table,Tbl:COMPLETE,Col:COMPLETE,Output:["d_date_sk","d_week_seq","d_day_name"] |
|                                       <-Reducer 9 [BROADCAST_EDGE] vectorized 
|
|                                         BROADCAST [RS_218] |
|                                           Group By Operator [GBY_216] (rows=1 
width=12) |
|                                             
Output:["_col0","_col1","_col2"],aggregations:["min(VALUE._col0)","max(VALUE._col1)","bloom_filter(VALUE._col2,
 expectedEntries=1000000)"] |
|                                       <-Reducer 9 [BROADCAST_EDGE] vectorized 
|
|                                          Please refer to the previous 
BROADCAST [RS_218] |
|                                       <-Reducer 8 [BROADCAST_EDGE] vectorized 
|
|                                         BROADCAST [RS_215] |
|                                           Group By Operator [GBY_214] (rows=1 
width=12) |
|                                             
Output:["_col0","_col1","_col2"],aggregations:["min(VALUE._col0)","max(VALUE._col1)","bloom_filter(VALUE._col2,
 expectedEntries=1000000)"] |
|                               <-Select Operator [SEL_259] (rows=71983315 
width=15) |
|                                   Output:["_col0","_col1"] |
|                                   TableScan [TS_178] (rows=71983315 width=15) 
|
|                                     
tpcds_bin_partitioned_orc_100@web_sales,web_sales, ACID 
table,Tbl:COMPLETE,Col:COMPLETE,Output:["ws_ext_sales_price"] |
|                       <-Map 13 [CONTAINS] vectorized |
|                         Reduce Output Operator [RS_268] |
|                           PartitionCols:_col0      |
|                           Group By Operator [GBY_267] (rows=1457313 width=60) 
|
|                             
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(_col1)","sum(_col2)","sum(_col3)","sum(_col4)","sum(_col5)","sum(_col6)","sum(_col7)"],keys:_col0
 |
|                             Select Operator [SEL_266] (rows=215260799 
width=39) |
|                               
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|                               Map Join Operator [MAPJOIN_265] (rows=215260799 
width=39) |
|                                 
Conds:SEL_264._col0=RS_230._col0(Inner),Output:["_col1","_col3","_col4","_col5","_col6","_col7","_col8","_col9","_col10"]
 |
|                               <-Map 6 [BROADCAST_EDGE] vectorized |
|                                 BROADCAST [RS_230] |
|                                   PartitionCols:_col0 |
|                                    Please refer to the previous Select 
Operator [SEL_222] |
|                               <-Select Operator [SEL_264] (rows=143277484 
width=15) |
|                                   Output:["_col0","_col1"] |
|                                   TableScan [TS_190] (rows=143277484 
width=15) |
|                                     
tpcds_bin_partitioned_orc_100@catalog_sales,catalog_sales, ACID 
table,Tbl:COMPLETE,Col:COMPLETE,Output:["cs_ext_sales_price"] |
|               <-Map Join Operator [MAPJOIN_248] (rows=367 width=60) |
|                   
Conds:GBY_247._col0=RS_206._col0(Inner),Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"]
 |
|                 <-Map 7 [BROADCAST_EDGE] vectorized |
|                   PARTITION_ONLY_SHUFFLE [RS_206]  |
|                     PartitionCols:_col0            |
|                     Select Operator [SEL_204] (rows=367 width=4) |
+----------------------------------------------------+
|                      Explain                       |
+----------------------------------------------------+
|                       Output:["_col0"]             |
|                       Filter Operator [FIL_202] (rows=367 width=8) |
|                         predicate:((d_year = 2002) and d_week_seq is not 
null) |
|                          Please refer to the previous TableScan [TS_20] |
|                 <-Group By Operator [GBY_247] (rows=11297 width=60) |
|                     
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(VALUE._col0)","sum(VALUE._col1)","sum(VALUE._col2)","sum(VALUE._col3)","sum(VALUE._col4)","sum(VALUE._col5)","sum(VALUE._col6)"],keys:KEY._col0
 |
|                   <-Union 2 [SIMPLE_EDGE]          |
|                     <-Map 1 [CONTAINS] vectorized  |
|                       Reduce Output Operator [RS_243] |
|                         PartitionCols:_col0        |
|                         Group By Operator [GBY_242] (rows=1457313 width=60) |
|                           
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(_col1)","sum(_col2)","sum(_col3)","sum(_col4)","sum(_col5)","sum(_col6)","sum(_col7)"],keys:_col0
 |
|                           Select Operator [SEL_241] (rows=215260799 width=39) 
|
|                             
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|                             Map Join Operator [MAPJOIN_240] (rows=215260799 
width=39) |
|                               
Conds:SEL_239._col0=RS_223._col0(Inner),Output:["_col1","_col3","_col4","_col5","_col6","_col7","_col8","_col9","_col10"]
 |
|                             <-Map 6 [BROADCAST_EDGE] vectorized |
|                               BROADCAST [RS_223]   |
|                                 PartitionCols:_col0 |
|                                 Select Operator [SEL_221] (rows=73049 
width=40) |
|                                   
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7","_col8"]
 |
|                                   Filter Operator [FIL_219] (rows=73049 
width=103) |
|                                     predicate:(d_date_sk is not null and 
d_week_seq is not null and d_week_seq BETWEEN 
DynamicValue(RS_51_date_dim_d_week_seq_min) AND 
DynamicValue(RS_51_date_dim_d_week_seq_max) and (d_week_seq - 53) BETWEEN 
DynamicValue(RS_47_date_dim_d_week_seq_min) AND 
DynamicValue(RS_47_date_dim_d_week_seq_max) and in_bloom_filter(d_week_seq, 
DynamicValue(RS_51_date_dim_d_week_seq_bloom_filter)) and 
in_bloom_filter((d_week_seq - 53), 
DynamicValue(RS_47_date_dim_d_week_seq_bloom_filter))) |
|                                      Please refer to the previous TableScan 
[TS_8] |
|                             <-Select Operator [SEL_239] (rows=71983315 
width=15) |
|                                 Output:["_col0","_col1"] |
|                                 TableScan [TS_154] (rows=71983315 width=15) |
|                                   
tpcds_bin_partitioned_orc_100@web_sales,web_sales, ACID 
table,Tbl:COMPLETE,Col:COMPLETE,Output:["ws_ext_sales_price"] |
|                     <-Map 5 [CONTAINS] vectorized  |
|                       Reduce Output Operator [RS_258] |
|                         PartitionCols:_col0        |
|                         Group By Operator [GBY_257] (rows=1457313 width=60) |
|                           
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"],aggregations:["sum(_col1)","sum(_col2)","sum(_col3)","sum(_col4)","sum(_col5)","sum(_col6)","sum(_col7)"],keys:_col0
 |
|                           Select Operator [SEL_256] (rows=215260799 width=39) 
|
|                             
Output:["_col0","_col1","_col2","_col3","_col4","_col5","_col6","_col7"] |
|                             Map Join Operator [MAPJOIN_255] (rows=215260799 
width=39) |
|                               
Conds:SEL_254._col0=RS_226._col0(Inner),Output:["_col1","_col3","_col4","_col5","_col6","_col7","_col8","_col9","_col10"]
 |
|                             <-Map 6 [BROADCAST_EDGE] vectorized |
|                               BROADCAST [RS_226]   |
|                                 PartitionCols:_col0 |
|                                  Please refer to the previous Select Operator 
[SEL_221] |
|                             <-Select Operator [SEL_254] (rows=143277484 
width=15) |
|                                 Output:["_col0","_col1"] |
|                                 TableScan [TS_166] (rows=143277484 width=15) |
|                                   
tpcds_bin_partitioned_orc_100@catalog_sales,catalog_sales, ACID 
table,Tbl:COMPLETE,Col:COMPLETE,Output:["cs_ext_sales_price"] |
|                                                    |
+----------------------------------------------------+

2.

20/11/17 13:04:22 ERROR parse.CalcitePlanner: CBO failed, skipping CBO.
java.lang.RuntimeException: equivalence mapping violation
  at org.apache.hadoop.hive.ql.plan.mapper.PlanMapper.link(PlanMapper.java:220)
  at org.apache.hadoop.hive.ql.plan.mapper.PlanMapper.link(PlanMapper.java:192)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genFilterPlan(SemanticAnalyzer.java:3575)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genFilterPlan(SemanticAnalyzer.java:3538)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genBodyPlan(SemanticAnalyzer.java:10834)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11785)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11645)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11645)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11645)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11645)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11642)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11669)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.genPlan(SemanticAnalyzer.java:11655)
  at 
org.apache.hadoop.hive.ql.parse.CalcitePlanner.genOPTree(CalcitePlanner.java:618)
  at 
org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.analyzeInternal(SemanticAnalyzer.java:12548)
  at 
org.apache.hadoop.hive.ql.parse.CalcitePlanner.analyzeInternal(CalcitePlanner.java:443)
  at 
org.apache.hadoop.hive.ql.parse.BaseSemanticAnalyzer.analyze(BaseSemanticAnalyzer.java:301)
  at org.apache.hadoop.hive.ql.Compiler.analyze(Compiler.java:223)
  at org.apache.hadoop.hive.ql.Compiler.compile(Compiler.java:104)
  at org.apache.hadoop.hive.ql.Driver.compile(Driver.java:469)
  at org.apache.hadoop.hive.ql.Driver.compileInternal(Driver.java:421)
  at org.apache.hadoop.hive.ql.Driver.compileAndRespond(Driver.java:385)
  at org.apache.hadoop.hive.ql.Driver.compileAndRespond(Driver.java:379)
  at 
org.apache.hadoop.hive.ql.reexec.ReExecDriver.compileAndRespond(ReExecDriver.java:125)
  at 
org.apache.hive.service.cli.operation.SQLOperation.prepare(SQLOperation.java:199)
  at 
org.apache.hive.service.cli.operation.SQLOperation.runInternal(SQLOperation.java:262)
  at org.apache.hive.service.cli.operation.Operation.run(Operation.java:277)
  at 
org.apache.hive.service.cli.session.HiveSessionImpl.executeStatementInternal(HiveSessionImpl.java:560)
  at 
org.apache.hive.service.cli.session.HiveSessionImpl.executeStatementAsync(HiveSessionImpl.java:545)
  at sun.reflect.GeneratedMethodAccessor27.invoke(Unknown Source)
  at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
  at java.lang.reflect.Method.invoke(Method.java:498)
  at 
org.apache.hive.service.cli.session.HiveSessionProxy.invoke(HiveSessionProxy.java:78)
  at 
org.apache.hive.service.cli.session.HiveSessionProxy.access$000(HiveSessionProxy.java:36)
  at 
org.apache.hive.service.cli.session.HiveSessionProxy$1.run(HiveSessionProxy.java:63)
  at java.security.AccessController.doPrivileged(Native Method)
  at javax.security.auth.Subject.doAs(Subject.java:422)
  at 
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1682)
  at 
org.apache.hive.service.cli.session.HiveSessionProxy.invoke(HiveSessionProxy.java:59)
  at com.sun.proxy.$Proxy45.executeStatementAsync(Unknown Source)
  at 
org.apache.hive.service.cli.CLIService.executeStatementAsync(CLIService.java:315)
  at 
org.apache.hive.service.cli.thrift.ThriftCLIService.ExecuteStatement(ThriftCLIService.java:571)
  at 
org.apache.hive.service.rpc.thrift.TCLIService$Processor$ExecuteStatement.getResult(TCLIService.java:1550)
  at 
org.apache.hive.service.rpc.thrift.TCLIService$Processor$ExecuteStatement.getResult(TCLIService.java:1530)
  at org.apache.thrift.ProcessFunction.process(ProcessFunction.java:38)
  at org.apache.thrift.TBaseProcessor.process(TBaseProcessor.java:38)
  at 
org.apache.hive.service.auth.TSetIpAddressProcessor.process(TSetIpAddressProcessor.java:56)
  at 
org.apache.thrift.server.TThreadPoolServer$WorkerProcess.run(TThreadPoolServer.java:313)
  at 
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
  at 
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
  at java.lang.Thread.run(Thread.java:745)

hive4.hive-24366.tpcds.100gb.xlsx
Description: MS-Excel 2007 spreadsheet

Re: Result of the TPC-DS benchmark on Hive master branch

Reply via email to