[
https://issues.apache.org/jira/browse/HIVE-7499?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14073078#comment-14073078
]
navdeep commented on HIVE-7499:
-------------------------------
analyze command for partition on date type
STAGE DEPENDENCIES:
Stage-0 is a root stage
Stage-1 is a root stage
STAGE PLANS:
Stage: Stage-0
Map Reduce
Map Operator Tree:
TableScan
alias: rrslog_orc_oozie
filterExpr: (server_date = ((2013 - 11) - 30)) (type: boolean)
Filter Operator
predicate: false (type: boolean)
Select Operator
expressions: load_inst_id (type: bigint)
outputColumnNames: load_inst_id
Group By Operator
aggregations: compute_stats(load_inst_id, 16)
mode: hash
outputColumnNames: _col0
Reduce Output Operator
sort order:
value expressions: _col0 (type:
struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,bitvector:string,numbitvectors:int>)
Reduce Operator Tree:
Group By Operator
aggregations: compute_stats(VALUE._col0)
mode: mergepartial
outputColumnNames: _col0
Select Operator
expressions: _col0 (type:
struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,numdistinctvalues:bigint>)
outputColumnNames: _col0
File Output Operator
compressed: false
table:
input format: org.apache.hadoop.mapred.TextInputFormat
output format:
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
Stage: Stage-1
Column Stats Work
Column Stats Desc:
Columns: load_inst_id
Column Types: bigint
Partition: server_date=2013-11-30
Table: rrslog_orc_oozie
------------------------------------------------------------------------------------------------------------------------------------------
analyze command on statistics with partition on id type
STAGE DEPENDENCIES:
Stage-0 is a root stage
Stage-1 is a root stage
STAGE PLANS:
Stage: Stage-0
Map Reduce
Map Operator Tree:
TableScan
alias: devstat
filterExpr: (load_inst_id = 0) (type: boolean)
Select Operator
expressions: reputation_score (type: int)
outputColumnNames: reputation_score
Group By Operator
aggregations: compute_stats(reputation_score, 16)
mode: hash
outputColumnNames: _col0
Reduce Output Operator
sort order:
value expressions: _col0 (type:
struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,bitvector:string,numbitvectors:int>)
Reduce Operator Tree:
Group By Operator
aggregations: compute_stats(VALUE._col0)
mode: mergepartial
outputColumnNames: _col0
Select Operator
expressions: _col0 (type:
struct<columntype:string,min:bigint,max:bigint,countnulls:bigint,numdistinctvalues:bigint>)
outputColumnNames: _col0
File Output Operator
compressed: false
table:
input format: org.apache.hadoop.mapred.TextInputFormat
output format:
org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat
serde: org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe
Stage: Stage-1
Column Stats Work
Column Stats Desc:
Columns: reputation_score
Column Types: int
Partition: load_inst_id=0
Table: devstat
> Hive Statistics .wrong statistics getting updated.always getting initialized
> to 0 if partitioning is on date type.
> ------------------------------------------------------------------------------------------------------------------
>
> Key: HIVE-7499
> URL: https://issues.apache.org/jira/browse/HIVE-7499
> Project: Hive
> Issue Type: Bug
> Components: Statistics
> Affects Versions: 0.13.0
> Environment: hadoop 2.0, 0.13.0.2.1.2.1-471
> Reporter: navdeep
> Priority: Blocker
>
> computing statistics for column on table with partition on date type,
> statistics is not working .column is getting created for statistics in
> metastore but everything is initialized to 0 .
> i created the partition on int type statistics are working .
--
This message was sent by Atlassian JIRA
(v6.2#6252)