[ 
https://issues.apache.org/jira/browse/HIVE-25918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on HIVE-25918 started by Krisztian Kasa.
---------------------------------------------
> Invalid stats after multi inserting into the same partition
> -----------------------------------------------------------
>
>                 Key: HIVE-25918
>                 URL: https://issues.apache.org/jira/browse/HIVE-25918
>             Project: Hive
>          Issue Type: Sub-task
>          Components: Statistics
>            Reporter: Krisztian Kasa
>            Assignee: Krisztian Kasa
>            Priority: Major
>
> {code}
> create table source(p int, key int,value string);
> insert into source(p, key, value) values (101,42,'string42');
> create table stats_part(key int,value string) partitioned by (p int);
> from source
> insert into stats_part select key, value, p
> insert into stats_part select key, value, p;
> select count(*) from stats_part;
> {code}
> In this case {{StatsOptimizer}} serves the query from statistics, because the
> result should be the {{rowNum}} of the partition {{p=101}}. The result is
> {code}
> 1
> {code}
> however it should be
> {code}
> 2
> {code}
> because each of the two insert branches inserts one record.
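>
> A possible way to cross-check the numbers, sketched under the assumption that
> {{hive.compute.query.using.stats}} is the setting that lets {{StatsOptimizer}}
> answer {{count(*)}} from metadata in this setup: compare the stored partition
> statistics with a count that actually scans the data.
> {code}
> -- stored basic stats for the partition (look for numRows in the partition parameters)
> describe formatted stats_part partition(p=101);
> -- assumption: disabling this makes the next query scan the data instead of using stats
> set hive.compute.query.using.stats=false;
> select count(*) from stats_part;
> {code}
> If the scanned count is 2 while the stored row count is 1, the data is intact
> and only the statistics written by the multi insert are wrong.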



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
