Krisztian Kasa created HIVE-25918:
-------------------------------------
Summary: Invalid stats after multi inserting into the same partition
Key: HIVE-25918
URL: https://issues.apache.org/jira/browse/HIVE-25918
Project: Hive
Issue Type: Bug
Components: Statistics
Reporter: Krisztian Kasa
Assignee: Krisztian Kasa
{code}
create table source(p int, key int,value string);
insert into source(p, key, value) values (101,42,'string42');
create table stats_part(key int,value string) partitioned by (p int);
from source
insert into stats_part select key, value, p
insert into stats_part select key, value, p;
select count(*) from stats_part;
{code}
In this case {{StatsOptimizer}} serves the query from the stored statistics,
because the result should equal the {{rowNum}} of the partition {{p=101}}. The result is
{code}
1
{code}
however, it should be
{code}
2
{code}
because each of the two insert branches inserts one record.
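One way to check that the stored statistics, rather than the data, are wrong is to compare the stats-served count with a real scan. The following is only a sketch, assuming the session otherwise uses default settings and that {{hive.compute.query.using.stats}} is what lets {{StatsOptimizer}} answer the query. The stored row count normally appears as {{numRows}} among the partition parameters.
{code}
-- Inspect the basic stats stored for the partition.
describe formatted stats_part partition(p=101);

-- Assumption: turning this off makes count(*) scan the data instead of using stats.
set hive.compute.query.using.stats=false;
select count(*) from stats_part;

-- Recompute the basic stats from the data, then let the stats serve the query again.
analyze table stats_part partition(p=101) compute statistics;
set hive.compute.query.using.stats=true;
select count(*) from stats_part;
{code}
If the problem is limited to the statistics, the scan should return 2 while the partition parameters still report a row count of 1.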
--
This message was sent by Atlassian Jira
(v8.20.1#820001)