[ https://issues.apache.org/jira/browse/HIVE-10685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
AnfengYuan updated HIVE-10685: ------------------------------ Description: "Orders" table has 1500000000 rows and stored as ORC. hive> select count(*) from orders; OK 1500000000 Time taken: 37.692 seconds, Fetched: 1 row(s) The table contain 14 files,the size of each file is about 2.1 ~ 3.2 GB. After executing command : ALTER TABLE orders CONCATENATE; The table is already 1530115000 rows. My hive version is 1.1.0. was: "Orders" table has 1500000000 rows and stored as ORC. hive> select count(*) om orders; OK 1500000000 Time taken: 37.692 seconds, Fetched: 1 row(s) The table contain 14 files,the size of each file is about 2.1 ~ 3.2 GB. After executing command : ALTER TABLE orders CONCATENATE; The table is already 1530115000 rows. My hive version is 1.1.0. > Alter table concatenate oparetor will cause duplicate data > ---------------------------------------------------------- > > Key: HIVE-10685 > URL: https://issues.apache.org/jira/browse/HIVE-10685 > Project: Hive > Issue Type: Bug > Reporter: guoliming > Assignee: guoliming > > "Orders" table has 1500000000 rows and stored as ORC. > hive> select count(*) from orders; > OK > 1500000000 > Time taken: 37.692 seconds, Fetched: 1 row(s) > The table contain 14 files,the size of each file is about 2.1 ~ 3.2 GB. > After executing command : ALTER TABLE orders CONCATENATE; > The table is already 1530115000 rows. > My hive version is 1.1.0. -- This message was sent by Atlassian JIRA (v6.3.4#6332)