Eugene Koifman created HIVE-16177: ------------------------------------- Summary: non Acid to acid conversion doesn't handle _copy_N files Key: HIVE-16177 URL: https://issues.apache.org/jira/browse/HIVE-16177 Project: Hive Issue Type: Bug Components: Transactions Reporter: Eugene Koifman Priority: Critical
insert into T(a,b) values(1,2) insert into T(a,b) values(1,3) //we should now have bucket files 000001_0 and 000001_0_copy_1 but OrcRawRecordMerger.OriginalReaderPair.next() doesn't know that there can be copy_N files and numbers rows in each bucket from 0 thus generating duplicate IDs [~owen.omalley], do you have any thoughts on a good way to handle this? attached patch has a few changes to make Acid even recognize copy_N but this is just a pre-requisite. The new UT demonstrates the issue. -- This message was sent by Atlassian JIRA (v6.3.15#6346)