[ https://issues.apache.org/jira/browse/HIVE-718?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12745569#action_12745569 ]
Prasad Chakka commented on HIVE-718: ------------------------------------ Two statements issue load into same partition around the same time. For one, dirExisted will be true and false for the other. Suppose the 'false' stmt copies a file named 'a1' first and then 'true' stmt will fail if it copies the same file. so it will try to undo the previous copies and then delete the dir. But the 'false' stmt keeps copying the files blissfully and succeed but the files 'a1' and others that were copied before the 'true' stmt deleted the directory will not be there but there won't be an error for the 'false' stmt. hoping my writing is understandable enough :) > Load data inpath into a new partition without overwrite does not move the file > ------------------------------------------------------------------------------ > > Key: HIVE-718 > URL: https://issues.apache.org/jira/browse/HIVE-718 > Project: Hadoop Hive > Issue Type: Bug > Affects Versions: 0.4.0 > Reporter: Zheng Shao > Attachments: HIVE-718.1.patch, HIVE-718.2.patch, hive-718.txt > > > The bug can be reproduced as following. Note that it only happens for > partitioned tables. The select after the first load returns nothing, while > the second returns the data correctly. > insert.txt in the current local directory contains 3 lines: "a", "b" and "c". > {code} > > create table tmp_insert_test (value string) stored as textfile; > > load data local inpath 'insert.txt' into table tmp_insert_test; > > select * from tmp_insert_test; > a > b > c > > create table tmp_insert_test_p ( value string) partitioned by (ds string) > > stored as textfile; > > load data local inpath 'insert.txt' into table tmp_insert_test_p partition > > (ds = '2009-08-01'); > > select * from tmp_insert_test_p where ds= '2009-08-01'; > > load data local inpath 'insert.txt' into table tmp_insert_test_p partition > > (ds = '2009-08-01'); > > select * from tmp_insert_test_p where ds= '2009-08-01'; > a 2009-08-01 > b 2009-08-01 > d 2009-08-01 > {code} -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.