Riju Trivedi created HIVE-22452: ----------------------------------- Summary: CTAS query failure at DDL task stage doesn't clean out the target directory Key: HIVE-22452 URL: https://issues.apache.org/jira/browse/HIVE-22452 Project: Hive Issue Type: Bug Components: Hive Affects Versions: 3.1.2, 3.1.0 Reporter: Riju Trivedi Assignee: Riju Trivedi
CTAS query failure at DDL task stage due to HMS connection issue leaves the output file in target directory. Since DDL task stage happens after Tez DAG completion and MOVE Task , output file getsĀ already moved to target directory and does not get cleaned up after the query failure. Re-executing the same query causes a duplicate file under table location hence duplicate data. -- This message was sent by Atlassian Jira (v8.3.4#803005)