[jira] [Assigned] (HIVE-22452) CTAS query failure at DDL task stage doesn't clean out the target directory

2019-11-04 Thread Riju Trivedi (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-22452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Riju Trivedi reassigned HIVE-22452:
---

Assignee: Marta Kuczora  (was: Riju Trivedi)

> CTAS query failure at DDL task stage doesn't clean out the target directory
> ---
>
> Key: HIVE-22452
> URL: https://issues.apache.org/jira/browse/HIVE-22452
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Affects Versions: 3.1.0, 3.1.2
>Reporter: Riju Trivedi
>Assignee: Marta Kuczora
>Priority: Major
>
> CTAS query failure at DDL task stage due to HMS connection issue leaves the 
> output file in target directory. Since DDL task stage happens after Tez DAG 
> completion and MOVE Task , output file getsĀ  already moved to target 
> directory and does not get cleaned up after the query failure.
> Re-executing the same query causes a duplicate file under table location 
> hence duplicate data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (HIVE-22452) CTAS query failure at DDL task stage doesn't clean out the target directory

2019-11-04 Thread Riju Trivedi (Jira)


 [ 
https://issues.apache.org/jira/browse/HIVE-22452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Riju Trivedi reassigned HIVE-22452:
---


> CTAS query failure at DDL task stage doesn't clean out the target directory
> ---
>
> Key: HIVE-22452
> URL: https://issues.apache.org/jira/browse/HIVE-22452
> Project: Hive
>  Issue Type: Bug
>  Components: Hive
>Affects Versions: 3.1.2, 3.1.0
>Reporter: Riju Trivedi
>Assignee: Riju Trivedi
>Priority: Major
>
> CTAS query failure at DDL task stage due to HMS connection issue leaves the 
> output file in target directory. Since DDL task stage happens after Tez DAG 
> completion and MOVE Task , output file getsĀ  already moved to target 
> directory and does not get cleaned up after the query failure.
> Re-executing the same query causes a duplicate file under table location 
> hence duplicate data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)