[ 
https://issues.apache.org/jira/browse/HIVE-22917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Marta Kuczora reassigned HIVE-22917:
------------------------------------


> Configuration for Hive to recognise non-empty destination folders
> -----------------------------------------------------------------
>
>                 Key: HIVE-22917
>                 URL: https://issues.apache.org/jira/browse/HIVE-22917
>             Project: Hive
>          Issue Type: Task
>            Reporter: Marta Kuczora
>            Assignee: Marta Kuczora
>            Priority: Major
>
> Currently Hive overwrites the LOCATION folder even if it is non-empty in case 
> of INSERT or CTAS.
> Investigate this behavior and if we can introduce a switch whereby any 
> ALTER/INSERT or CTAS or CREATE or DROP operation / transaction would be 
> aborted if the switch is ON and the LOCATION clause points at a non-empty 
> folder.
> {noformat}
> >> create table test (json_data string)
>  STORED AS TEXTFILE
>  LOCATION 'hdfs://host-10-17-102-132.coe.>ra.com:8020/tmp/test'
>  TBLPROPERTIES ('serialization.null.format' = '');
> >> insert into test values('test0');
> >> insert into test values('test1');
> >> insert into test values('test2');
> >> select * from test;
> INFO : Compiling 
> command(queryId=hive_20200207150101_601d6dbc-99cb-446d-86ac-6f8ce5304681): 
> select * from test
> INFO : Executing 
> command(queryId=hive_20200207150101_601d6dbc-99cb-446d-86ac-6f8ce5304681): 
> select * from test
> INFO : Completed executing 
> command(queryId=hive_20200207150101_601d6dbc-99cb-446d-86ac-6f8ce5304681); 
> Time taken: 0.001 seconds
> INFO : OK
> -----------------+
> test.json_data
> -----------------+
> test0
> test1
> test2
> -----------------+
> >> select * from test_id2;
> INFO : Compiling 
> command(queryId=hive_20200207145656_e99d1a0d-ea4c-4636-ae3a-dd930df14644): 
> select * from test_id2
> INFO : Executing 
> command(queryId=hive_20200207145656_e99d1a0d-ea4c-4636-ae3a-dd930df14644): 
> select * from test_id2
> INFO : Completed executing 
> command(queryId=hive_20200207145656_e99d1a0d-ea4c-4636-ae3a-dd930df14644); 
> Time taken: 0.001 seconds
> INFO : OK
> --------------+
> test_id2.id
> --------------+
> 1
> 13
> 14
> --------------+
> >> create table test2 (json_data int)
>  STORED AS TEXTFILE
>  LOCATION 'hdfs://host-10-17-102-132.coe.>ra.com:8020/tmp/test'
>  as SELECT * from test_id;
> INFO : Completed executing 
> command(queryId=hive_20200207150303_cbb57a17-1242-46dc-a98e-addf50f01c5b); 
> Time taken: 13.137 seconds
> INFO : OK
> No rows affected (13.226 seconds)
> SELECT * from test;
> INFO : Compiling 
> command(queryId=hive_20200207150404_d0aabd08-a15f-4e6c-99a3-e607b8a6cfd3): 
> SELECT * from test
> INFO : Executing 
> command(queryId=hive_20200207150404_d0aabd08-a15f-4e6c-99a3-e607b8a6cfd3): 
> SELECT * from test
> INFO : Completed executing 
> command(queryId=hive_20200207150404_d0aabd08-a15f-4e6c-99a3-e607b8a6cfd3); 
> Time taken: 0.001 seconds
> INFO : OK
> -----------------+
> test.json_data
> -----------------+
> 1
> 13
> 14
> -----------------+
> 3 rows selected (0.081 seconds)
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to