[ 
https://issues.apache.org/jira/browse/HIVE-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13976297#comment-13976297
 ] 

Jayesh commented on HIVE-6469:
------------------------------

Xuefu,

This is really miner convenient feature which definitely has a use-case for our 
enterprise customer.
are you suggesting providing this feature via hive configuration that works in 
following way ?

set hive.warehouse.data.skipTrash = true                -- explicitly set
drop table large10TBTable                                                       
-- this will skip trash
drop table anyOtherTable                                                        
-- this will skip trash
set hive.warehouse.data.skipTrash = false               -- if you forget this, 
it will skipTrash forever, until corrected.
drop table regularTable                                 -- this will start 
placing data in trash

I believe that approach is not very intuitive and will lead to human error that 
creates disaster if necessary steps are not done, which ultimately violates 
hive feature of providing trash as backup.  

Also, different environment with different HS2 instance may not be the scenario 
here. This has proven to be very helpful on same environment by different users.

Also, I dont think this pollute SQL Syntax, think of this as PURGE option in 
Oracle DB and hence I totally see use it being used by enterprise customer.
http://docs.oracle.com/cd/B19306_01/server.102/b14200/statements_9003.htm

Did you get a chance to look at the links I put earlier, where people seen to 
be searching for this little convenient feature ?
Also did you get a chance to talk to any customers who would like such feature? 
Please let us know.

Thanks
Jayesh

> skipTrash option in hive command line
> -------------------------------------
>
>                 Key: HIVE-6469
>                 URL: https://issues.apache.org/jira/browse/HIVE-6469
>             Project: Hive
>          Issue Type: New Feature
>          Components: CLI
>    Affects Versions: 0.12.0
>            Reporter: Jayesh
>             Fix For: 0.12.1
>
>         Attachments: HIVE-6469.patch
>
>
> hive drop table command deletes the data from HDFS warehouse and puts it into 
> Trash.
> Currently there is no way to provide flag to tell warehouse to skip trash 
> while deleting table data.
> This ticket is to add skipTrash feature in hive command-line, that looks as 
> following. 
> hive -e "drop table skipTrash testTable"
> This would be good feature to add, so that user can specify when not to put 
> data into trash directory and thus not to fill hdfs space instead of relying 
> on trash interval and policy configuration to take care of disk filling issue.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to