[ 
https://issues.apache.org/jira/browse/HIVE-6469?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13974665#comment-13974665
 ] 

Ravi Prakash commented on HIVE-6469:
------------------------------------

There may be multiple users in each of those environments. So unless we have an 
isolated "environment" for each user (which is really unmanageable) the global 
settings for one user will affect other users using the same environment. What 
you are suggesting is much coarser granularity and would be ops nightmare if I 
am understanding your solution.

The use case that is being targeted here is that a user may on *1* instance 
choose to drop a (possibly big) table without sending it to Trash to avoid 
filling up her/his quota. We believe that the default Hive behavior of sending 
to Trash should be maintained (to prevent accidental data loss).

It might be worthwhile to be consistent with underlying Hadoop philosophy where 
users wanting to get rid of data (via 'hdfs dfs -rm') can choose whether or not 
to permanently remove that data (with a '-skipTrash') option. You could make 
all the same arguments about individual users caring/not caring about 
controlling this behavior for that case too.

Do you see any of your customers asking for the global config rather than 
on-demand flag? Perhaps that can be a separate JIRA?

> skipTrash option in hive command line
> -------------------------------------
>
>                 Key: HIVE-6469
>                 URL: https://issues.apache.org/jira/browse/HIVE-6469
>             Project: Hive
>          Issue Type: New Feature
>          Components: CLI
>    Affects Versions: 0.12.0
>            Reporter: Jayesh
>             Fix For: 0.12.1
>
>         Attachments: HIVE-6469.patch
>
>
> hive drop table command deletes the data from HDFS warehouse and puts it into 
> Trash.
> Currently there is no way to provide flag to tell warehouse to skip trash 
> while deleting table data.
> This ticket is to add skipTrash feature in hive command-line, that looks as 
> following. 
> hive -e "drop table skipTrash testTable"
> This would be good feature to add, so that user can specify when not to put 
> data into trash directory and thus not to fill hdfs space instead of relying 
> on trash interval and policy configuration to take care of disk filling issue.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to