[ https://issues.apache.org/jira/browse/HIVE-1665?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12913880#action_12913880 ]
Namit Jain commented on HIVE-1665: ---------------------------------- By default, the scratch dir can be based on date etc. so that it can be easily cleaned up > drop operations may cause file leak > ----------------------------------- > > Key: HIVE-1665 > URL: https://issues.apache.org/jira/browse/HIVE-1665 > Project: Hadoop Hive > Issue Type: Bug > Reporter: He Yongqiang > Assignee: He Yongqiang > > Right now when doing a drop, Hive first drops metadata and then drops the > actual files. If file system is down at that time, the files will keep not > deleted. > Had an offline discussion about this: > to fix this, add a new conf "scratch dir" into hive conf. > when doing a drop operation: > 1) move data to scratch directory > 2) drop metadata > 3) if 2) failed, roll back 1) and report error 3.1 > if 2) succeeded, drop data from scratch directory 3.2 > 4) if 3.2 fails, we are ok because we assume the scratch dir will be emptied > manually. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.