[jira] Updated: (HIVE-1655) Adding consistency check at jobClose() when committing dynamic partitions

2010-09-21 Thread Namit Jain (JIRA)

 [ https://issues.apache.org/jira/browse/HIVE-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Namit Jain updated HIVE-1655:
-

   Status: Resolved  (was: Patch Available)
 Hadoop Flags: [Reviewed]
Fix Version/s: 0.7.0
   Resolution: Fixed

Committed. Thanks, Ning.

 Adding consistency check at jobClose() when committing dynamic partitions
 -

 Key: HIVE-1655
 URL: https://issues.apache.org/jira/browse/HIVE-1655
 Project: Hadoop Hive
  Issue Type: Improvement
Reporter: Ning Zhang
Assignee: Ning Zhang
 Fix For: 0.7.0

 Attachments: HIVE-1655.patch


 In a dynamic partition insert, FileSinkOperator generates a directory for 
 each new partition, and the files in that directory are named '_tmp*'. 
 When a task succeeds, its file is renamed to remove the _tmp, which 
 essentially implements the commit semantics. Many failures (a killed 
 process, a dead machine, etc.) can leave _tmp files behind in the DP 
 directory. These _tmp files should be deleted (rolled back) at successful 
 jobClose(). After the deletion, any directories left empty should also be 
 deleted.
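
 The rename-to-commit and cleanup steps described above can be sketched 
 roughly as follows. This is only an illustration on a local filesystem and 
 is not the code in HIVE-1655.patch: the function names commit_file and 
 cleanup_after_job, the exact '_tmp' + name file naming, and the use of 
 Python's os module in place of the Hadoop FileSystem API are all assumptions 
 made for the example.

```python
import os
import tempfile

TMP_PREFIX = "_tmp"  # assumed marker for files that were never committed


def commit_file(tmp_path):
    """Commit one task output by renaming '_tmp<name>' to '<name>' (hypothetical helper)."""
    directory, name = os.path.split(tmp_path)
    if not name.startswith(TMP_PREFIX):
        raise ValueError(f"not a temporary file: {tmp_path}")
    final_path = os.path.join(directory, name[len(TMP_PREFIX):])
    os.rename(tmp_path, final_path)
    return final_path


def cleanup_after_job(root):
    """Delete leftover '_tmp*' files under root, then remove directories left empty."""
    removed_files, removed_dirs = [], []
    # Walk bottom-up so a directory is checked only after its children are handled.
    for dirpath, _dirnames, filenames in os.walk(root, topdown=False):
        for name in filenames:
            if name.startswith(TMP_PREFIX):
                path = os.path.join(dirpath, name)
                os.remove(path)
                removed_files.append(path)
        # Keep the root directory itself, even if it ends up empty.
        if dirpath != root and not os.listdir(dirpath):
            os.rmdir(dirpath)
            removed_dirs.append(dirpath)
    return removed_files, removed_dirs


if __name__ == "__main__":
    root = tempfile.mkdtemp()
    part = os.path.join(root, "ds=2010-09-17")
    os.makedirs(part)
    open(os.path.join(part, "_tmp000000_0"), "w").close()  # task that never committed
    print(cleanup_after_job(root))
```

 Walking bottom-up means a partition directory whose only contents were 
 uncommitted files is removed in the same pass that deletes those files.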

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.



[jira] Updated: (HIVE-1655) Adding consistency check at jobClose() when committing dynamic partitions

2010-09-17 Thread Ning Zhang (JIRA)

 [ https://issues.apache.org/jira/browse/HIVE-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ning Zhang updated HIVE-1655:
-

Attachment: HIVE-1655.patch




[jira] Updated: (HIVE-1655) Adding consistency check at jobClose() when committing dynamic partitions

2010-09-17 Thread Ning Zhang (JIRA)

 [ https://issues.apache.org/jira/browse/HIVE-1655?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ning Zhang updated HIVE-1655:
-

Status: Patch Available  (was: Open)
