Zsolt Miskolczi created HIVE-27637:
--------------------------------------

             Summary: Compare highest write ID of compaction records when 
trying to perform abort cleanup
                 Key: HIVE-27637
                 URL: https://issues.apache.org/jira/browse/HIVE-27637
             Project: Hive
          Issue Type: Task
          Components: Hive
            Reporter: Zsolt Miskolczi
            Assignee: Zsolt Miskolczi


Compare highest write ID of compaction records when trying to get the potential 
table/partitions for abort cleanup.

Idea: If there exists a highest write ID of a record in COMPACTION_QUEUE for a 
table/partition which is greater than the max(aborted write ID) for that 
table/partition, then we can potentially ignore abort cleanup for such 
tables/partitions. This is because compaction will perform cleanup of obsolete 
deltas and aborted deltas hence doing abort cleanup is redundant here.

This is more of an optimisation since it can potentially save some filesystem 
operations (mainly file-listing during construction of Acid state).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to