Koji Noguchi created PIG-5385:
---------------------------------

             Summary: Skip calling extra gc() before spilling large bag when 
unnecessary
                 Key: PIG-5385
                 URL: https://issues.apache.org/jira/browse/PIG-5385
             Project: Pig
          Issue Type: Improvement
            Reporter: Koji Noguchi
            Assignee: Koji Noguchi


In order to determine when to start spilling bags, Pig uses MemoryNotification 
for both MEMORY_THRESHOLD_EXCEEDED and MEMORY_COLLECTION_THRESHOLD_EXCEEDED.
https://docs.oracle.com/javase/8/docs/api/java/lang/management/MemoryNotificationInfo.html

Since spilling a large bag is expensive, Pig explicitly call System.gc() when 
the expected size is huge.  I think we can skip this step when notification is 
based on MEMORY_COLLECTION_THRESHOLD_EXCEEDED since this means jvm has called 
the gc already.




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to