Koji Noguchi created PIG-5385:
---------------------------------
Summary: Skip calling extra gc() before spilling large bag when
unnecessary
Key: PIG-5385
URL: https://issues.apache.org/jira/browse/PIG-5385
Project: Pig
Issue Type: Improvement
Reporter: Koji Noguchi
Assignee: Koji Noguchi
To determine when to start spilling bags, Pig listens for memory notifications
of both types, MEMORY_THRESHOLD_EXCEEDED and MEMORY_COLLECTION_THRESHOLD_EXCEEDED.
https://docs.oracle.com/javase/8/docs/api/java/lang/management/MemoryNotificationInfo.html
Since spilling a large bag is expensive, Pig explicitly calls System.gc() before
spilling when the expected size is huge. I think we can skip this step when the
notification is of type MEMORY_COLLECTION_THRESHOLD_EXCEEDED, since that means
the JVM has already run a GC.
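
A rough sketch of the idea, not the actual Pig code or a patch. The class
name SpillGcSketch, the method shouldCallExtraGc, the field
estimatedBytesToSpill and the 100 MB cutoff are all made up for illustration.
Only the MemoryNotificationInfo constants and the NotificationListener
interface come from the JDK.

```java
import java.lang.management.MemoryNotificationInfo;
import javax.management.Notification;
import javax.management.NotificationListener;

// Hypothetical listener; names and the cutoff are assumptions, not Pig's code.
class SpillGcSketch implements NotificationListener {
    // Assumed cutoff above which a spill counts as "huge".
    static final long EXTRA_GC_SIZE_THRESHOLD = 100L * 1024 * 1024;

    // Assumed estimate of how many bytes the pending spill would write.
    private final long estimatedBytesToSpill;

    SpillGcSketch(long estimatedBytesToSpill) {
        this.estimatedBytesToSpill = estimatedBytesToSpill;
    }

    // Decide whether an explicit System.gc() is worth calling before spilling.
    static boolean shouldCallExtraGc(String notificationType, long estimatedBytes) {
        // A collection-threshold notification is emitted after a GC,
        // so an explicit gc() here would be redundant.
        if (MemoryNotificationInfo.MEMORY_COLLECTION_THRESHOLD_EXCEEDED
                .equals(notificationType)) {
            return false;
        }
        // For other notification types, only pay for the extra gc()
        // when the expected spill is large.
        return estimatedBytes > EXTRA_GC_SIZE_THRESHOLD;
    }

    @Override
    public void handleNotification(Notification n, Object handback) {
        if (shouldCallExtraGc(n.getType(), estimatedBytesToSpill)) {
            System.gc();
        }
        // The spill itself would be triggered here.
    }
}
```

Keeping the decision in a small static method makes it easy to check both
notification types without having to trigger real memory pressure.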