[
https://issues.apache.org/jira/browse/PIG-5390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16878175#comment-16878175
]
Koji Noguchi commented on PIG-5390:
-----------------------------------
These self-spilling bags were added in pig 0.6 from PIG-1037, and the
regression of adding them to SpillableMemoryManager was probably added in pig
0.11 from PIG-2923.
> Possible race condition from Self-spilling bags registering with
> SpillableMemoryManager
> ----------------------------------------------------------------------------------------
>
> Key: PIG-5390
> URL: https://issues.apache.org/jira/browse/PIG-5390
> Project: Pig
> Issue Type: Bug
> Reporter: Koji Noguchi
> Assignee: Koji Noguchi
> Priority: Major
>
> This is a follow up from PIG-5380 where [~rohini] pointed out
> {quote}
> I think same change is required in InternalSortedBag as well as code is
> exactly same and it can spill too -
> https://github.com/apache/pig/blob/trunk/src/org/apache/pig/data/InternalSortedBag.java#L133
> . We most likely haven't seen issues with it as the probability could be
> very less as it will proactively spill if it exceeds cached memory limit.
> {quote}
> Looking at the history and the source, this is a critical bug given all these
> self-spilling bags are designed on the premise that no other threads would
> touch them. Comment in the source clearly say
> {code}
> * This bag is not registered with SpillableMemoryManager. It calculates
> * the number of tuples to hold in memory and spill pro-actively into files."
> {code}
>
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)