Maps are failing if combiner is enabled
---------------------------------------
Key: PIG-1803
URL: https://issues.apache.org/jira/browse/PIG-1803
Project: Pig
Issue Type: Bug
Reporter: Alex Rovner
Fix For: 0.7.0
We are constantly hitting the java heap space memory issue if the combiner is
enabled on our jobs.
Configs:
pig.cachedbag.memusage=20
io.sort.mb=300
pig.exec.nocombiner=false
mapred.child.java.opts=-Xmx750m
Sample job:
{noformat}
A = LOAD '$INPUT' USING
com.contextweb.pig.CWHeaderLoader('$WORK_DIR/schema/rpt.xml');
AA = foreach A GENERATE checkPointStart, PublisherId, TagId,
ContextCategoryId,Impressions, Clicks, Actions;
DESCRIBE AA;
B = GROUP AA BY (checkPointStart, PublisherId, TagId,
ContextCategoryId);
result = FOREACH B GENERATE group, SUM(AA.Impressions) as Impressions,
SUM(AA.Clicks) as Clicks, SUM(AA.Actions) as Actions;
DESCRIBE result;
STORE result INTO '$OUTPUT' USING com.contextweb.pig.CWHeaderStore();
{noformat}
Mapper Error Log:
2011-01-12 18:43:22,084 FATAL org.apache.hadoop.mapred.Child: Error running
child : java.lang.OutOfMemoryError: Java heap space
at
org.apache.hadoop.mapred.MapTask$MapOutputBuffer.<init>(MapTask.java:799)
at
org.apache.hadoop.mapred.MapTask$NewOutputCollector.<init>(MapTask.java:549)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:631)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:315)
at org.apache.hadoop.mapred.Child$4.run(Child.java:217)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1063)
at org.apache.hadoop.mapred.Child.main(Child.java:211)
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.