when you get ~3.5 (84/24) times less data on each reducer by increasing their
number from 24 to 84. That's good. Sometimes the data is skewed, though, and
simply bumping the number of reducers doesn't help.
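
To make the skew point concrete, here is a minimal sketch in plain Python (not Pig). It assumes records are assigned to reducers by key modulo the reducer count, which is only a simplified stand-in for real hash partitioning. The data sets and the helper name `reducer_loads` are made up for illustration.

```python
from collections import Counter

def reducer_loads(keys, num_reducers):
    """Count how many records each reducer receives under modulo partitioning."""
    loads = Counter(key % num_reducers for key in keys)
    return [loads.get(r, 0) for r in range(num_reducers)]

# Uniform case (hypothetical data): 168,000 records, every key distinct.
uniform = list(range(168_000))
# Skewed case (hypothetical data): half of all records share one hot key, 0.
skewed = [0] * 84_000 + list(range(84_000))

for name, keys in (("uniform", uniform), ("skewed", skewed)):
    for n in (24, 84):
        # The busiest reducer determines how long the reduce phase takes.
        print(name, n, "max records on one reducer:", max(reducer_loads(keys, n)))
```

With these made-up numbers the busiest reducer in the uniform case goes from 7000 to 2000 records (84/24 = 3.5 times less), while in the skewed case it only goes from 87500 to 85000, because every record with the hot key still lands on the same reducer.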
2014-04-15 16:41 GMT+04:00 leiwang...@gmail.com :
> I can fix this by changing heap size.
> But what confuses me is
Hi,
When I run the following:
REGISTER /home/hduser/Documents/ccc/Research/phd/code/ECentre/scripts/bags.py
USING streaming_python AS bp;
raw = LOAD 'hdfs:///user/hduser/smsCorpus_en_2012.04.30_all.xml' AS
(line:chararray);
b = foreach raw generate bp.enumerate_bag(line);
I get
Failed