Re: Map parallelism

2010-12-15 Thread Charles W
> On Tue, Dec 14, 2010 at 8:10 PM, Charles W wrote:
> > Hi,
> >
> > I have a question about map parallelism in Pig.
> >
> > I am using Pig to stream a file through a Python script that performs some
> > computationally expensive transforms. This pr

Re: Map parallelism

2010-12-14 Thread Dmitriy Ryaboy
Try set mapred.max.split.size $desired_split_size

-D

On Tue, Dec 14, 2010 at 8:10 PM, Charles W wrote:
> Hi,
>
> I have a question about map parallelism in Pig.
>
> I am using Pig to stream a file through a Python script that performs some
> computationally expensive transf
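
A rough sketch of why a smaller mapred.max.split.size can spread the work over more map tasks. It assumes the split-size rule used by Hadoop's FileInputFormat, max(min_size, min(max_size, block_size)), and ignores Hadoop's slack on the last split. The function names and the sizes below are illustrative, not from the thread, and whether a given Pig loader honours the setting is also an assumption.

```python
MB = 1024 * 1024


def split_size(block_size, max_size, min_size=1):
    """Split size under the assumed rule: max(min, min(max, block))."""
    return max(min_size, min(max_size, block_size))


def estimated_map_tasks(file_size, block_size, max_size, min_size=1):
    """Estimate the map task count for one file (integer ceiling division)."""
    size = split_size(block_size, max_size, min_size)
    return -(-file_size // size)


# Hypothetical 100 MB input stored with a 128 MB block size.
unlimited = 2**63 - 1  # stands in for "no maximum configured"
one_task = estimated_map_tasks(100 * MB, 128 * MB, unlimited)
ten_tasks = estimated_map_tasks(100 * MB, 128 * MB, 10 * MB)
```

Under these assumptions the file goes to a single map task when no maximum is set, and to about ten tasks once the maximum split size is capped at 10 MB.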

Map parallelism

2010-12-14 Thread Charles W
Hi,

I have a question about map parallelism in Pig.

I am using Pig to stream a file through a Python script that performs some computationally expensive transforms. This process is assigned to a single map task that can take a very long time if it happens to execute on one of the weaker nodes in