RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-29 Thread Stuti Awasthi
Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? I faced this problem too. Split the seq file in which ur data is there into Multiple files. Then run the matrix multiplication with the folder as input . If the folder contains N sequence files, N mappers will be created. On Monday

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-29 Thread Stuti Awasthi
[mailto:satish.bigd...@gmail.com] Sent: Tuesday, January 29, 2013 7:27 PM To: user@mahout.apache.org Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? I think I was able to create multiple reducers by setting this property mapred.reduce.tasks = 10 in the MR code. Try setting this. If it does

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-28 Thread Stuti Awasthi
User List Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? Mappers are usually extremely fast since they start themselves on top of the data and their job is usually just parsing and emitting key value pairs. Hadoop's choices are usually fine. If not it is usually because the mapper

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-28 Thread Sean Owen
List Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? Mappers are usually extremely fast since they start themselves on top of the data and their job is usually just parsing and emitting key value pairs. Hadoop's choices are usually fine. If not it is usually because the mapper

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-28 Thread satish verma
User List Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? Mappers are usually extremely fast since they start themselves on top of the data and their job is usually just parsing and emitting key value pairs. Hadoop's choices are usually fine. If not it is usually because

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-23 Thread Jonas Grote
[mailto:paliwalash...@gmail.com] Sent: Wednesday, January 16, 2013 5:44 PM To: user@mahout.apache.org Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? MatrixMultiplicationJob internally sets InputFormat as CompositeInputFormat JobConf conf = new JobConf(initialConf

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-23 Thread Sean Owen
- From: Ashish [mailto:paliwalash...@gmail.com] Sent: Wednesday, January 16, 2013 5:44 PM To: user@mahout.apache.org Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? MatrixMultiplicationJob internally sets InputFormat as CompositeInputFormat JobConf conf

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Stuti Awasthi
MapTask started. Please Suggest -Original Message- From: Sean Owen [mailto:sro...@gmail.com] Sent: Wednesday, January 16, 2013 1:23 PM To: Mahout User List Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? It's up to Hadoop in the end. Try calling

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Stuti Awasthi
, January 16, 2013 2:55 PM To: Mahout User List Subject: RE: MatrixMultiplicationJob runs with 1 mapper only ? Hey Sean, Thanks for response. MatrixMultiplicationJob help shows the usage like : usage: command [Generic Options] [Job-Specific Options] Here Generic Option can be provided by -D property

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Sean Owen
(mapred.max.split.size,10485760); a.setConf(conf); b.setConf(conf); a.times(b); Where Im going wrong. Any idea ? Thanks Stuti -Original Message- From: Stuti Awasthi Sent: Wednesday, January 16, 2013 2:55 PM To: Mahout User List Subject: RE: MatrixMultiplicationJob runs with 1 mapper only ? Hey

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Stuti Awasthi
- From: Sean Owen [mailto:sro...@gmail.com] Sent: Wednesday, January 16, 2013 4:46 PM To: Mahout User List Subject: RE: MatrixMultiplicationJob runs with 1 mapper only ? Why do you need multiple mappers? Is one too slow? Many are not necessarily faster for small input On Jan 16, 2013 10:46 AM, Stuti

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Ashish
List Subject: RE: MatrixMultiplicationJob runs with 1 mapper only ? Why do you need multiple mappers? Is one too slow? Many are not necessarily faster for small input On Jan 16, 2013 10:46 AM, Stuti Awasthi stutiawas...@hcl.com wrote: Hi, I tried to call programmatically also but facing

RE: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Stuti Awasthi
. Im not sure if output will be correct . -Original Message- From: Ashish [mailto:paliwalash...@gmail.com] Sent: Wednesday, January 16, 2013 5:44 PM To: user@mahout.apache.org Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? MatrixMultiplicationJob internally sets

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-16 Thread Ashish
if output will be correct . -Original Message- From: Ashish [mailto:paliwalash...@gmail.com] Sent: Wednesday, January 16, 2013 5:44 PM To: user@mahout.apache.org Subject: Re: MatrixMultiplicationJob runs with 1 mapper only ? MatrixMultiplicationJob internally sets InputFormat

Re: MatrixMultiplicationJob runs with 1 mapper only ?

2013-01-15 Thread Sean Owen
It's up to Hadoop in the end. Try calling FileInputFormat.setMaxInputSplitSize() with a smallish value, like your 10MB (1000). I don't know if Hadoop params can be set as sys properties like that anyway? On Wed, Jan 16, 2013 at 7:48 AM, Stuti Awasthi stutiawas...@hcl.com wrote: Hi, I am