Ok,
So I tried putting the following config in the mapred-site.xml of all of my
nodes

<configuration>
  <property>
    <name>mapred.job.tracker</name>
    <value>name-node:54311</value>
  </property>
  <property>
    <name>mapred.map.tasks</name>
    <value>7</value>
  </property>
  <property>
    <name>mapred.reduce.tasks</name>
    <value>1</value>
  </property>
  <property>
    <name>mapred.tasktracker.map.tasks.maximum</name>
    <value>7</value>
  </property>
  <property>
    <name>mapred.tasktracker.reduce.tasks.maximum</name>
    <value>1</value>
  </property>
</configuration>

but when I start a new job it gets stuck at

11/06/28 03:04:47 INFO mapred.JobClient:  map 0% reduce 0%

Any thoughts?
Thanks for your help guys!

On Mon, Jun 27, 2011 at 7:33 PM, Juan P. <gordoslo...@gmail.com> wrote:

> Matt,
> Thanks for your help!
> I think I get it now, but this part is a bit confusing:
> *
> *
> *so: tasktracker/datanode and 6 slots left. How you break it up from there
> is your call but I would suggest either 4 mappers / 2 reducers or 5 mappers
> / 1 reducer.*
> *
> *
> If it's 2 processes per core, then it's: 4 Nodes * 4 Cores/Node * 2
> Processes/Core = 32 Processes Total
>
> So my configuration mapred-site.xml should include these props:
>
> *<property>*
> *  <name>mapred.map.tasks</name>*
> *  <value>28</value>*
> *</property>*
> *<property>*
> *  <name>mapred.reduce.tasks</name>*
> *  <value>4</value>*
> *</property>*
> *
> *
>
> Is that correct?
>
> On Mon, Jun 27, 2011 at 4:59 PM, GOEKE, MATTHEW (AG/1000) <
> matthew.go...@monsanto.com> wrote:
>
>> If you are running default configurations then you are only getting 2
>> mappers and 1 reducer per node. The rule of thumb I have gone on (and back
>> up by the definitive guide) is 2 processes per core so: tasktracker/datanode
>> and 6 slots left. How you break it up from there is your call but I would
>> suggest either 4 mappers / 2 reducers or 5 mappers / 1 reducer.
>>
>> Check out the below configs for details on what you are *most likely*
>> running currently:
>> http://hadoop.apache.org/common/docs/r0.20.2/mapred-default.html
>> http://hadoop.apache.org/common/docs/r0.20.2/hdfs-default.html
>> http://hadoop.apache.org/common/docs/r0.20.2/core-default.html
>>
>> HTH,
>> Matt
>>
>> -----Original Message-----
>> From: Juan P. [mailto:gordoslo...@gmail.com]
>> Sent: Monday, June 27, 2011 2:50 PM
>> To: common-user@hadoop.apache.org
>> Subject: Performance Tunning
>>
>> I'm trying to run a MapReduce task against a cluster of 4 DataNodes with 4
>> cores each.
>> My input data is 4GB in size and it's split into 100MB files. Current
>> configuration is default so block size is 64MB.
>>
>> If I understand it correctly Hadoop should be running 64 Mappers to
>> process
>> the data.
>>
>> I'm running a simple data counting MapReduce and it's taking about 30mins
>> to
>> complete. This seems like way too much, doesn't it?
>> Is there any tunning you guys would recommend to try and see an
>> improvement
>> in performance?
>>
>> Thanks,
>> Pony
>> This e-mail message may contain privileged and/or confidential
>> information, and is intended to be received only by persons entitled
>> to receive such information. If you have received this e-mail in error,
>> please notify the sender immediately. Please delete it and
>> all attachments from any servers, hard drives or any other media. Other
>> use of this e-mail by you is strictly prohibited.
>>
>> All e-mails and attachments sent and received are subject to monitoring,
>> reading and archival by Monsanto, including its
>> subsidiaries. The recipient of this e-mail is solely responsible for
>> checking for the presence of "Viruses" or other "Malware".
>> Monsanto, along with its subsidiaries, accepts no liability for any damage
>> caused by any such code transmitted by or accompanying
>> this e-mail or any attachment.
>>
>>
>> The information contained in this email may be subject to the export
>> control laws and regulations of the United States, potentially
>> including but not limited to the Export Administration Regulations (EAR)
>> and sanctions regulations issued by the U.S. Department of
>> Treasury, Office of Foreign Asset Controls (OFAC).  As a recipient of this
>> information you are obligated to comply with all
>> applicable U.S. export laws and regulations.
>>
>>
>

Reply via email to