(Phones are fun) With your setting of -Xmx512m for "mapred.child.java.opts" you don't have enough RAM for 4 reduce tasks. If you have single core Celerons, you also don't have enough CPU cores to run all four.
You might also try kicking -Xmx512m down to -Xmx256m and see if things run OK. Rick On May 2, 2011 9:18 AM, "Richard Nadeau" <strout...@gmail.com> wrote: > I would change "mapred.tasktracker.reduce.tasks.maximum" to one. With your > setting > > On May 2, 2011 8:48 AM, "baran cakici" <barancak...@gmail.com> wrote: >> without job; >> >> CPU Usage = 0% >> Memory = 585 MB (2GB Ram) >> >> Baran >> 2011/5/2 baran cakici <barancak...@gmail.com> >> >>> CPU Usage = 95-100% >>> Memory = 650-850 MB (2GB Ram) >>> >>> Baran >>> >>> >>> 2011/5/2 James Seigel <ja...@tynt.com> >>> >>>> If you have windows and cygwin you probably don't have a lot if memory >>>> left at 2 gig. >>>> >>>> Pull up system monitor on the data nodes and check for free memory >>>> when you have you jobs running. I bet it is quite low. >>>> >>>> I am not a windows guy so I can't take you much farther. >>>> >>>> James >>>> >>>> Sent from my mobile. Please excuse the typos. >>>> >>>> On 2011-05-02, at 8:32 AM, baran cakici <barancak...@gmail.com> wrote: >>>> >>>> > yes, I am running under cygwin on my datanodes too. OS of Datanodes > are >>>> > Windows as well. >>>> > >>>> > What can I do exactly for a better Performance. I changed >>>> > mapred.child.java.opts to default value.How can I solve this > "swapping" >>>> > problem? >>>> > >>>> > PS: I dont have a chance to get Slaves(Celeron 2GHz) with Liniux OS. >>>> > >>>> > thanks, both of you >>>> > >>>> > Regards, >>>> > >>>> > Baran >>>> > 2011/5/2 Richard Nadeau <strout...@gmail.com> >>>> > >>>> >> Are you running under cygwin on your data nodes as well? That is >>>> certain to >>>> >> cause performance problems. As James suggested, swapping to disk is >>>> going >>>> >> to >>>> >> be a killer, running on Windows with Celeron processors only > compounds >>>> the >>>> >> problem. The Celeron processor is also sub-optimal for CPU intensive >>>> tasks >>>> >> >>>> >> Rick >>>> >> >>>> >> On Apr 28, 2011 9:22 AM, "baran cakici" <barancak...@gmail.com> > wrote: >>>> >>> Hi Everyone, >>>> >>> >>>> >>> I have a Cluster with one Master(JobTracker and NameNode - Intel >>>> Core2Duo >>>> >> 2 >>>> >>> GB Ram) and four Slaves(Datanode and Tasktracker - Celeron 2 GB > Ram). >>>> My >>>> >>> Inputdata are between 2GB-10GB and I read Inputdata in MapReduce > line >>>> by >>>> >>> line. Now, I try to accelerate my System(Benchmark), but I'm not > sure, >>>> if >>>> >> my >>>> >>> Configuration is correctly. Can you please just look, if it is ok? >>>> >>> >>>> >>> -mapred-site.xml >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.job.tracker</name> >>>> >>> <value>apple:9001</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.child.java.opts</name> >>>> >>> <value>-Xmx512m -server</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.job.tracker.handler.count</name> >>>> >>> <value>2</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.local.dir</name> >>>> >>> >>>> >> >>>> > <value>/cygwin/usr/local/hadoop-datastore/hadoop-Baran/mapred/local</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.map.tasks</name> >>>> >>> <value>1</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.reduce.tasks</name> >>>> >>> <value>4</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.submit.replication</name> >>>> >>> <value>2</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.system.dir</name> >>>> >>> >>>> >> >>>> >> >>>> > <value>/cygwin/usr/local/hadoop-datastore/hadoop-Baran/mapred/system</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.tasktracker.indexcache.mb</name> >>>> >>> <value>10</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.tasktracker.map.tasks.maximum</name> >>>> >>> <value>1</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.tasktracker.reduce.tasks.maximum</name> >>>> >>> <value>4</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.temp.dir</name> >>>> >>> >>>> >> >>>> > <value>/cygwin/usr/local/hadoop-datastore/hadoop-Baran/mapred/temp</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>webinterface.private.actions</name> >>>> >>> <value>true</value> >>>> >>> </property> >>>> >>> >>>> >>> <property> >>>> >>> <name>mapred.reduce.slowstart.completed.maps</name> >>>> >>> <value>0.01</value> >>>> >>> </property> >>>> >>> >>>> >>> -hdfs-site.xml >>>> >>> >>>> >>> <property> >>>> >>> <name>dfs.block.size</name> >>>> >>> <value>268435456</value> >>>> >>> </property> >>>> >>> PS: I extended dfs.block.size, because I won 50% better performance >>>> with >>>> >>> this change. >>>> >>> >>>> >>> I am waiting for your comments... >>>> >>> >>>> >>> Regards, >>>> >>> >>>> >>> Baran >>>> >> >>>> >>> >>>