Re: under cygwin JUST tasktracker run by cyg_server user, Permission denied .....

2011-11-04 Thread Uma Maheswara Rao G 72686
in 205, code is different than trace Which version are you using? I just verified the code in older versions, http://mail-archives.apache.org/mod_mbox/hadoop-common-commits/201109.mbox/%3c20110902221116.d0b192388...@eris.apache.org%3E below is the code snippet. +boolean rv = true; + +

Re: Map-Reduce in memory

2011-11-04 Thread Michel Segel
Hi, First, you have 8 physical cores. Hyper threading makes the machine think that it has 16. The trouble is that you really don't have 16 cores so you need to be a little more conservative. You don't mention HBase, so I'm going to assume that you don't have it installed. So in terms of

Re: someone know how to install hadoop0.20 on hp-ux?

2011-11-04 Thread Joey Echeverria
You need to create a log directory on your TaskTracker nodes: /opt/ecip/BMC/hadoopTest/hadoop-0.20.203.0/logs/ Make sure the directory is writable by the mapred user, or which ever user your TaskTrackers were started as. -Joey On Thu, Nov 3, 2011 at 11:11 PM, Li, Yonggang yongga...@hp.com

mapred.map.tasks getting set, but not sure where

2011-11-04 Thread Brendan W.
Hi, In the jobs running on my cluster of 20 machines, I used to run jobs (via hadoop jar ...) that would spawn around 4000 map tasks. Now when I run the same jobs, that number is 20; and I notice that in the job configuration, the parameter mapred.map.tasks is set to 20, whereas it never used to

Re: mapred.map.tasks getting set, but not sure where

2011-11-04 Thread Robert Evans
What versions of Hadoop were you running with previously, and what version are you running with now? --Bobby Evans On 11/4/11 9:33 AM, Brendan W. bw8...@gmail.com wrote: Hi, In the jobs running on my cluster of 20 machines, I used to run jobs (via hadoop jar ...) that would spawn around 4000

Re: mapred.map.tasks getting set, but not sure where

2011-11-04 Thread Brendan W.
All the same, no change in that...0.20.2. Other people do have access to this system to change things like conf files, but nobody's owning up and I have to figure this out. I have verified that the mapred.map.tasks property is not getting set in the mapred-site.xml files on the cluster or in the

Re: mapred.map.tasks getting set, but not sure where

2011-11-04 Thread Robert Evans
In 0.20.2 The JobClient will update mapred.map.tasks to be equal to the number of splits returned by the InputFormat. The input format will usually take mapred.map.tasks as a recommendation when deciding on what splits to make. That is the only place in the code that I could find that is

Re: mapred.map.tasks getting set, but not sure where

2011-11-04 Thread Brendan W.
Plain Java MR, using the Cassandra inputFormat to read out of Cassandra. Perhaps somebody hacked the inputFormat code on me... But what's weird is that the parameter mapred.map.tasks didn't appear in the job confs before at all. Now it does, with a value of 20 (happens to be the # of machines

Terrasort sends everything to a single reducer–Don't Apologize, David Salle

2011-11-04 Thread W.P. McNeill
I'm trying to run a TeraSort job to confirm that my cluster is set up correctly. The mappers perform fine, but in the reduce stage all the data is sent to a single node. My mapred.reduce.tasks parameter is set to an appropriate value greater than 1. I am launching multiple reducers, but only one

Re: Terrasort sends everything to a single reducer–Don't Apologize, David Salle

2011-11-04 Thread Raj V
Could you pastebin the Terasort configuration xml file? I have run Terasort over 1000 times but I have never seen this problem. How did you generate the data for terasort? using teragen or some other method? Raj Stoser Analytics www.stoser.com From: W.P.

When can I use null for Reducer value?

2011-11-04 Thread Shi Jin
Hi there, I am learning hadoop and looking at the two example Java codes SecondarySort.java and WordCount.java, using the latest stable version 0.20.203.0. One interesting feature I found in the SecondarySort.java code is the use of the null for the value sent by the reducer. The code is copied