running multiple jobs, please help

2012-01-01 Thread Shaojun Zhao
Dear all, I have many jobs (900k) to run on many machines (4k) . All jobs are independent, particularly, they use the same algorithm, but the input is different. If I could build a single cluster with 4k machines, I can simple submit all my jobs using a shell script. Critically, the jobs will exec

Re: specify different number of mapper tasks for different machines

2010-08-30 Thread Shaojun Zhao
I beleive what Allen and Ted said, but so far, I did not try it out. -Sam On Mon, Aug 30, 2010 at 4:42 AM, Vitaliy Semochkin wrote: > Hi, > > Have you find the way to set different amount of mappers/reducers on a > particular node? > > On Wed, Jul 14, 2010 at 10:50 PM, Shaojun

specify different number of mapper tasks for different machines

2010-07-14 Thread Shaojun Zhao
Hi, I am running mapreduce on 5 machines, where I have 8 cores for 3 of them, but 2 cores for 2 of them, and the 8 core machines are more powerful (faster, more mem, more disk). Currently, I am using only the 3 machines (each with 8 cores), and the max number of mapper tasks is 8. I may use one o