Re: Idle tasktracker eats CPU

2014-06-09 Thread Dmitry Sivachenko
On June 10, 2014, at 5:58, Serge Blazhievsky wrote: > There are ongoing pings to jobtracker > > But 10% seems high Sometimes it is 10%, sometimes it is 200+%. Different nodes, different numbers. So pings can't explain that. > > Sent from my iPhone > >> On Jun 9, 2014, at 2:15, Dmitry

Re: Idle tasktracker eats CPU

2014-06-09 Thread Serge Blazhievsky
There are ongoing pings to the jobtracker, but 10% seems high. Sent from my iPhone > On Jun 9, 2014, at 2:15, Dmitry Sivachenko wrote: > > Hello! > > I set up hadoop-1.2.1 on FreeBSD-10/stable with openjdk version 1.7.0_60. > > On the first glance it is doing well except one annoying thing: afte

From Traditional BI approach to Hadoop ecosystem

2014-06-09 Thread santosh.viswanathan
Hello Experts, What are the areas to be considered when moving a traditional BI setup that involves ETL loads (tool used: Informatica) to the Hadoop ecosystem? Thanks, Santosh Karthikeyan This message is for the designated recipient only and may contain privileged,

Re: priority in the container request

2014-06-09 Thread Vinod Kumar Vavilapalli
Yes, priorities are assigned to ResourceRequests, and you can ask for multiple containers at the same priority level. You may not get all the containers together, as today's scheduler lacks gang functionality. +Vinod On Jun 9, 2014, at 12:08 AM, Krishna Kishore Bonagiri wrote: > Hi, > > Can we

Re: Counters in MapReduce

2014-06-09 Thread Krishna Kumar
You should use FileStatus to decide what files you want to include in the InputPath, and use the FileSystem class to delete or process the intermediate / final paths. Moving each job in your iteration logic into different methods would help keep things simple. From: unmesha sreeveni mailto:u

Re: Counters in MapReduce

2014-06-09 Thread unmesha sreeveni
OK, I will check out counters. After the first iteration, the input file to job 1 will be the output file of job 3. How do I specify that? *In order to satisfy two conditions:* First iteration: the user's input file. After the first iteration: job 3's output file as job 1's input. > -- > *Thanks & Regards * > >

Re: Counters in MapReduce

2014-06-09 Thread Kai Voigt
Like you said, just wrap your 3 jobs in a while loop and check the built-in counters, such as the number of reduce output records, to see whether the job output was empty. Unfortunately, Oozie cannot do iterations or loops of jobs, as it only supports DAGs. Kai On 09.06.2014 at 10:31, unmes
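
The while-loop idea above could look roughly like the following. This is only a sketch under stated assumptions: `IterativeDriver`, `PassResult` and `runThreeJobs` are hypothetical names, not Hadoop API, and the three MapReduce jobs are replaced by a plain function so the loop logic can be run on its own (Java 8 or later for the lambda). In a real driver, the function body would configure and run job1, job2 and job3, then read the reduce-output-records counter from job3 after waitForCompletion returns.

```java
import java.util.function.Function;

/** Sketch of an iterative driver loop; all names here are illustrative. */
class IterativeDriver {

    /** Outcome of one pass through job1 -> job2 -> job3 (hypothetical type). */
    static final class PassResult {
        final String outputPath;         // where job3 wrote its output
        final long reduceOutputRecords;  // job3's reduce output record count

        PassResult(String outputPath, long reduceOutputRecords) {
            this.outputPath = outputPath;
            this.reduceOutputRecords = reduceOutputRecords;
        }
    }

    /**
     * Runs the three-job pass repeatedly. The first pass reads the user's
     * input; every later pass reads the previous pass's job3 output.
     * Stops when a pass produces no reduce output, or after maxIterations.
     * Returns the number of passes executed.
     */
    static int runUntilEmpty(String userInput,
                             Function<String, PassResult> runThreeJobs,
                             int maxIterations) {
        String input = userInput;
        int iterations = 0;
        while (iterations < maxIterations) {
            PassResult result = runThreeJobs.apply(input);
            iterations++;
            if (result.reduceOutputRecords == 0) {
                break;                   // empty output: nothing left to iterate on
            }
            input = result.outputPath;   // job3's output feeds the next job1
        }
        return iterations;
    }

    public static void main(String[] args) {
        // Stand-in pipeline: reports 2, then 1, then 0 output records.
        long[] remaining = {2};
        int passes = runUntilEmpty(
                "user-input",
                in -> new PassResult(in + "/next", remaining[0]--),
                10);
        System.out.println("passes executed: " + passes);
    }
}
```

The `maxIterations` cap is an addition of this sketch, not something from the thread; it guards against a loop that never converges to empty output.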

Counters in MapReduce

2014-06-09 Thread unmesha sreeveni
I am trying to do iteration with MapReduce. I have three jobs running in sequence: *//job1 configuration* *FileInputFormat.addInputPath(job1,new Path(args[0]));* *FileOutputFormat.setOutputPath(job1,out1);* *job1.waitForCompletion(true);* *job2 configuration* *FileInputFormat.addInputPath(job2,out1);* *F

Idle tasktracker eats CPU

2014-06-09 Thread Dmitry Sivachenko
Hello! I set up hadoop-1.2.1 on FreeBSD-10/stable with openjdk version 1.7.0_60. At first glance it is doing well, except for one annoying thing: after executing some tasks, the tasktracker process starts to eat CPU when idle. Sometimes it is 10-20% (numbers from top(1) output), sometimes it is 100-

priority in the container request

2014-06-09 Thread Krishna Kishore Bonagiri
Hi, Can we give the same priority value when requesting multiple containers from the Application Master? Basically, I need all of those containers at the same time, and I am requesting them at the same time, so I am wondering whether that is possible. Thanks, Kishore