Re: Environment consideration for a research on scheduling

2011-09-27 Thread Merto Mertek
Desktop edition was chosen just to run the namemode and to monitor cluster statistics. Workernodes were chosen to run on ubuntu server edition because we find this configuration in several research papers. One of such configuration can be found in the paper for LATE scheduler (is maybe some source

Re: Environment consideration for a research on scheduling

2011-09-26 Thread Steve Loughran
On 23/09/11 16:09, GOEKE, MATTHEW (AG/1000) wrote: If you are starting from scratch with no prior Hadoop install experience I would configure stand-alone, migrate to pseudo distributed and then to fully distributed verifying functionality at each step by doing a simple word count run. Also, if

Re: Environment consideration for a research on scheduling

2011-09-24 Thread Merto Mertek
I agree, we will go the standard route. Like you suggested we will go step by step to the full cluster deployment. After the first node configuration we will use clonezilla to replicate it and then setup them one by one.. On the workernodes I was thinking to run ubuntu server, namenode will run u

RE: Environment consideration for a research on scheduling

2011-09-23 Thread GOEKE, MATTHEW (AG/1000)
If you are starting from scratch with no prior Hadoop install experience I would configure stand-alone, migrate to pseudo distributed and then to fully distributed verifying functionality at each step by doing a simple word count run. Also, if you don't mind using the CDH distribution then SCM /