Hi, in the first phase we are planning to establish a small cluster with few commodity computer (each 1GB, 200GB,..). Cluster would run ubuntu server 10.10 and a hadoop build from the branch 0.20.204 (i had some issues with version 0.20.203 with missing libraries<http://hadoop-common.472056.n3.nabble.com/Development-enviroment-problems-eclipse-hadoop-0-20-203-td3186022.html#a3188567>). Would you suggest any other version?
In the second phase we are planning to analyse, test and modify some of hadoop schedulers. Now I am interested what is the best way to deploy ubuntu and hadop to this few machine. I was thinking to configure the system in the local VM and then converting it to each physical machine but probably this is not the best option. If you know any other please share.. Thanks you!