hi ,my hadoop friends:i have the 3 questions about hadoop.there are ....
1 the speed between the datanodes. Tera data in one datanodes , the data
transfers from one datanode to the another datanode. if the speed is bad,
Hadoop will be slow, i think. i heard the gNet architecture in Greenplum ,
then hadoop ? SAS storage + G-Ethernet is best answer, isn't it?
2 the GUI tool there is a hive web tool in hadoop. but it is not enough to
use it for our business work. it is too simple to use it.
if hadoop+hive is designed into DWH. then how to use it for users. by
CGI Tool(Command),? by New Developed webGUITOOL.?
3 5 computers Hadoop cluster and 1 computer SQLSERVER2000 5 computers Hadoop
celeron 2.66G 1G memory Ethernet namenode +
secondarynamenode + 3 datanode 1 computer SQLSERVER2000 celeron 2.66G
1G memory then i did select operation at the same data 100M . 5 computers
Hadoop is 2mins 30secs 1 computer SQLSERVER2000 is 2mins 25secs
the result is that 5 computers Hadoop is not good .why .can anyone give me
some advises.
thanks in adverse.