Re: the question of hadoop

Steve Loughran Mon, 06 Sep 2010 03:15:18 -0700

On 06/09/10 09:32, 褚 鵬兵 wrote:


hi ,my hadoop friends:i have the 3 questions about hadoop.there are ....

1 the speed between the datanodes.   Tera data in one datanodes ,   the data  
transfers from one datanode to the another datanode.   if the speed  is bad, 
Hadoop will be slow, i think.   i heard the gNet architecture in Greenplum ,  
then hadoop ?  SAS storage + G-Ethernet is best answer, isn't it?

if your code has locality gigabit ether is fine, saves the hassle ofgetting faster stuff to work. Have you ever tried to debug infinibandcluster problems?

2 the GUI tool   there is a hive web tool in hadoop.   but it is not enough to 
use it for our business work.   it is too simple to use it.
    if hadoop+hive is designed into DWH.   then how to use it for users.   by 
CGI Tool(Command),?   by New Developed webGUITOOL.?

the community welcomes new contributions. I'd look at cascading,datameeer's stuff, and other things. Hive is designed for people whoknow SQL, like PHP developers.

3 5 computers Hadoop cluster and 1 computer SQLSERVER2000   5 computers Hadoop  
    celeron 2.66G      1G memory      Ethernet      namenode + 
secondarynamenode + 3 datanode   1 computer SQLSERVER2000      celeron 2.66G    
  1G memory  then i did select operation at the same data 100M .    5 computers 
Hadoop  is 2mins 30secs   1 computer SQLSERVER2000  is 2mins 25secs
the result is that  5 computers Hadoop is not good .why .can anyone give me 
some advises.
thanks in adverse.

Indexes give RBMS speed, but limit their scale. If your dataset fitsonto a single mssql or mysql and you can afford the index costs, staywith that data in a RAID array. Hadoop isn't trying to compete in thatspace -though things like CouchDB are trying to

However, before you dismiss Hadoop, get in touch with your SQL server ororacle account team and say "we are planning on working with 15Petabytes of storage with data coming in at 1-2PB/month" and see whatthey say back and how big their quote is. The search terms "MapReduce aMajor Step Backwards" shows some of the debate going on.

Re: the question of hadoop

Reply via email to