Re: Data loading from Datanode

2011-12-08 Thread Bejoy Ks
Hi Keshav
 Adding on to others comments. You can install hive anywhere, not 
necessary on the namenode. You can install the same on a data node or an 
utility server other than name node as well, I know a few large clusters that 
operates so.It applies the same with pig and other libraries. You can load the 
data into hive table even from a remote server provided there are sufficient 
permissions .
    AFAIK, on a high level when you load data from a LOCAL INPATH (dir 
in lfs), what it happens is that client sends a request to namenode to copy the 
file into hdfs. The name node do the needful and finally the data is stored as 
blocks in data nodes. The namenode has a fs image (file system image) and an 
entry of this new file would be made there. In short the name node is involved 
in the process from beginning itself and it is in sync. Also in hadoop it is 
not the name node that communicates and syncs up with data node but the data 
node that syncs up with name node with periodic heart beats.

Hope it helps!...

Regards
Bejoy.KS




 From: Jasper Knulst jasper.knu...@incentro.com
To: user@hive.apache.org 
Sent: Thursday, December 8, 2011 2:09 PM
Subject: Re: Data loading from Datanode
 

Hi Keshav,
What you want is not possible I guess. You can't submit anything into HDFS 
without the namenode. Datanodes reports their local blocks into the namenode. 
If the namenode does not know them it will instruct the datanode to delete 
them. 
But whats the point? If you submit local files to HDFS (via the namenode ) from 
a datanode, the nn will assign to first block placement directly on that dn 
(provided there is room etc.) and start replicating from there.
Cheers Jasper

Op 8 dec. 2011 08:24 schreef Savant, Keshav keshav.c.sav...@fisglobal.com 
het volgende:

Hi All,
 
Is it possible to load data (in HDFS) using Hive Load data query from any of 
the Datanode?
 
So that means can we insert files into datanode directly (or from hive 
installed on datanode) and then the master node syncs with datanodes later.
 
Keshav C Savant
_
The information contained in this message is proprietary and/or confidential. 
If you are not the intended recipient, please: (i) delete the message and all 
copies; (ii) do not disclose, distribute or use the message in any manner; and 
(iii) notify the sender immediately. In addition, please be aware that any 
message addressed to our domain is subject to archiving and review by persons 
other than the intended recipient. Thank you.


Data loading from Datanode

2011-12-07 Thread Savant, Keshav
Hi All,

 

Is it possible to load data (in HDFS) using Hive Load data query from
any of the Datanode?

 

So that means can we insert files into datanode directly (or from hive
installed on datanode) and then the master node syncs with datanodes
later.

 

Keshav C Savant

_
The information contained in this message is proprietary and/or confidential. 
If you are not the intended recipient, please: (i) delete the message and all 
copies; (ii) do not disclose, distribute or use the message in any manner; and 
(iii) notify the sender immediately. In addition, please be aware that any 
message addressed to our domain is subject to archiving and review by persons 
other than the intended recipient. Thank you.