RE: HDFS data transfer is faster than SCP based transfer?

2014-01-26 Thread John Lilley
x27;d need to know more about your application. john From: rab ra [mailto:rab...@gmail.com] Sent: Saturday, January 25, 2014 7:29 AM To: user@hadoop.apache.org Subject: RE: HDFS data transfer is faster than SCP based transfer? The input files are provided as argument to a binary being executed b

RE: HDFS data transfer is faster than SCP based transfer?

2014-01-25 Thread rab ra
HDFS to local disk? Can you read from > HDFS directly using the FileSystem interface? > > john > > > > *From:* Shekhar Sharma [mailto:shekhar2...@gmail.com] > *Sent:* Saturday, January 25, 2014 3:44 AM > *To:* user@hadoop.apache.org > *Subject:* Re: HDFS data trans

RE: HDFS data transfer is faster than SCP based transfer?

2014-01-25 Thread John Lilley
Subject: Re: HDFS data transfer is faster than SCP based transfer? We have the concept of short circuit reads which directly reads from data node which improve read performance. Do we have similar concept like short circuit writes On 25 Jan 2014 16:10, "Harsh J" mailto:ha...@cl

Re: HDFS data transfer is faster than SCP based transfer?

2014-01-25 Thread Shekhar Sharma
We have the concept of short circuit reads which directly reads from data node which improve read performance. Do we have similar concept like short circuit writes On 25 Jan 2014 16:10, "Harsh J" wrote: > There's a lot of difference here, although both do use TCP underneath, > but do note that SC

Re: HDFS data transfer is faster than SCP based transfer?

2014-01-25 Thread Harsh J
There's a lot of difference here, although both do use TCP underneath, but do note that SCP securely encrypts data but stock HDFS configuration does not. You can also ask SCP to compress data transfer via the "-C" argument btw - unsure if you already applied that pre-test - it may help show up som

Re: HDFS data transfer is faster than SCP based transfer?

2014-01-25 Thread Shekhar Sharma
WHEN u put the data or write into HDFS, 64kb of data is written on client side and then it is pushed through pipeline and this process continue till 64mb of data is written which is the block size defined by the client. While on the other hand scp will try to buffer the entire data. Passing chunks

Re: HDFS data transfer is faster than SCP based transfer?

2014-01-24 Thread rab ra
It is not a single file. Lot of small files. Files are stored in HDFS and map operations copies required files from hdfs. One map process running in one node only. Each file will be about 16MB On 24 Jan 2014 23:49, "Vinod Kumar Vavilapalli" wrote: > Is it a single file? Lots of files? How big are

Re: HDFS data transfer is faster than SCP based transfer?

2014-01-24 Thread Vinod Kumar Vavilapalli
Is it a single file? Lots of files? How big are the files? Is the copy on a single node or are you running some kind of a MapReduce program? +Vinod Hortonworks Inc. http://hortonworks.com/ On Fri, Jan 24, 2014 at 7:21 AM, rab ra wrote: > Hi > > Can anyone please answer my query? > > -Rab > ---