Identifying new files on HDFS

2015-03-25 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, We have a requirement to process only new files in HDFS on a daily basis. I am sure this is a common requirement in many ETL-style processing scenarios. Just wondering if there is a way to identify new files that are added to a path in HDFS? For example, assume some files were already prese

Identifying new files in HDFS

2015-03-25 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, We have a requirement to process only new files in HDFS on a daily basis. I am sure this is a common requirement in many ETL-style processing scenarios. Just wondering if there is a way to identify new files that are added to a path in HDFS? For example, assume some files were already prese

Re: Significance of PID files

2014-07-03 Thread Vijaya Narayana Reddy Bhoomi Reddy
anks Vijay On 4 July 2014 10:36, Vikas srivastava wrote: > Pid files are used to store the pid of a particular process. Their main use is to > keep one process at a time, like one datanode on any host > On Jul 4, 2014 10:00 AM, Vijaya Narayana Reddy Bhoomi Reddy < > vijay.bho
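
The idea in the reply above (a pid file records a process id and helps keep a single instance running) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how Hadoop's own daemon scripts are written: the function names acquire_pid_file, read_pid and release_pid_file are hypothetical, and the sketch does not handle stale pid files left behind by a crashed process.

```python
import os

def acquire_pid_file(path):
    """Try to create the pid file; return False if it already exists.

    Hypothetical helper: O_EXCL makes the create fail when the file is
    already present, which is what prevents a second instance.
    """
    try:
        fd = os.open(path, os.O_CREAT | os.O_EXCL | os.O_WRONLY, 0o644)
    except FileExistsError:
        return False
    with os.fdopen(fd, "w") as f:
        # Record our own process id so a stop script can find us later.
        f.write(str(os.getpid()))
    return True

def read_pid(path):
    """Return the process id stored in the pid file."""
    with open(path) as f:
        return int(f.read().strip())

def release_pid_file(path):
    """Remove the pid file on clean shutdown."""
    os.remove(path)
```

A daemon would call acquire_pid_file at startup and exit if it returns False; a stop script would call read_pid and signal that process.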

Significance of PID files

2014-07-03 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, Can anyone please explain the significance of the pid files in Hadoop i.e. purpose and usage etc? Thanks & Regards Vijay

Re: HDFS File Writes & Reads

2014-06-19 Thread Vijaya Narayana Reddy Bhoomi Reddy
> Vijay > > > > On 17 June 2014 19:37, Zesheng Wu wrote: > > 1. HDFS doesn't allow parallel writes > 2. HDFS uses a pipeline to write multiple replicas, so it doesn't take three > times longer than a traditional file write > 3. HDFS allows parallel reads >

Re: HDFS File Writes & Reads

2014-06-18 Thread Vijaya Narayana Reddy Bhoomi Reddy
parallel writes > 2. HDFS uses a pipeline to write multiple replicas, so it doesn't take three > times longer than a traditional file write > 3. HDFS allows parallel reads > > > 2014-06-17 19:17 GMT+08:00 Vijaya Narayana Reddy Bhoomi Reddy < > vijay.bhoomire...@gmail.com>

HDFS File Writes & Reads

2014-06-17 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, I have a basic question regarding file writes and reads in HDFS. Is the file write and read process a sequential activity, or is it executed in parallel? For example, let's assume that there is a file File1 which consists of three blocks B1, B2 and B3. 1. Will the write process write B2 only afte

RE: Business Analysts in Hadoop World

2013-06-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
text/videos to get the insight. Best of luck Lokesh Chandra Basu B. Tech Computer Science and Engineering Indian Institute of Technology, Roorkee India (GMT +5hr 30min) On Fri, Jun 28, 2013 at 4:50 PM, Vijaya Narayana Reddy Bhoomi Reddy <vijaya.bho...@huawei.com> wrote: Hi, I am

Business Analysts in Hadoop World

2013-06-28 Thread Vijaya Narayana Reddy Bhoomi Reddy
Hi, I am just trying to get myself acquainted with Hadoop and other related technologies. I am very much fascinated by the potential of the Big Data world and hence would like to be part of it! However, it has been a while since I have done any coding. Earlier for a brief period of time during ear