Normally MR job is used for batch processing. So I don't think this is a
good use case here for MR.
Since you need to run the program periodically, you cannot submit a single
mapreduce job for this.
An possible way is to create a cron job to scan the folder size and submit
a MR job if necessary;



On Wed, Aug 27, 2014 at 7:38 PM, Kandoi, Nikhil <nikhil.kan...@emc.com>
wrote:

> Hi All,
>
>
>
> I have a system where files are coming in hdfs at regular intervals and I
> perform an operation everytime the directory size goes above a particular
> point.
>
> My Question is that when I submit a map reduce job, would it only work on
> the files present at that point ??
>
>
>
> Regards,
> Nikhil Kandoi
>
>
>



-- 
Regards,
*Stanley Shi,*

Reply via email to