[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13750707#comment-13750707
 ] 

Shinichi Yamashita commented on MAPREDUCE-1729:
-----------------------------------------------

Hi,

When a target file is updated during MapReduce Job, the result will change.
Because it may cause inconsistencies in the result of MapReduce Job, the 
architecture of the current DistributedCache is valid.
But it may set the following options,

1. Addition of a property the update of the target file in MapReduce Job
2. It copies a target file before MapReduce Job start in another directory and 
use that file

By any policy, this know-how should document it as an important usage of 
DistributedCache.
                
> Distributed cache should provide an option to fail the job or not, if cache 
> file gets modified on the fly.
> ----------------------------------------------------------------------------------------------------------
>
>                 Key: MAPREDUCE-1729
>                 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1729
>             Project: Hadoop Map/Reduce
>          Issue Type: New Feature
>          Components: distributed-cache
>            Reporter: Amareshwari Sriramadasu
>            Assignee: Akira AJISAKA
>
> Currently, distributed cache fails the job if the cache file gets modified on 
> the fly. But there should be an option to fail a job or not.
> See discussions in MAPREDUCE-1288.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to