[jira] [Commented] (MAPREDUCE-1729) Distributed cache should provide an option to fail the job or not, if cache file gets modified on the fly.

2013-08-27 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13751722#comment-13751722
 ] 

Akira AJISAKA commented on MAPREDUCE-1729:
--

[~yamashitasni], thanks for your comment.
I'll try to implement your 2nd option.

 Distributed cache should provide an option to fail the job or not, if cache 
 file gets modified on the fly.
 --

 Key: MAPREDUCE-1729
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1729
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: distributed-cache
Reporter: Amareshwari Sriramadasu
Assignee: Akira AJISAKA

 Currently, distributed cache fails the job if the cache file gets modified on 
 the fly. But there should be an option to fail a job or not.
 See discussions in MAPREDUCE-1288.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (MAPREDUCE-1729) Distributed cache should provide an option to fail the job or not, if cache file gets modified on the fly.

2013-08-26 Thread Shinichi Yamashita (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13750707#comment-13750707
 ] 

Shinichi Yamashita commented on MAPREDUCE-1729:
---

Hi,

When a target file is updated during MapReduce Job, the result will change.
Because it may cause inconsistencies in the result of MapReduce Job, the 
architecture of the current DistributedCache is valid.
But it may set the following options,

1. Addition of a property the update of the target file in MapReduce Job
2. It copies a target file before MapReduce Job start in another directory and 
use that file

By any policy, this know-how should document it as an important usage of 
DistributedCache.

 Distributed cache should provide an option to fail the job or not, if cache 
 file gets modified on the fly.
 --

 Key: MAPREDUCE-1729
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1729
 Project: Hadoop Map/Reduce
  Issue Type: New Feature
  Components: distributed-cache
Reporter: Amareshwari Sriramadasu
Assignee: Akira AJISAKA

 Currently, distributed cache fails the job if the cache file gets modified on 
 the fly. But there should be an option to fail a job or not.
 See discussions in MAPREDUCE-1288.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (MAPREDUCE-1729) Distributed cache should provide an option to fail the job or not, if cache file gets modified on the fly.

2013-07-15 Thread Akira AJISAKA (JIRA)

[ 
https://issues.apache.org/jira/browse/MAPREDUCE-1729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13708801#comment-13708801
 ] 

Akira AJISAKA commented on MAPREDUCE-1729:
--

Hi, I need this option.

A long-term job failed in our production environment because the file which was 
used as distributed cache was modified at fixed intervals. As the output of the 
job would be better if distributed cache is newer, we don't want to fail the 
job if cache file gets modified on the fly. Our workaround is, copy original 
file to tmpfile and use tmpfile as distributed cache. If the option exists, we 
don't need to copy original file before the job begin.

 Distributed cache should provide an option to fail the job or not, if cache 
 file gets modified on the fly.
 --

 Key: MAPREDUCE-1729
 URL: https://issues.apache.org/jira/browse/MAPREDUCE-1729
 Project: Hadoop Map/Reduce
  Issue Type: Bug
  Components: distributed-cache
Reporter: Amareshwari Sriramadasu

 Currently, distributed cache fails the job if the cache file gets modified on 
 the fly. But there should be an option to fail a job or not.
 See discussions in MAPREDUCE-1288.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira