[ https://issues.apache.org/jira/browse/HADOOP-5049?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12666424#action_12666424 ]
Owen O'Malley commented on HADOOP-5049:
---------------------------------------
No, I'm saying that the correct behavior should be to run the init, reduce, and
cleanup tasks, even when there are 0 maps. Think of a system that wants to run
every 5 minutes to do roll ups. I think it makes more sense to have a directory
with empty output files than a missing one if there was no input data.
In terms of the fix to the immediate problem, I think that all changes to the
job state should be sent to the listeners. Doesn't that solve the original
problem?
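A minimal sketch of that idea, assuming a much simplified model: every state change goes through one method that notifies all listeners, so a job that completes during init is still removed from the queue. All names here (Job, JobListener, QueueListener, Tracker) are hypothetical stand-ins for illustration, not the real Hadoop classes, and the sketch does not cover running the reduce and cleanup tasks for a 0-map job.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical, simplified stand-ins; these are NOT the real Hadoop classes.
enum JobState { PREP, RUNNING, SUCCEEDED }

class Job {
    final String id;
    final int numMaps;
    JobState state = JobState.PREP;

    Job(String id, int numMaps) {
        this.id = id;
        this.numMaps = numMaps;
    }
}

interface JobListener {
    void jobAdded(Job job);
    void jobStateChanged(Job job, JobState oldState);
}

// Models a queue that drops a job only when it is told about a state change.
class QueueListener implements JobListener {
    final List<Job> queue = new ArrayList<>();

    public void jobAdded(Job job) {
        queue.add(job);
    }

    public void jobStateChanged(Job job, JobState oldState) {
        if (job.state == JobState.SUCCEEDED) {
            queue.remove(job);
        }
    }
}

class Tracker {
    private final List<JobListener> listeners = new ArrayList<>();

    void addListener(JobListener listener) {
        listeners.add(listener);
    }

    void submit(Job job) {
        for (JobListener l : listeners) {
            l.jobAdded(job);
        }
    }

    // Single choke point: every change to the job state is sent to the listeners.
    private void setState(Job job, JobState next) {
        JobState old = job.state;
        if (old == next) {
            return;
        }
        job.state = next;
        for (JobListener l : listeners) {
            l.jobStateChanged(job, old);
        }
    }

    // Assumption for this sketch: a job with 0 maps completes during init.
    void init(Job job) {
        if (job.numMaps == 0) {
            setState(job, JobState.SUCCEEDED);
        } else {
            setState(job, JobState.RUNNING);
        }
    }
}
```

With this shape, a job with 0 maps moves from PREP to SUCCEEDED inside init(), the queue listener receives that change, and the job leaves the queue without any periodic scan.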
> Jobs with 0 maps will never get removed from the default scheduler
> ------------------------------------------------------------------
>
> Key: HADOOP-5049
> URL: https://issues.apache.org/jira/browse/HADOOP-5049
> Project: Hadoop Core
> Issue Type: Bug
> Components: mapred
> Reporter: Amar Kamat
> Assignee: Amar Kamat
> Priority: Blocker
> Fix For: 0.20.0
>
> Attachments: HADOOP-5049-v1.1.patch
>
>
> Jobs with 0 maps finish/succeed in the init phase, i.e. while the job is in
> the _PREP_ state. {{EagerTaskInitializationListener}} removes the job after
> initializing it, but {{JobQueueJobInProgressListener}} waits for a job-state
> change event to be raised and only then removes the job from the queue.
> Since no such event is raised here, the job stays with the
> {{JobQueueJobInProgressListener}} forever. It looks like {{FairScheduler}}
> periodically scans the job list and removes completed jobs.
> {{CapacityScheduler}} has a concept of waiting jobs: it scans the waiting
> queue for completed jobs and purges them.