[
https://issues.apache.org/jira/browse/HADOOP-5719?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sreekanth Ramakrishnan updated HADOOP-5719:
-------------------------------------------
Attachment: HADOOP-5719-2.patch
Attaching patch incorporating most of Vinod's comments.
bq. I think a better place for job removal from the JobQueuesManager is the
cleanUpInitializedJobsList() method of teh JobInitializationPoller. We may want
to rename this method and change its javadoc a bit.
This has not been incorporated because of issue described in HADOOP-5020 it is
hit when {{JobInProgress.initTasks()}} throws an exception and terminate job is
called and Capacity scheduler would never be able to remove the job from the
waiting queue.
Also added a new test case {{TestJobInitalizationPoller}} which uses
{{MiniMRCluster}} to verify if jobs failing initialization are actually removed
from waiting queue.
> Jobs failed during job initalization are never removed from Capacity
> Schedulers waiting list
> --------------------------------------------------------------------------------------------
>
> Key: HADOOP-5719
> URL: https://issues.apache.org/jira/browse/HADOOP-5719
> Project: Hadoop Core
> Issue Type: Bug
> Components: contrib/capacity-sched
> Reporter: Sreekanth Ramakrishnan
> Assignee: Sreekanth Ramakrishnan
> Attachments: HADOOP-5719-1.patch, HADOOP-5719-2.patch
>
>
> Jobs which fail during initalization are never removed from Capacity
> Schedulers waiting job list.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.