[ https://issues.apache.org/jira/browse/HADOOP-4988?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Vivek Ratan updated HADOOP-4988:
--------------------------------
Attachment: 4988.2.patch
bq. If a queue has no capacity, we should not be giving a task.
Good catch. I had forgotten about this check. It has now been added, I've
synced with trunk, and a new patch (4988.2.patch) is attached. I've run
dos2unix on it, and the output of ant test-patch is below:
{code}
[exec] +1 overall.
[exec]
[exec] +1 @author. The patch does not contain any @author tags.
[exec]
[exec] +1 tests included. The patch appears to include 3 new or modified tests.
[exec]
[exec] +1 javadoc. The javadoc tool did not generate any warning messages.
[exec]
[exec] +1 javac. The applied patch does not increase the total number of javac compiler warnings.
[exec]
[exec] +1 findbugs. The patch does not introduce any new Findbugs warnings.
[exec]
[exec] +1 Eclipse classpath. The patch retains Eclipse classpath integrity.
{code}
> An earlier fix, for HADOOP-4373, results in a problem with reclaiming
> capacity when one or more queues have a capacity equal to zero
> ------------------------------------------------------------------------------------------------------------------------------------
>
> Key: HADOOP-4988
> URL: https://issues.apache.org/jira/browse/HADOOP-4988
> Project: Hadoop Core
> Issue Type: Bug
> Components: contrib/capacity-sched
> Reporter: Vivek Ratan
> Priority: Blocker
> Attachments: 4988.1.patch, 4988.2.patch
>
>
> HADOOP-4373 introduced a fix for queues with guaranteed capacity (gc) equal
> to zero. Part of the fix was in the queue comparator used to sort queues.
> Queues with gc=0 were placed at the end. This causes a problem with the code
> for reclaiming capacity, which assumes that queues are sorted based on free
> space available and that a queue with gc=0 is no different than a queue which
> is running at capacity. Because of this, the following problem can arise: if
> we have a system with at least one queue whose gc=0, we may fail to reclaim
> capacity for some queues.
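
To illustrate the ordering that the reclaim code assumes, here is a minimal sketch. It is not the code in 4988.2.patch or in the capacity scheduler: the class QueueOrderingSketch, the type QueueInfo, and its fields and methods are all hypothetical names made up for this example, and it uses modern Java for brevity. It sorts queues by the fraction of guaranteed capacity in use and treats a queue with gc=0 as if it were running exactly at capacity, rather than pushing it to the end. It also shows the check mentioned above, that a queue with no capacity should not be given a task.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class QueueOrderingSketch {

    /** Hypothetical stand-in for a scheduler queue; not a real Hadoop class. */
    static final class QueueInfo {
        final String name;
        final int guaranteedCapacity; // gc, in slots (assumed unit)
        final int slotsInUse;

        QueueInfo(String name, int guaranteedCapacity, int slotsInUse) {
            this.name = name;
            this.guaranteedCapacity = guaranteedCapacity;
            this.slotsInUse = slotsInUse;
        }

        /** Fraction of gc in use. A gc=0 queue is treated as exactly at capacity. */
        double usedRatio() {
            if (guaranteedCapacity == 0) {
                return 1.0;
            }
            return (double) slotsInUse / guaranteedCapacity;
        }

        /** A queue with no capacity should not be handed a task. */
        boolean mayReceiveTask() {
            return guaranteedCapacity > 0;
        }
    }

    /** Queues with the most free space (lowest used ratio) sort first. */
    static final Comparator<QueueInfo> BY_FREE_SPACE =
        Comparator.comparingDouble(QueueInfo::usedRatio);

    /** Returns queue names in the order the comparator produces. */
    static List<String> sortedNames(List<QueueInfo> queues) {
        List<QueueInfo> copy = new ArrayList<>(queues);
        copy.sort(BY_FREE_SPACE);
        List<String> names = new ArrayList<>();
        for (QueueInfo q : copy) {
            names.add(q.name);
        }
        return names;
    }

    public static void main(String[] args) {
        List<QueueInfo> queues = Arrays.asList(
            new QueueInfo("zero", 0, 0),    // gc=0
            new QueueInfo("over", 10, 12),  // running over capacity
            new QueueInfo("idle", 10, 2));  // mostly free
        System.out.println(sortedNames(queues));
    }
}
```

With this ordering the gc=0 queue lands between the under-capacity and over-capacity queues instead of after both, which is the position the description says the reclaim logic expects.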