[jira] [Commented] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302239#comment-17302239 ] Qi Zhu commented on YARN-10688: --- Thanks [~ebadger] for review. Updated it in latest patch. The test is

[jira] [Commented] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-15 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302234#comment-17302234 ] Hadoop QA commented on YARN-10688: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Resolved] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Akira Ajisaka (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Akira Ajisaka resolved YARN-10694. -- Resolution: Duplicate Closing as dup. Thank you [~zhuqi] for the information. > Fix spotbugs

[jira] [Updated] (YARN-10690) GPU related improvement for better usage.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10690: -- Description: This Jira will improve GPU for better usage.   > GPU related improvement for better usage. >

[jira] [Comment Edited] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302174#comment-17302174 ] Qi Zhu edited comment on YARN-10694 at 3/16/21, 3:19 AM: - Thanks [~aajisaka] for

[jira] [Comment Edited] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302174#comment-17302174 ] Qi Zhu edited comment on YARN-10694 at 3/16/21, 3:18 AM: - Thanks [~aajisaka] for

[jira] [Comment Edited] (YARN-10503) Support queue capacity in terms of absolute resources with gpu resourceType.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10503?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17300121#comment-17300121 ] Qi Zhu edited comment on YARN-10503 at 3/16/21, 3:06 AM: - Updated a patch for

[jira] [Updated] (YARN-10503) Support queue capacity in terms of absolute resources with gpu resourceType.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10503?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10503: -- Parent: YARN-10690 Issue Type: Sub-task (was: Improvement) > Support queue capacity in terms of

[jira] [Commented] (YARN-10690) GPU related improvement for better usage.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302182#comment-17302182 ] Qi Zhu commented on YARN-10690: --- Thanks [~ebadger] for good suggestion. I have convert the related JIRAs

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Parent: YARN-10690 Issue Type: Sub-task (was: Improvement) > Add Node GPU Utilization and apply to

[jira] [Updated] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10616: -- Parent: YARN-10690 Issue Type: Sub-task (was: Bug) > Nodemanagers cannot detect GPU failures >

[jira] [Updated] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10688: -- Parent: YARN-10690 Issue Type: Sub-task (was: Improvement) > ClusterMetrics should support GPU

[jira] [Updated] (YARN-10690) GPU related improvement for better usage.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10690: -- Summary: GPU related improvement for better usage. (was: ClusterMetrics should support GPU related metrics.)

[jira] [Commented] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302174#comment-17302174 ] Qi Zhu commented on YARN-10694: --- Thanks [~aajisaka] for finding.  YARN-10689  should handle this. > Fix

[jira] [Updated] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10688: -- Attachment: YARN-10688.003.patch > ClusterMetrics should support GPU capacity related metrics. >

[jira] [Updated] (YARN-10690) ClusterMetrics should support GPU related metrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10690: -- Summary: ClusterMetrics should support GPU related metrics. (was: ClusterMetrics should support GPU

[jira] [Commented] (YARN-10689) Fix the findbugs issues in extractFloatValueFromWeightConfig.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302175#comment-17302175 ] Qi Zhu commented on YARN-10689: --- [~aajisaka] Could you help review this?  > Fix the findbugs issues in

[jira] [Commented] (YARN-10495) make the rpath of container-executor configurable

2021-03-15 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302168#comment-17302168 ] angerszhu commented on YARN-10495: -- [~ebadger] Since it's a issue start from hadoop-3.3.0, so we don't

[jira] [Commented] (YARN-10616) Nodemanagers cannot detect GPU failures

2021-03-15 Thread Zhankun Tang (Jira)
[ https://issues.apache.org/jira/browse/YARN-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302164#comment-17302164 ] Zhankun Tang commented on YARN-10616: - [~ebadger], Thanks for picking this up. The YARN-8823 had this

[jira] [Commented] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Akira Ajisaka (Jira)
[ https://issues.apache.org/jira/browse/YARN-10694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302133#comment-17302133 ] Akira Ajisaka commented on YARN-10694: -- After upgrading SpotBugs to 4.2.2 in HADOOP-16870, SpotBugs

[jira] [Created] (YARN-10694) Fix spotbugs warning in CapacitySchedulerConfiguration.java

2021-03-15 Thread Akira Ajisaka (Jira)
Akira Ajisaka created YARN-10694: Summary: Fix spotbugs warning in CapacitySchedulerConfiguration.java Key: YARN-10694 URL: https://issues.apache.org/jira/browse/YARN-10694 Project: Hadoop YARN

[jira] [Commented] (YARN-10690) ClusterMetrics should support GPU utilization related metrics.

2021-03-15 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-10690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302009#comment-17302009 ] Eric Badger commented on YARN-10690: [~zhuqi], can we convert the related JIRAs to be subtasks of

[jira] [Commented] (YARN-10501) Can't remove all node labels after add node label without nodemanager port

2021-03-15 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-10501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301993#comment-17301993 ] Eric Badger commented on YARN-10501: [~caozhiqiang], it doesn't need to be merged to 2.10.1. It has

[jira] [Commented] (YARN-10688) ClusterMetrics should support GPU capacity related metrics.

2021-03-15 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-10688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301987#comment-17301987 ] Eric Badger commented on YARN-10688: [~zhuqi], thanks for the updated patch. To make things a little

[jira] [Updated] (YARN-10588) Percentage of queue and cluster is zero in WebUI

2021-03-15 Thread Eric Payne (Jira)
[ https://issues.apache.org/jira/browse/YARN-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Payne updated YARN-10588: -- Fix Version/s: 3.4.0 > Percentage of queue and cluster is zero in WebUI >

[jira] [Updated] (YARN-10495) make the rpath of container-executor configurable

2021-03-15 Thread Eric Badger (Jira)
[ https://issues.apache.org/jira/browse/YARN-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Badger updated YARN-10495: --- Fix Version/s: 3.3.1 [~angerszhu], I backported this to branch-3.3. There's a conflict past that. If

[jira] [Commented] (YARN-10588) Percentage of queue and cluster is zero in WebUI

2021-03-15 Thread Eric Payne (Jira)
[ https://issues.apache.org/jira/browse/YARN-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301831#comment-17301831 ] Eric Payne commented on YARN-10588: --- OK. Thanks [~BilwaST] and [~Jim_Brennan]! +1 I will commit this

[jira] [Commented] (YARN-10495) make the rpath of container-executor configurable

2021-03-15 Thread angerszhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10495?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301569#comment-17301569 ] angerszhu commented on YARN-10495: -- [~ebadger] Hi Eric, should we backport this pr to branch-3.3? since

[jira] [Created] (YARN-10693) Add document for YARN-10623 auto refresh queue conf in cs.

2021-03-15 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10693: - Summary: Add document for YARN-10623 auto refresh queue conf in cs. Key: YARN-10693 URL: https://issues.apache.org/jira/browse/YARN-10693 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Hadoop QA (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301519#comment-17301519 ] Hadoop QA commented on YARN-10692: -- | (x) *{color:red}-1 overall{color}* | \\ \\ || Vote || Subsystem ||

[jira] [Updated] (YARN-10691) DominantResourceCalculator isInvalidDivisor should consider only countable resource types

2021-03-15 Thread Bilwa S T (Jira)
[ https://issues.apache.org/jira/browse/YARN-10691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bilwa S T updated YARN-10691: - Summary: DominantResourceCalculator isInvalidDivisor should consider only countable resource types

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Attachment: (was: YARN-10692.001.patch) > Add Node GPU Utilization and apply to NodeMetrics. >

[jira] [Commented] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301452#comment-17301452 ] Qi Zhu commented on YARN-10692: --- [~pbacsko]  [~Jim_Brennan]  [~ebadger]  [~gandras]   Could you help

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Attachment: YARN-10692.001.patch > Add Node GPU Utilization and apply to NodeMetrics. >

[jira] [Updated] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
[ https://issues.apache.org/jira/browse/YARN-10692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Qi Zhu updated YARN-10692: -- Description: Now there are no node level GPU Utilization, this issue will add it, and add it to NodeMetrics

[jira] [Created] (YARN-10692) Add Node GPU Utilization and apply to NodeMetrics.

2021-03-15 Thread Qi Zhu (Jira)
Qi Zhu created YARN-10692: - Summary: Add Node GPU Utilization and apply to NodeMetrics. Key: YARN-10692 URL: https://issues.apache.org/jira/browse/YARN-10692 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-10588) Percentage of queue and cluster is zero in WebUI

2021-03-15 Thread Bilwa S T (Jira)
[ https://issues.apache.org/jira/browse/YARN-10588?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17301434#comment-17301434 ] Bilwa S T commented on YARN-10588: -- Thanks [~Jim_Brennan] and [~epayne] for review comments. I have

[jira] [Created] (YARN-10691) DominantResourceCalculator divide and ratio methods should consider only countable resource types

2021-03-15 Thread Bilwa S T (Jira)
Bilwa S T created YARN-10691: Summary: DominantResourceCalculator divide and ratio methods should consider only countable resource types Key: YARN-10691 URL: https://issues.apache.org/jira/browse/YARN-10691