[jira] [Commented] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732713#comment-16732713 ] Hadoop QA commented on YARN-8967: -1 overall

[jira] [Updated] (YARN-9171) Replace incorrect use of system property user.name

2019-01-02 Thread Dinesh Chitlangia (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dinesh Chitlangia updated YARN-9171: Description: This jira has been created to track the suggested changes for YARN as

[jira] [Comment Edited] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732663#comment-16732663 ] lujie edited comment on YARN-9164 at 1/3/19 4:54 AM: - Hi:

[jira] [Commented] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732663#comment-16732663 ] lujie commented on YARN-9164: - Hi: [~cheersyang],[~jlowe],[~leftnoteasy] I don't think the unit failure

[jira] [Commented] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732661#comment-16732661 ] Hadoop QA commented on YARN-9164: -1 overall

[jira] [Updated] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-02 Thread Wilfred Spiegelenburg (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wilfred Spiegelenburg updated YARN-8967: Attachment: YARN-8967.002.patch > Change FairScheduler to use PlacementRule

[jira] [Commented] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-02 Thread Wilfred Spiegelenburg (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732656#comment-16732656 ] Wilfred Spiegelenburg commented on YARN-8967: - new patch rebased to the latest trunk: fixed

[jira] [Resolved] (YARN-9172) Correct the typo related to "DominantResourceCalculator" in error message of CapacityScheduler when resource types is more than two

2019-01-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang resolved YARN-9172. Resolution: Fixed Already fixed in trunk. Closed this JIRA. > Correct the typo related to

[jira] [Commented] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Tao Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732634#comment-16732634 ] Tao Yang commented on YARN-9163: Hi [~ziqian hu] We have communicated about this problem offline before,

[jira] [Updated] (YARN-9172) Correct the typo related to "DominantResourceCalculator" in error message of CapacityScheduler when resource types is more than two

2019-01-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhankun Tang updated YARN-9172: --- Summary: Correct the typo related to "DominantResourceCalculator" in error message of

[jira] [Created] (YARN-9172) Correct the typo "DominantResourceCalculator" in CapacityScheduler

2019-01-02 Thread Zhankun Tang (JIRA)
Zhankun Tang created YARN-9172: -- Summary: Correct the typo "DominantResourceCalculator" in CapacityScheduler Key: YARN-9172 URL: https://issues.apache.org/jira/browse/YARN-9172 Project: Hadoop YARN

[jira] [Commented] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732619#comment-16732619 ] Hadoop QA commented on YARN-8967: -1 overall

[jira] [Updated] (YARN-8967) Change FairScheduler to use PlacementRule interface

2019-01-02 Thread Wilfred Spiegelenburg (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wilfred Spiegelenburg updated YARN-8967: Attachment: YARN-8967.001.patch > Change FairScheduler to use PlacementRule

[jira] [Commented] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-02 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732613#comment-16732613 ] Zac Zhou commented on YARN-9161: Thanks a lot, [~leftnoteasy], [~sunilg] {code:java} // W/o unit for

[jira] [Commented] (YARN-9168) DistributedShell client timeout should be -1 by default

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732614#comment-16732614 ] Weiwei Yang commented on YARN-9168: --- Hi [~tangzhankun] {quote}And the timeout value "-1" can be used as

[jira] [Commented] (YARN-9147) Auxiliary manifest file deleted from HDFS does not trigger service to be removed

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732601#comment-16732601 ] Hadoop QA commented on YARN-9147: +1 overall

[jira] [Commented] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-02 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732594#comment-16732594 ] Sunil Govindan commented on YARN-9161: -- Yes [~leftnoteasy]. By design, there are no such limits. It

[jira] [Updated] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lujie updated YARN-9164: Attachment: YARN-9164-2.patch > NullPointerException crash the ResourceManager >

[jira] [Commented] (YARN-9168) DistributedShell client timeout should be -1 by default

2019-01-02 Thread Zhankun Tang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732582#comment-16732582 ] Zhankun Tang commented on YARN-9168: [~cheersyang], Yeah. Thanks for reviewing this. I'm fine with the

[jira] [Updated] (YARN-9147) Auxiliary manifest file deleted from HDFS does not trigger service to be removed

2019-01-02 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi updated YARN-9147: - Attachment: YARN-9147.1.patch > Auxiliary manifest file deleted from HDFS does not trigger

[jira] [Created] (YARN-9171) Replace incorrect use of system property user.name

2019-01-02 Thread Dinesh Chitlangia (JIRA)
Dinesh Chitlangia created YARN-9171: --- Summary: Replace incorrect use of system property user.name Key: YARN-9171 URL: https://issues.apache.org/jira/browse/YARN-9171 Project: Hadoop YARN

[jira] [Commented] (YARN-9003) Support multi-homed network for docker container

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732563#comment-16732563 ] Hadoop QA commented on YARN-9003: -1 overall

[jira] [Comment Edited] (YARN-9003) Support multi-homed network for docker container

2019-01-02 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732523#comment-16732523 ] Eric Yang edited comment on YARN-9003 at 1/2/19 11:46 PM: -- Docker inspect shows

[jira] [Commented] (YARN-9003) Support multi-homed network for docker container

2019-01-02 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732523#comment-16732523 ] Eric Yang commented on YARN-9003: - Docker inspect shows network settings that looks like this: {code}

[jira] [Updated] (YARN-9003) Support multi-homed network for docker container

2019-01-02 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Yang updated YARN-9003: Attachment: YARN-9003.003.patch > Support multi-homed network for docker container >

[jira] [Commented] (YARN-9148) AggregatedLogDeletion doesnt work with S3

2019-01-02 Thread Steve Loughran (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9148?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732362#comment-16732362 ] Steve Loughran commented on YARN-9148: -- that's because directories don't actually exist: something

[jira] [Commented] (YARN-9116) Capacity Scheduler: add the default maximum-allocation-mb and maximum-allocation-vcores for the queues

2019-01-02 Thread Aihua Xu (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732283#comment-16732283 ] Aihua Xu commented on YARN-9116: Thanks [~cheersyang] for the comment. Happy new year. So you are

[jira] [Commented] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread Hadoop QA (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732279#comment-16732279 ] Hadoop QA commented on YARN-9164: -1 overall

[jira] [Commented] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-02 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732270#comment-16732270 ] Wangda Tan commented on YARN-9161: -- [~yuan_zac], thanks for reporting this issue. [~sunilg], do you

[jira] [Created] (YARN-9170) Name Node Format Exception showing

2019-01-02 Thread RasmiRanjan Biswal (JIRA)
RasmiRanjan Biswal created YARN-9170: Summary: Name Node Format Exception showing Key: YARN-9170 URL: https://issues.apache.org/jira/browse/YARN-9170 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Wangda Tan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732255#comment-16732255 ] Wangda Tan commented on YARN-9163: -- [~ziqian hu], could u upload jstack or at least 3 full stacktrace of

[jira] [Commented] (YARN-7904) Privileged, trusted containers need all of their bind-mounted directories to be read-only

2019-01-02 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732257#comment-16732257 ] Eric Yang commented on YARN-7904: - When privileged container is running as someone else, the root file

[jira] [Comment Edited] (YARN-9003) Support multi-homed network for docker container

2019-01-02 Thread Eric Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16729302#comment-16729302 ] Eric Yang edited comment on YARN-9003 at 1/2/19 5:36 PM: - [~yuan_zac] Have you

[jira] [Commented] (YARN-9147) Auxiliary manifest file deleted from HDFS does not trigger service to be removed

2019-01-02 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732234#comment-16732234 ] Billie Rinaldi commented on YARN-9147: -- Good idea. Thanks, [~eyang]. > Auxiliary manifest file

[jira] [Assigned] (YARN-9147) Auxiliary manifest file deleted from HDFS does not trigger service to be removed

2019-01-02 Thread Billie Rinaldi (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Billie Rinaldi reassigned YARN-9147: Assignee: Billie Rinaldi > Auxiliary manifest file deleted from HDFS does not trigger

[jira] [Comment Edited] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732149#comment-16732149 ] Weiwei Yang edited comment on YARN-9163 at 1/2/19 3:41 PM: --- Hi [~ziqian hu]

[jira] [Commented] (YARN-9168) DistributedShell client timeout should be -1 by default

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732169#comment-16732169 ] Weiwei Yang commented on YARN-9168: --- Hi [~tangzhankun] I think it introduced such a finite timeout was

[jira] [Commented] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732168#comment-16732168 ] lujie commented on YARN-9164: - Hi : [~cheersyang] I have changed the patch as your suggestion: (1) fix

[jira] [Updated] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread lujie (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] lujie updated YARN-9164: Attachment: YARN-9164-1.patch > NullPointerException crash the ResourceManager >

[jira] [Commented] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732150#comment-16732150 ] Weiwei Yang commented on YARN-9163: --- +[~Tao Yang] in the loop > Deadlock when use yarn rmadmin

[jira] [Commented] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732149#comment-16732149 ] Weiwei Yang commented on YARN-9163: --- Hi [~ziqian hu] This seems to be a blocker, or at least critical.

[jira] [Commented] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Sunil Govindan (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732143#comment-16732143 ] Sunil Govindan commented on YARN-9163: -- cc [~cheersyang] [~leftnoteasy] I can see that the bug is

[jira] [Comment Edited] (YARN-9157) Failed deletion dirs in yarn.nodemanager.local-dirs causes accumulation lots of files under the path yarn.nodemanager.local-dirs and causes operation systerm's I

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732126#comment-16732126 ] Weiwei Yang edited comment on YARN-9157 at 1/2/19 3:01 PM: --- Hi [~jj336013] If I

[jira] [Commented] (YARN-9157) Failed deletion dirs in yarn.nodemanager.local-dirs causes accumulation lots of files under the path yarn.nodemanager.local-dirs and causes operation systerm's Inode

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732126#comment-16732126 ] Weiwei Yang commented on YARN-9157: --- Hi [~jj336013] If I remember correctly, DeletionService should

[jira] [Updated] (YARN-9163) Deadlock when use yarn rmadmin -refreshQueues

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9163?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Weiwei Yang updated YARN-9163: -- Priority: Blocker (was: Major) > Deadlock when use yarn rmadmin -refreshQueues >

[jira] [Commented] (YARN-9164) NullPointerException crash the ResourceManager

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732086#comment-16732086 ] Weiwei Yang commented on YARN-9164: --- Hi [~xiaoheipangzi] I noticed this one and YARN-9165, they all

[jira] [Commented] (YARN-9165) NPE which is similar to YARN-5918

2019-01-02 Thread Weiwei Yang (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732081#comment-16732081 ] Weiwei Yang commented on YARN-9165: --- Hi [~xiaoheipangzi] Thanks for testing this and providing the fix.

[jira] [Commented] (YARN-9161) Absolute resources of capacity scheduler doesn't support GPU and FPGA

2019-01-02 Thread Zac Zhou (JIRA)
[ https://issues.apache.org/jira/browse/YARN-9161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16732006#comment-16732006 ] Zac Zhou commented on YARN-9161: I checked the failed test case TestRunJobCliParsing and found that it's

[jira] [Created] (YARN-9169) Add metrics for queued and paused containers.

2019-01-02 Thread Abhishek Modi (JIRA)
Abhishek Modi created YARN-9169: --- Summary: Add metrics for queued and paused containers. Key: YARN-9169 URL: https://issues.apache.org/jira/browse/YARN-9169 Project: Hadoop YARN Issue Type:

[jira] [Created] (YARN-9168) DistributedShell client timeout should be -1 by default

2019-01-02 Thread Zhankun Tang (JIRA)
Zhankun Tang created YARN-9168: -- Summary: DistributedShell client timeout should be -1 by default Key: YARN-9168 URL: https://issues.apache.org/jira/browse/YARN-9168 Project: Hadoop YARN Issue