[jira] [Commented] (YARN-7337) Expose per-node over-allocation info in Node Report

2017-11-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16246880#comment-16246880 ] Haibo Chen commented on YARN-7337: -- Updated the patch to -deprecate existing getUsed API. -add

[jira] [Updated] (YARN-7337) Expose per-node over-allocation info in Node Report

2017-11-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7337: - Attachment: YARN-7337-YARN-1011.03.patch > Expose per-node over-allocation info in Node Report >

[jira] [Commented] (YARN-7337) Expose per-node over-allocation info in Node Report

2017-11-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16247039#comment-16247039 ] Haibo Chen commented on YARN-7337: -- The unit tests are unrelated. Will update the patch to address

[jira] [Commented] (YARN-7346) Fix compilation errors against hbase2 alpha release

2017-11-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16244182#comment-16244182 ] Haibo Chen commented on YARN-7346: -- Please help me understand this. The mapreduce.tar.gz is shipped for

[jira] [Commented] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-11-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16244381#comment-16244381 ] Haibo Chen commented on YARN-7388: -- Thanks [~rkanter] for the review! killContainer() is solely called in

[jira] [Updated] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-11-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7388: - Attachment: YARN-7388.01.patch > TestAMRestart should be scheduler agnostic >

[jira] [Commented] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-11-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16244622#comment-16244622 ] Haibo Chen commented on YARN-7388: -- I believe the OOM-led test failures are unrelated, let me retrigger

[jira] [Commented] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-11-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16245042#comment-16245042 ] Haibo Chen commented on YARN-7388: -- The unit test failure is unrelated, tracked at YARN-5684 >

[jira] [Updated] (YARN-7337) Expose per-node over-allocation info in Node Report

2017-11-03 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7337: - Attachment: YARN-7337-YARN-1011.01.patch > Expose per-node over-allocation info in Node Report >

[jira] [Commented] (YARN-7337) Expose per-node over-allocation info in Node Report

2017-11-03 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16238623#comment-16238623 ] Haibo Chen commented on YARN-7337: -- Patch updated to address the unit test failures and checkstyle issues.

[jira] [Commented] (YARN-1015) FS should watch node resource utilization and allocate opportunistic containers if appropriate

2017-11-06 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16241281#comment-16241281 ] Haibo Chen commented on YARN-1015: -- Thanks [~asuresh] for the review! bq. I don't think you should even

[jira] [Commented] (YARN-1015) FS should watch node resource utilization and allocate opportunistic containers if appropriate

2017-11-06 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-1015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16241282#comment-16241282 ] Haibo Chen commented on YARN-1015: -- I have not, however, investigated how involved this idea is to

[jira] [Updated] (YARN-7581) ATSv2 does not construct HBase filters correctly in HBase 2.0

2017-12-01 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7581: - Description: TestTimelineReaderWebServicesHBaseStorage.testGetEntitiesConfigFilters() and

[jira] [Commented] (YARN-7346) Fix compilation errors against hbase2 alpha release

2017-12-01 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16274496#comment-16274496 ] Haibo Chen commented on YARN-7346: -- [~rohithsharma] [~vrushalic]. The patch should now be ready for

[jira] [Created] (YARN-7602) NM should reference the singleton JvmMetrics instance

2017-12-03 Thread Haibo Chen (JIRA)
Haibo Chen created YARN-7602: Summary: NM should reference the singleton JvmMetrics instance Key: YARN-7602 URL: https://issues.apache.org/jira/browse/YARN-7602 Project: Hadoop YARN Issue Type:

[jira] [Commented] (YARN-7390) All reservation related test cases failed when TestYarnClient runs against Fair Scheduler.

2017-10-25 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16219625#comment-16219625 ] Haibo Chen commented on YARN-7390: -- Like Yufei said, we have considered parameterizing all unit tests that

[jira] [Commented] (YARN-7390) All reservation related test cases failed when TestYarnClient runs against Fair Scheduler.

2017-10-25 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16219729#comment-16219729 ] Haibo Chen commented on YARN-7390: -- I see. Was thinking that the failure was caused merely by

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-25 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16218975#comment-16218975 ] Haibo Chen commented on YARN-4511: -- [~leftnoteasy] Would you like to look at the patch as well while I am

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217679#comment-16217679 ] Haibo Chen commented on YARN-4511: -- bq. Have another patch for the changes to the swapContainer etc. The

[jira] [Updated] (YARN-7358) TestZKConfigurationStore and TestLeveldbConfigurationStore should explicitly set capacity scheduler

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7358?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7358: - Attachment: YARN-7358.01.patch > TestZKConfigurationStore and TestLeveldbConfigurationStore should

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217727#comment-16217727 ] Haibo Chen commented on YARN-4511: -- Thanks [~jlowe] for the comment. My apologies for raising alarms by

[jira] [Created] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-10-24 Thread Haibo Chen (JIRA)
Haibo Chen created YARN-7388: Summary: TestAMRestart should be scheduler agnostic Key: YARN-7388 URL: https://issues.apache.org/jira/browse/YARN-7388 Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7388: - Attachment: YARN-7388.00.patch > TestAMRestart should be scheduler agnostic >

[jira] [Commented] (YARN-7358) TestZKConfigurationStore and TestLeveldbConfigurationStore should explicitly set capacity scheduler

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217713#comment-16217713 ] Haibo Chen commented on YARN-7358: -- Updated the patch to address the checkstyle indentation issue. >

[jira] [Commented] (YARN-7388) TestAMRestart should be scheduler agnostic

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16217874#comment-16217874 ] Haibo Chen commented on YARN-7388: -- There was no unit test failure even though it returns -1. I think it is

[jira] [Commented] (YARN-7389) Make TestResourceManager Scheduler agnostic

2017-10-24 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16218108#comment-16218108 ] Haibo Chen commented on YARN-7389: -- TestFSAppStarvation.testPreemptionEnabled is tracked at YARN-6747. +1

[jira] [Commented] (YARN-7355) TestDistributedShell should be scheduler agnostic

2017-10-20 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7355?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16213160#comment-16213160 ] Haibo Chen commented on YARN-7355: -- Thanks @Yufei for the review! > TestDistributedShell should be

[jira] [Commented] (YARN-7372) TestContainerSchedulerQueuing.testContainerUpdateExecTypeGuaranteedToOpportunistic is flaky

2017-10-20 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7372?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16213265#comment-16213265 ] Haibo Chen commented on YARN-7372: -- Thanks [~asuresh] for the review! Will check it in shortly. >

[jira] [Updated] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-21 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-4511: - Attachment: YARN-4511-YARN-1011.07.patch Patch update to address comments + added a unit test for

[jira] [Commented] (YARN-7412) test_docker_util.test_check_mount_permitted() is failing

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7412?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227148#comment-16227148 ] Haibo Chen commented on YARN-7412: -- [~vvasudev] Can you please take a look at the fix? I am not quite

[jira] [Commented] (YARN-7390) All reservation related test cases failed when TestYarnClient runs against Fair Scheduler.

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7390?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227136#comment-16227136 ] Haibo Chen commented on YARN-7390: -- Thanks @Yufei for the update! 1) testAMRMToken() should probably call

[jira] [Created] (YARN-7421) Preserve execution type for containers to be promoted by AM post YARN-1015

2017-10-31 Thread Haibo Chen (JIRA)
Haibo Chen created YARN-7421: Summary: Preserve execution type for containers to be promoted by AM post YARN-1015 Key: YARN-7421 URL: https://issues.apache.org/jira/browse/YARN-7421 Project: Hadoop YARN

[jira] [Updated] (YARN-7421) Preserve execution type for containers to be increased by AM post YARN-1015

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7421: - Summary: Preserve execution type for containers to be increased by AM post YARN-1015 (was: Preserve

[jira] [Updated] (YARN-7421) Preserve execution type for containers to be increased by AM post YARN-1015

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7421: - Issue Type: Sub-task (was: Bug) Parent: YARN-1011 > Preserve execution type for containers to be

[jira] [Commented] (YARN-7178) Add documentation for Container Update API

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227291#comment-16227291 ] Haibo Chen commented on YARN-7178: -- [~asuresh] While looking at YARN-7421, it seems that the execution

[jira] [Comment Edited] (YARN-7178) Add documentation for Container Update API

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227291#comment-16227291 ] Haibo Chen edited comment on YARN-7178 at 10/31/17 6:36 PM: [~asuresh] While

[jira] [Commented] (YARN-7421) Preserve execution type for containers to be increased by AM post YARN-1015

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227299#comment-16227299 ] Haibo Chen commented on YARN-7421: -- Looks like ResourceRequests for container increase/promotion are

[jira] [Resolved] (YARN-7421) Preserve execution type for containers to be increased by AM post YARN-1015

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7421?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen resolved YARN-7421. -- Resolution: Not A Problem Based on,

[jira] [Commented] (YARN-7389) Make TestResourceManager Scheduler agnostic

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227429#comment-16227429 ] Haibo Chen commented on YARN-7389: -- Yes. Forgot to cherry-pick into branch-3.0. > Make

[jira] [Commented] (YARN-7178) Add documentation for Container Update API

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227320#comment-16227320 ] Haibo Chen commented on YARN-7178: -- bq. it is not really used right now - and was put in there as a

[jira] [Commented] (YARN-7389) Make TestResourceManager Scheduler agnostic

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227524#comment-16227524 ] Haibo Chen commented on YARN-7389: -- Thanks for doing that! > Make TestResourceManager Scheduler agnostic

[jira] [Comment Edited] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227519#comment-16227519 ] Haibo Chen edited comment on YARN-6940 at 10/31/17 8:48 PM: [~asuresh] A quick

[jira] [Commented] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227519#comment-16227519 ] Haibo Chen commented on YARN-6940: -- [~asuresh] A quick question. If we only do node-local container update

[jira] [Commented] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227721#comment-16227721 ] Haibo Chen commented on YARN-6940: -- Do we mean the warning message? I changed the relaxLocality to false,

[jira] [Commented] (YARN-6940) FairScheduler: Enable Container update CodePaths and container resize testcase

2017-10-31 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6940?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16227756#comment-16227756 ] Haibo Chen commented on YARN-6940: -- Never mind. Talked with [~rkanter] offline. This is the correct way

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-30 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16225984#comment-16225984 ] Haibo Chen commented on YARN-4511: -- Thanks [~asuresh] for your review! bq. Is there a case where the

[jira] [Updated] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-30 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-4511: - Attachment: YARN-4511-YARN-1011.10.patch > Common scheduler changes supporting scheduler-specific

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-30 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16225973#comment-16225973 ] Haibo Chen commented on YARN-4511: -- Thanks [~leftnoteasy] for the review! bq. assert in the main code

[jira] [Commented] (YARN-4511) Common scheduler changes supporting scheduler-specific implementations

2017-10-30 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16226021#comment-16226021 ] Haibo Chen commented on YARN-4511: -- Filed YARN-7337 for SchedulerNodeReport changes because I think there

[jira] [Updated] (YARN-8244) TestContainerSchedulerQueuing.testStartMultipleContainers failed

2018-05-04 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8244: - Summary: TestContainerSchedulerQueuing.testStartMultipleContainers failed (was:

[jira] [Commented] (YARN-7715) Update CPU and Memory cgroups params on container update as well.

2018-05-04 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16464221#comment-16464221 ] Haibo Chen commented on YARN-7715: -- Thanks [~miklos.szeg...@cloudera.com] for the patch! I have two

[jira] [Updated] (YARN-8244) TestContainerSchedulerQueuing.testStartMultipleContainers failed

2018-05-04 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8244: - Description: {code:java}

[jira] [Commented] (YARN-7715) Update CPU and Memory cgroups params on container update as well.

2018-05-06 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16465285#comment-16465285 ] Haibo Chen commented on YARN-7715: -- I think the new patch would still call updateContainer() even when the

[jira] [Created] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-04 Thread Haibo Chen (JIRA)
Haibo Chen created YARN-8250: Summary: Create another implementation of ContainerScheduler to support NM overallocation Key: YARN-8250 URL: https://issues.apache.org/jira/browse/YARN-8250 Project: Hadoop

[jira] [Updated] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-04 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8250: - Issue Type: Sub-task (was: Improvement) Parent: YARN-1011 > Create another implementation of

[jira] [Updated] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8250: - Attachment: YARN-8250-YARN-1011.00.patch > Create another implementation of ContainerScheduler to support

[jira] [Updated] (YARN-8090) Race conditions in FadvisedChunkedFile

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8090: - Priority: Minor (was: Major) > Race conditions in FadvisedChunkedFile >

[jira] [Updated] (YARN-8090) Race conditions in FadvisedChunkedFile

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8090: - Description: When a file is closed multiple times by multiple threads, all but the first close will

[jira] [Commented] (YARN-6675) Add NM support to launch opportunistic containers based on overallocation

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16467949#comment-16467949 ] Haibo Chen commented on YARN-6675: -- Per offline discussion with [~miklos.szeg...@cloudera.com], reverting

[jira] [Commented] (YARN-7715) Update CPU and Memory cgroups params on container update as well.

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468012#comment-16468012 ] Haibo Chen commented on YARN-7715: -- Makes sense. Can you add containerId to all the warning messages to

[jira] [Commented] (YARN-8090) Race conditions in FadvisedChunkedFile

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468048#comment-16468048 ] Haibo Chen commented on YARN-8090: -- Thanks for the patch, [~miklos.szeg...@cloudera.com]. We can get file

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16474715#comment-16474715 ] Haibo Chen commented on YARN-8250: -- [~asuresh] Did you get a chance to look at the patch? > Create

[jira] [Updated] (YARN-6677) Preempt all opportunistic containers when root container cgroup goes over memory limit

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-6677?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-6677: - Summary: Preempt all opportunistic containers when root container cgroup goes over memory limit (was:

[jira] [Commented] (YARN-7933) [atsv2 read acls] Add TimelineWriter#writeDomain

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16474759#comment-16474759 ] Haibo Chen commented on YARN-7933: -- If it's not in our design, I am inclined to remove it at this point in

[jira] [Commented] (YARN-7933) [atsv2 read acls] Add TimelineWriter#writeDomain

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16474761#comment-16474761 ] Haibo Chen commented on YARN-7933: -- I am okay with just removing the TODO comment, and having the discussion

[jira] [Commented] (YARN-4599) Set OOM control for memory cgroups

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-4599?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16474543#comment-16474543 ] Haibo Chen commented on YARN-4599: -- Thanks [~miklos.szeg...@cloudera.com] for the patch! The

[jira] [Commented] (YARN-8130) Race condition when container events are published for KILLED applications

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16474552#comment-16474552 ] Haibo Chen commented on YARN-8130: -- Checking this in later today if no objection > Race condition when

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16475011#comment-16475011 ] Haibo Chen commented on YARN-8250: -- My understanding of SHED_QUEUED_CONTAINERS is to notify container

[jira] [Commented] (YARN-8248) Job hangs when a queue is specified and the maxResources of the queue cannot satisfy the AM resource request

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16475058#comment-16475058 ] Haibo Chen commented on YARN-8248: -- {quote}as {{RMAppManager.validateAndCreateResourceRequest()}} can

[jira] [Updated] (YARN-8248) Job hangs when a job requests a resource that its queue does not have

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8248: - Summary: Job hangs when a job requests a resource that its queue does not have (was: Job hangs when a

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-14 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16475100#comment-16475100 ] Haibo Chen commented on YARN-8250: -- [~leftnoteasy] Thanks for your comments. I agree that we should avoid

[jira] [Commented] (YARN-8090) Race conditions in FadvisedChunkedFile

2018-05-08 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8090?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16468132#comment-16468132 ] Haibo Chen commented on YARN-8090: -- +1 on the patch pending Jenkins. The precommit job

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469001#comment-16469001 ] Haibo Chen commented on YARN-8250: -- Thanks for the review, [~miklos.szeg...@cloudera.com]!

[jira] [Updated] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8250: - Attachment: YARN-8250-YARN-1011.01.patch > Create another implementation of ContainerScheduler to support

[jira] [Updated] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8250: - Attachment: YARN-8250-YARN-1011.02.patch > Create another implementation of ContainerScheduler to support

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469398#comment-16469398 ] Haibo Chen commented on YARN-8250: -- This is based on UpdateContainerTokenEvent. There are four

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469356#comment-16469356 ] Haibo Chen commented on YARN-8250: -- The unit test failures are unrelated. The ones in

[jira] [Updated] (YARN-7190) Ensure only NM classpath in 2.x gets TSv2 related hbase jars, not the user classpath

2018-04-27 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7190: - Affects Version/s: 3.0.3 3.0.1 > Ensure only NM classpath in 2.x gets TSv2 related

[jira] [Updated] (YARN-7190) Ensure only NM classpath in 2.x gets TSv2 related hbase jars, not the user classpath

2018-04-27 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7190?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7190: - Affects Version/s: 2.9.0 3.0.2 3.0.x > Ensure only NM

[jira] [Commented] (YARN-8250) Create another implementation of ContainerScheduler to support NM overallocation

2018-05-09 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8250?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16469577#comment-16469577 ] Haibo Chen commented on YARN-8250: -- Not sure how to address the last checkstyle issue, because we'd ignore

[jira] [Commented] (YARN-8130) Race condition when container events are published for KILLED applications

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470442#comment-16470442 ] Haibo Chen commented on YARN-8130: -- [~rohithsharma] I have one question about the race condition scenario.

[jira] [Commented] (YARN-7933) [atsv2 read acls] Add TimelineWriter#writeDomain

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470490#comment-16470490 ] Haibo Chen commented on YARN-7933: -- {quote} For storing in TimelineEntity, we should discuss it separately

[jira] [Updated] (YARN-8107) Give an informative message when incorrect format is used in ATSv2 filter attributes

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8107: - Issue Type: Sub-task (was: Bug) Parent: YARN-1011 > Give an informative message when incorrect

[jira] [Updated] (YARN-8129) Improve error message for invalid value in fields attribute

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8129: - Issue Type: Sub-task (was: Improvement) Parent: YARN-1011 > Improve error message for invalid

[jira] [Updated] (YARN-8129) Improve error message for invalid value in fields attribute

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8129?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8129: - Parent Issue: YARN-7055 (was: YARN-1011) > Improve error message for invalid value in fields attribute >

[jira] [Updated] (YARN-8107) Give an informative message when incorrect format is used in ATSv2 filter attributes

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8107?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8107: - Parent Issue: YARN-7055 (was: YARN-1011) > Give an informative message when incorrect format is used in

[jira] [Updated] (YARN-8132) Final Status of applications shown as UNDEFINED in ATS app queries

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8132: - Issue Type: Sub-task (was: Bug) Parent: YARN-7055 > Final Status of applications shown as

[jira] [Updated] (YARN-8253) HTTPS Ats v2 api call fails with "bad HTTP parsed"

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8253: - Issue Type: Sub-task (was: Bug) Parent: YARN-7055 > HTTPS Ats v2 api call fails with "bad HTTP

[jira] [Updated] (YARN-8270) Adding JMX Metrics for Timeline Collector and Reader

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8270: - Issue Type: Sub-task (was: Improvement) Parent: YARN-7055 > Adding JMX Metrics for Timeline

[jira] [Updated] (YARN-8130) Race condition when container events are published for KILLED applications

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8130: - Issue Type: Sub-task (was: Bug) Parent: YARN-7055 > Race condition when container events are

[jira] [Commented] (YARN-8132) Final Status of applications shown as UNDEFINED in ATS app queries

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8132?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470462#comment-16470462 ] Haibo Chen commented on YARN-8132: -- Both YARN_APPLICATION_STATE and YARN_APPLICATON_FINAL_STATUS are

[jira] [Updated] (YARN-8247) Incorrect HTTP status code returned by ATSv2 for non-whitelisted users

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8247?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8247: - Issue Type: Sub-task (was: Bug) Parent: YARN-7055 > Incorrect HTTP status code returned by ATSv2

[jira] [Updated] (YARN-8215) ATS v2 returns invalid YARN_CONTAINER_ALLOCATED_HOST_HTTP_ADDRESS from NM

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-8215: - Issue Type: Sub-task (was: Bug) Parent: YARN-7055 > ATS v2 returns invalid

[jira] [Commented] (YARN-7715) Update CPU and Memory cgroups params on container update as well.

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16470866#comment-16470866 ] Haibo Chen commented on YARN-7715: -- +1. Checking this in shortly. > Update CPU and Memory cgroups params

[jira] [Updated] (YARN-7715) Support NM promotion/demotion of running containers.

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7715?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Haibo Chen updated YARN-7715: - Summary: Support NM promotion/demotion of running containers. (was: Update CPU and Memory cgroups params

[jira] [Commented] (YARN-8248) Job hangs when queue is specified and that queue has 0 capability of a resource

2018-05-10 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16471309#comment-16471309 ] Haibo Chen commented on YARN-8248: -- Thanks [~snemeth] for the patch. I have some questions. 1) Why the

[jira] [Commented] (YARN-7933) [atsv2 read acls] Add TimelineWriter#writeDomain

2018-05-11 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16472077#comment-16472077 ] Haibo Chen commented on YARN-7933: -- [~rohithsharma] There is a TODO comment in TimelineCollector.putDomain()

[jira] [Commented] (YARN-7933) [atsv2 read acls] Add TimelineWriter#writeDomain

2018-05-11 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-7933?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16472082#comment-16472082 ] Haibo Chen commented on YARN-7933: -- While thinking of the appId issue, there's one question that occurred

[jira] [Commented] (YARN-8130) Race condition when container events are published for KILLED applications

2018-05-11 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8130?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16472010#comment-16472010 ] Haibo Chen commented on YARN-8130: -- [~rohithsharma] Do you think it's viable that we just generate a new

[jira] [Commented] (YARN-8268) Fair scheduler: reservable queue is configured both as parent and leaf queue

2018-05-11 Thread Haibo Chen (JIRA)
[ https://issues.apache.org/jira/browse/YARN-8268?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16472423#comment-16472423 ] Haibo Chen commented on YARN-8268: -- +1. Checking this in shortly. > Fair scheduler: reservable queue is
