[jira] [Assigned] (YARN-10890) Node Attributes in Distributed mapping misses update to scheduler when node gets decommissioned/recommissioned

2021-08-18 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi reassigned YARN-10890: --- Assignee: Tarun Parimi > Node Attributes in Distributed mapping misses update to scheduler

[jira] [Created] (YARN-10890) Node Attributes in Distributed mapping misses update to scheduler when node gets decommissioned/recommissioned

2021-08-18 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10890: --- Summary: Node Attributes in Distributed mapping misses update to scheduler when node gets decommissioned/recommissioned Key: YARN-10890 URL:

[jira] [Commented] (YARN-9907) Make YARN Service AM RPC port configurable

2021-07-30 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17390541#comment-17390541 ] Tarun Parimi commented on YARN-9907: [~pbacsko], yes you are right. We can close this as duplicate

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-07-14 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17380533#comment-17380533 ] Tarun Parimi commented on YARN-10789: - [~snemeth], Looks like the build didnt get triggered till now

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: (was: YARN-10789.branch-3.2.001.patch) > RM HA startup can fail due to race

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.branch-3.2.001.patch > RM HA startup can fail due to race conditions in

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-25 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17369280#comment-17369280 ] Tarun Parimi commented on YARN-10789: - [~snemeth], reattaching the 3.2 patch to trigger build. Looks

[jira] [Commented] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-25 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17369282#comment-17369282 ] Tarun Parimi commented on YARN-10828: - Thanks [~snemeth] for reviewing this and committting. >

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-25 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: (was: YARN-10789.branch-3.2.001.patch) > RM HA startup can fail due to race

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-25 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.branch-3.2.001.patch > RM HA startup can fail due to race conditions in

[jira] [Commented] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-24 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17368738#comment-17368738 ] Tarun Parimi commented on YARN-10828: - The test failures are not related to this patch. > Backport

[jira] [Commented] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17367601#comment-17367601 ] Tarun Parimi commented on YARN-10828: - [~snemeth], please review this when you get time. Thanks. >

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17367600#comment-17367600 ] Tarun Parimi commented on YARN-10789: - [~snemeth], I have created YARN-10828 to backport YARN-9789 to

[jira] [Assigned] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi reassigned YARN-10828: --- Assignee: Tarun Parimi Submitting a backport patch for branch-3.2. Validated that related

[jira] [Updated] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10828: Attachment: YARN-10828.branch-3.2.001.patch > Backport YARN-9789 to branch-3.2 >

[jira] [Created] (YARN-10828) Backport YARN-9789 to branch-3.2

2021-06-22 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10828: --- Summary: Backport YARN-9789 to branch-3.2 Key: YARN-10828 URL: https://issues.apache.org/jira/browse/YARN-10828 Project: Hadoop YARN Issue Type: Bug

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17367553#comment-17367553 ] Tarun Parimi commented on YARN-10789: - [~snemeth], the failing test in TestZKConfigurationStore is

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-15 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.branch-3.2.001.patch > RM HA startup can fail due to race conditions in

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-15 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17363576#comment-17363576 ] Tarun Parimi commented on YARN-10789: - Reattached Patch for branch-3.2 since jenkins triggerred only

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-15 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: (was: YARN-10789.branch-3.2.001.patch) > RM HA startup can fail due to race

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-14 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.branch-3.3.001.patch YARN-10789.branch-3.2.001.patch > RM

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-06-14 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17362820#comment-17362820 ] Tarun Parimi commented on YARN-10789: - Thanks [~snemeth] for the review and commit. Thanks

[jira] [Commented] (YARN-10816) Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple

2021-06-14 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17362752#comment-17362752 ] Tarun Parimi commented on YARN-10816: - Thanks [~snemeth] for the review and commit. > Avoid doing

[jira] [Commented] (YARN-10816) Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple

2021-06-10 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17360708#comment-17360708 ] Tarun Parimi commented on YARN-10816: - [~snemeth], please review this when you get some time. >

[jira] [Updated] (YARN-10816) Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple

2021-06-10 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10816: Attachment: YARN-10816.002.patch > Avoid doing delegation token ops when >

[jira] [Updated] (YARN-10816) Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple

2021-06-10 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10816: Attachment: YARN-10816.001.patch > Avoid doing delegation token ops when >

[jira] [Created] (YARN-10816) Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple

2021-06-09 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10816: --- Summary: Avoid doing delegation token ops when yarn.timeline-service.http-authentication.type=simple Key: YARN-10816 URL: https://issues.apache.org/jira/browse/YARN-10816

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-31 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17354360#comment-17354360 ] Tarun Parimi commented on YARN-10789: - Thanks [~snemeth] . Please also take a look at this when you

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-27 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17352401#comment-17352401 ] Tarun Parimi commented on YARN-10789: - Thanks [~sunilg]. Added warn log in the latest patch. > RM HA

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-27 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.002.patch > RM HA startup can fail due to race conditions in

[jira] [Commented] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-26 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17352290#comment-17352290 ] Tarun Parimi commented on YARN-10789: - Tested this patch only manually with a stability check with RM

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-26 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Attachment: YARN-10789.001.patch > RM HA startup can fail due to race conditions in

[jira] [Updated] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-26 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10789?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10789: Description: We are observing below error randomly during hadoop install and RM initial startup

[jira] [Created] (YARN-10789) RM HA startup can fail due to race conditions in ZKConfigurationStore

2021-05-26 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10789: --- Summary: RM HA startup can fail due to race conditions in ZKConfigurationStore Key: YARN-10789 URL: https://issues.apache.org/jira/browse/YARN-10789 Project: Hadoop

[jira] [Commented] (YARN-8564) Add queue level application lifetime monitor in FairScheduler

2021-05-18 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-8564?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17346849#comment-17346849 ] Tarun Parimi commented on YARN-8564: [~zhuqi], Any reason this jira got resolved? I don't see this

[jira] [Updated] (YARN-10007) YARN logs contain environment variables, which is a security risk

2020-12-15 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10007: Issue Type: New Feature (was: Bug) > YARN logs contain environment variables, which is a security

[jira] [Updated] (YARN-10458) Hive On Tez queries fails upon submission to dynamically created pools

2020-10-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10458?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10458: Description: While using Dynamic Auto-Creation and Management of Leaf Queues, we could see that

[jira] [Created] (YARN-10446) Capacity Scheduler page displays incorrect Configured Capacity

2020-09-23 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10446: --- Summary: Capacity Scheduler page displays incorrect Configured Capacity Key: YARN-10446 URL: https://issues.apache.org/jira/browse/YARN-10446 Project: Hadoop YARN

[jira] [Resolved] (YARN-10440) resource manager hangs,and i cannot submit any new jobs,but rm and nm processes are normal

2020-09-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10440?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi resolved YARN-10440. - Resolution: Duplicate Seems to be similar to YARN-8513 . The default config change in YARN-8896

[jira] [Commented] (YARN-10159) TimelineConnector does not destroy the jersey client

2020-09-04 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17190741#comment-17190741 ] Tarun Parimi commented on YARN-10159: - [~prabhujoseph] . This issue is there even for ats v1 client

[jira] [Updated] (YARN-10159) TimelineConnector does not destroy the jersey client

2020-09-04 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10159: Attachment: YARN-10159-branch-2.8.001.patch > TimelineConnector does not destroy the jersey client

[jira] [Commented] (YARN-10377) Clicking on queue in Capacity Scheduler legacy ui does not show any applications

2020-08-04 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17171258#comment-17171258 ] Tarun Parimi commented on YARN-10377: - Thanks for the review and commit [~prabhujoseph] > Clicking

[jira] [Commented] (YARN-10377) Clicking on queue in Capacity Scheduler legacy ui does not show any applications

2020-08-03 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17170087#comment-17170087 ] Tarun Parimi commented on YARN-10377: - Thanks [~prabhujoseph] . I have tested it manually and it

[jira] [Updated] (YARN-10377) Clicking on queue in Capacity Scheduler legacy ui does not show any applications

2020-08-03 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10377: Attachment: YARN-10377.001.patch > Clicking on queue in Capacity Scheduler legacy ui does not show

[jira] [Assigned] (YARN-10377) Clicking on queue in Capacity Scheduler legacy ui does not show any applications

2020-08-03 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi reassigned YARN-10377: --- Assignee: Tarun Parimi > Clicking on queue in Capacity Scheduler legacy ui does not show

[jira] [Resolved] (YARN-10378) When NM goes down and comes back up, PC allocation tags are not removed for completed containers

2020-07-30 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi resolved YARN-10378. - Resolution: Duplicate Looks like YARN-10034 fixes this issue for NM going down scenario also.

[jira] [Updated] (YARN-10378) When NM goes down and comes back up, PC allocation tags are not removed for completed containers

2020-07-30 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10378: Description: We are using placement constaints anti-affinity in an application along with node

[jira] [Created] (YARN-10378) When NM goes down and comes back up, PC allocation tags are not removed for completed containers

2020-07-30 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10378: --- Summary: When NM goes down and comes back up, PC allocation tags are not removed for completed containers Key: YARN-10378 URL: https://issues.apache.org/jira/browse/YARN-10378

[jira] [Created] (YARN-10377) Clicking on queue in Capacity Scheduler legacy ui does not show any applications

2020-07-29 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10377: --- Summary: Clicking on queue in Capacity Scheduler legacy ui does not show any applications Key: YARN-10377 URL: https://issues.apache.org/jira/browse/YARN-10377

[jira] [Commented] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-17 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17159782#comment-17159782 ] Tarun Parimi commented on YARN-10339: - Thanks for the review [~prabhujoseph] > Timeline Client in

[jira] [Commented] (YARN-10340) HsWebServices getContainerReport uses loginUser instead of remoteUser to access ApplicationClientProtocol

2020-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17153225#comment-17153225 ] Tarun Parimi commented on YARN-10340: - [~prabhujoseph], The issue is because the

[jira] [Comment Edited] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152548#comment-17152548 ] Tarun Parimi edited comment on YARN-10339 at 7/7/20, 8:17 AM: -- Thanks

[jira] [Commented] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10339?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152548#comment-17152548 ] Tarun Parimi commented on YARN-10339: - Thanks [~prabhujoseph] . When atsv1 is enabled, delegation

[jira] [Updated] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10339: Attachment: YARN-10339.002.patch > Timeline Client in Nodemanager gets 403 errors when simple auth

[jira] [Commented] (YARN-10340) HsWebServices getContainerReport uses loginUser instead of remoteUser to access ApplicationClientProtocol

2020-07-07 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10340?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17152501#comment-17152501 ] Tarun Parimi commented on YARN-10340: - [~prabhujoseph],[~brahmareddy] The WebServices#getContainer

[jira] [Updated] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-06 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10339?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10339: Attachment: YARN-10339.001.patch > Timeline Client in Nodemanager gets 403 errors when simple auth

[jira] [Created] (YARN-10339) Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments

2020-07-06 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10339: --- Summary: Timeline Client in Nodemanager gets 403 errors when simple auth is used in kerberos environments Key: YARN-10339 URL: https://issues.apache.org/jira/browse/YARN-10339

[jira] [Comment Edited] (YARN-10283) Capacity Scheduler: starvation occurs if a higher priority queue is full and node labels are used

2020-05-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113342#comment-17113342 ] Tarun Parimi edited comment on YARN-10283 at 5/21/20, 4:31 PM: --- Thanks

[jira] [Commented] (YARN-10283) Capacity Scheduler: starvation occurs if a higher priority queue is full and node labels are used

2020-05-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10283?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17113342#comment-17113342 ] Tarun Parimi commented on YARN-10283: - Thanks for the repro test patch. The POC patch changes the

[jira] [Commented] (YARN-10240) Prevent Fatal CancelledException in TimelineV2Client when stopping

2020-04-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17088547#comment-17088547 ] Tarun Parimi commented on YARN-10240: - Thanks for the review [~prabhujoseph] > Prevent Fatal

[jira] [Assigned] (YARN-10240) Prevent Fatal CancelledException in TimelineV2Client when stopping

2020-04-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi reassigned YARN-10240: --- Assignee: Tarun Parimi > Prevent Fatal CancelledException in TimelineV2Client when stopping

[jira] [Updated] (YARN-10240) Prevent Fatal CancelledException in TimelineV2Client when stopping

2020-04-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10240: Attachment: YARN-10240.001.patch > Prevent Fatal CancelledException in TimelineV2Client when

[jira] [Created] (YARN-10240) Prevent Fatal CancelledException in TimelineV2Client when stopping

2020-04-20 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10240: --- Summary: Prevent Fatal CancelledException in TimelineV2Client when stopping Key: YARN-10240 URL: https://issues.apache.org/jira/browse/YARN-10240 Project: Hadoop YARN

[jira] [Updated] (YARN-9816) EntityGroupFSTimelineStore#scanActiveLogs fails when undesired files are present under /ats/active.

2020-03-18 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9816?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9816: --- Affects Version/s: 2.8.0 > EntityGroupFSTimelineStore#scanActiveLogs fails when undesired files are

[jira] [Commented] (YARN-9967) Fix NodeManager failing to start when Hdfs Auxillary Jar is set

2020-03-05 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9967?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17053104#comment-17053104 ] Tarun Parimi commented on YARN-9967: Hi [~snemeth], You can take it over. Thanks. > Fix NodeManager

[jira] [Updated] (YARN-10149) container-executor exits with 139 when the permissions of yarn log directory is improper

2020-02-18 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10149: Description: container-executor fails with segmentation fault and exit code 139 when the

[jira] [Updated] (YARN-10149) container-executor exits with 139 when the permissions of yarn log directory is improper

2020-02-18 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-10149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-10149: Description: container-executor fails with segmentation fault and exit code 139 when the

[jira] [Created] (YARN-10149) container-executor exits with 139 when the permissions of yarn log directory is improper

2020-02-18 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-10149: --- Summary: container-executor exits with 139 when the permissions of yarn log directory is improper Key: YARN-10149 URL: https://issues.apache.org/jira/browse/YARN-10149

[jira] [Commented] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16979324#comment-16979324 ] Tarun Parimi commented on YARN-9968: [~snemeth] , Please review this when you get time.  > Public

[jira] [Updated] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9968: --- Attachment: YARN-9968.001.patch > Public Localizer is exiting in NodeManager due to

[jira] [Comment Edited] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16973352#comment-16973352 ] Tarun Parimi edited comment on YARN-9968 at 11/13/19 1:56 PM: -- [~snemeth], I

[jira] [Commented] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16973352#comment-16973352 ] Tarun Parimi commented on YARN-9968: [~snemeth], I was finally able reproduce it artificially in my

[jira] [Comment Edited] (YARN-9925) CapacitySchedulerQueueManager allows unsupported Queue hierarchy

2019-11-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16973285#comment-16973285 ] Tarun Parimi edited comment on YARN-9925 at 11/13/19 12:08 PM: --- [~vinodkv] ,

[jira] [Commented] (YARN-9925) CapacitySchedulerQueueManager allows unsupported Queue hierarchy

2019-11-13 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16973285#comment-16973285 ] Tarun Parimi commented on YARN-9925: [~vinodkv] , it is fine for me. I was searching for the

[jira] [Commented] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-12 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9968?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16972420#comment-16972420 ] Tarun Parimi commented on YARN-9968: Hi [~snemeth]. Thanks for looking into this. The issue is not

[jira] [Created] (YARN-9968) Public Localizer is exiting in NodeManager due to NullPointerException

2019-11-12 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9968: -- Summary: Public Localizer is exiting in NodeManager due to NullPointerException Key: YARN-9968 URL: https://issues.apache.org/jira/browse/YARN-9968 Project: Hadoop YARN

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-24 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16958613#comment-16958613 ] Tarun Parimi commented on YARN-9921: Thanks for the reviews [~tangzhankun] and [~prabhujoseph#1] >

[jira] [Commented] (YARN-9772) CapacitySchedulerQueueManager has incorrect list of queues

2019-10-23 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957776#comment-16957776 ] Tarun Parimi commented on YARN-9772: The operators having several hundreds of queues might

[jira] [Commented] (YARN-9928) ATSv2 can make NM go down with a FATAL error while it is resyncing with RM

2019-10-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16957101#comment-16957101 ] Tarun Parimi commented on YARN-9928: The issue is occurring since container returned in below code

[jira] [Updated] (YARN-9928) ATSv2 can make NM go down with a FATAL error while it is resyncing with RM

2019-10-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9928: --- Component/s: ATSv2 > ATSv2 can make NM go down with a FATAL error while it is resyncing with RM >

[jira] [Updated] (YARN-9928) ATSv2 can make NM go down with a FATAL error while it is resyncing with RM

2019-10-22 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9928: --- Affects Version/s: 3.1.0 > ATSv2 can make NM go down with a FATAL error while it is resyncing with RM

[jira] [Created] (YARN-9928) ATSv2 can make NM go down with a FATAL error while it is resyncing with RM

2019-10-22 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9928: -- Summary: ATSv2 can make NM go down with a FATAL error while it is resyncing with RM Key: YARN-9928 URL: https://issues.apache.org/jira/browse/YARN-9928 Project: Hadoop

[jira] [Commented] (YARN-9773) Add QueueMetrics for Custom Resources

2019-10-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9773?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955958#comment-16955958 ] Tarun Parimi commented on YARN-9773: Got a findbugs warning from the changes done in this jira.

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955955#comment-16955955 ] Tarun Parimi commented on YARN-9921: The Findbugs warning is due to the changes done in YARN-9773 and

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-21 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955803#comment-16955803 ] Tarun Parimi commented on YARN-9921: Thanks for the review [~tangzhankun]. > Issue in

[jira] [Comment Edited] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955763#comment-16955763 ] Tarun Parimi edited comment on YARN-9921 at 10/21/19 5:55 AM: -- Submitting a

[jira] [Updated] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9921: --- Attachment: YARN-9921.001.patch > Issue in PlacementConstraint when YARN Service AM retries

[jira] [Commented] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16955755#comment-16955755 ] Tarun Parimi commented on YARN-9921: On debugging this, I found that the targetExpressions object is

[jira] [Updated] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9921: --- Attachment: differenceProtobuf.png > Issue in PlacementConstraint when YARN Service AM retries

[jira] [Created] (YARN-9921) Issue in PlacementConstraint when YARN Service AM retries allocation on component failure.

2019-10-20 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9921: -- Summary: Issue in PlacementConstraint when YARN Service AM retries allocation on component failure. Key: YARN-9921 URL: https://issues.apache.org/jira/browse/YARN-9921

[jira] [Updated] (YARN-9907) Make YARN Service AM RPC port configurable

2019-10-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9907?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9907: --- Attachment: YARN-9907.001.patch > Make YARN Service AM RPC port configurable >

[jira] [Created] (YARN-9907) Make YARN Service AM RPC port configurable

2019-10-16 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9907: -- Summary: Make YARN Service AM RPC port configurable Key: YARN-9907 URL: https://issues.apache.org/jira/browse/YARN-9907 Project: Hadoop YARN Issue Type: Bug

[jira] [Updated] (YARN-9903) Support reservations continue looking for Node Labels

2019-10-15 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9903: --- Description: YARN-1769 brought in reservations continue looking feature which improves the several

[jira] [Created] (YARN-9903) Support reservations continue looking for Node Labels

2019-10-15 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9903: -- Summary: Support reservations continue looking for Node Labels Key: YARN-9903 URL: https://issues.apache.org/jira/browse/YARN-9903 Project: Hadoop YARN Issue

[jira] [Commented] (YARN-8786) LinuxContainerExecutor fails sporadically in create_local_dirs

2019-09-19 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-8786?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16933274#comment-16933274 ] Tarun Parimi commented on YARN-8786: YARN-9833 could fix this issue > LinuxContainerExecutor fails

[jira] [Commented] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16931109#comment-16931109 ] Tarun Parimi commented on YARN-9837: Thanks for the review [~eyang] . > YARN Service fails to fetch

[jira] [Updated] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tarun Parimi updated YARN-9837: --- Attachment: YARN-9837.001.patch > YARN Service fails to fetch status for Stopped apps with bigger

[jira] [Created] (YARN-9837) YARN Service fails to fetch status for Stopped apps with bigger spec files

2019-09-16 Thread Tarun Parimi (Jira)
Tarun Parimi created YARN-9837: -- Summary: YARN Service fails to fetch status for Stopped apps with bigger spec files Key: YARN-9837 URL: https://issues.apache.org/jira/browse/YARN-9837 Project: Hadoop

[jira] [Commented] (YARN-9772) CapacitySchedulerQueueManager has incorrect list of queues

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930521#comment-16930521 ] Tarun Parimi commented on YARN-9772: bq. Should we extend the duplicates check (as of now, it does

[jira] [Commented] (YARN-9794) RM crashes due to runtime errors in TimelineServiceV2Publisher

2019-09-16 Thread Tarun Parimi (Jira)
[ https://issues.apache.org/jira/browse/YARN-9794?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16930503#comment-16930503 ] Tarun Parimi commented on YARN-9794: Thanks [~abmodi],[~Prabhu Joseph] for the reviews and commit. >

  1   2   3   >