Wangda Tan created YARN-10535:
-
Summary: Make changes in queue placement policy to use
auto-queue-placement API in CapacityScheduler
Key: YARN-10535
URL: https://issues.apache.org/jira/browse/YARN-10535
P
Wangda Tan created YARN-10532:
-
Summary: Capacity Scheduler Auto Queue Creation: Allow auto delete
queue when queue is not being used
Key: YARN-10532
URL: https://issues.apache.org/jira/browse/YARN-10532
Wangda Tan created YARN-10531:
-
Summary: Be able to disable user limit factor for
CapacityScheduler Leaf Queue
Key: YARN-10531
URL: https://issues.apache.org/jira/browse/YARN-10531
Project: Hadoop YARN
Wangda Tan created YARN-10530:
-
Summary: CapacityScheduler ResourceLimits doesn't handle node
partition well
Key: YARN-10530
URL: https://issues.apache.org/jira/browse/YARN-10530
Project: Hadoop YARN
Wangda Tan created YARN-10497:
-
Summary: Fix an issue in CapacityScheduler which fails to delete
queues
Key: YARN-10497
URL: https://issues.apache.org/jira/browse/YARN-10497
Project: Hadoop YARN
Wangda Tan created YARN-10496:
-
Summary: [Umbrella] Support Flexible Auto Queue Creation in
Capacity Scheduler
Key: YARN-10496
URL: https://issues.apache.org/jira/browse/YARN-10496
Project: Hadoop YARN
Wangda Tan created YARN-10380:
-
Summary: Import logic of multi-node allocation in CapacityScheduler
Key: YARN-10380
URL: https://issues.apache.org/jira/browse/YARN-10380
Project: Hadoop YARN
Issu
[
https://issues.apache.org/jira/browse/YARN-10151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-10151.
---
Resolution: Won't Fix
Thanks folks for commenting about YARN-9838. I think we don't need this change
Wangda Tan created YARN-10170:
-
Summary: Should revisit mix-usage of percentage-based and
absolute-value-based min/max resource in CapacityScheduler
Key: YARN-10170
URL: https://issues.apache.org/jira/browse/YARN-1017
Wangda Tan created YARN-10169:
-
Summary: Mixed absolute resource value and percentage-based
resource value in CapacityScheduler should fail
Key: YARN-10169
URL: https://issues.apache.org/jira/browse/YARN-10169
Wangda Tan created YARN-10168:
-
Summary: FS-CS Convert: Converter tool doesn't handle min/max
resource conversion correct
Key: YARN-10168
URL: https://issues.apache.org/jira/browse/YARN-10168
Project: Had
Wangda Tan created YARN-10167:
-
Summary: Need validate c-s.xml after converting
Key: YARN-10167
URL: https://issues.apache.org/jira/browse/YARN-10167
Project: Hadoop YARN
Issue Type: Sub-task
Wangda Tan created YARN-10151:
-
Summary: Disable Capacity Scheduler's move app between queue
functionality
Key: YARN-10151
URL: https://issues.apache.org/jira/browse/YARN-10151
Project: Hadoop YARN
[
https://issues.apache.org/jira/browse/YARN-8975?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8975.
--
Resolution: Fixed
Hadoop Flags: Reviewed
Fix Version/s: 3.3.0
Committed to trunk, thanks [~t
[
https://issues.apache.org/jira/browse/YARN-9020?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-9020.
--
Resolution: Duplicate
Thanks [~jutia] for reporting this. It is a valid issue.
This is dup of YARN-8917
Wangda Tan created YARN-8993:
Summary: [Submarine] Add support to run deep learning workload in
non-Docker containers
Key: YARN-8993
URL: https://issues.apache.org/jira/browse/YARN-8993
Project: Hadoop YA
[
https://issues.apache.org/jira/browse/YARN-8237?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8237.
--
Resolution: Duplicate
> mxnet yarn spec file to add to native service examples
> ---
[
https://issues.apache.org/jira/browse/YARN-8238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8238.
--
Resolution: Fixed
Closing as dup of YARN-8135.
> [Umbrella] YARN deep learning framework examples to r
[
https://issues.apache.org/jira/browse/YARN-8513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8513.
--
Resolution: Duplicate
Fix Version/s: (was: 3.2.1)
(was: 3.1.2)
Reopen
[
https://issues.apache.org/jira/browse/YARN-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8896.
--
Resolution: Fixed
> Limit the maximum number of container assignments per heartbeat
> --
Wangda Tan created YARN-8858:
Summary: CapacityScheduler should respect maximum node resource
when per-queue maximum-allocation is being used.
Key: YARN-8858
URL: https://issues.apache.org/jira/browse/YARN-8858
Wangda Tan created YARN-8817:
Summary: [Submarine] In some cases HDFS is not asked by user when
submit job but framework requires user to set HDFS related environments
Key: YARN-8817
URL: https://issues.apache.org/jir
[
https://issues.apache.org/jira/browse/YARN-8799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8799.
--
Resolution: Duplicate
This should be duplicated by YARN-8757.
> [Submarine] Correct the default direct
Wangda Tan created YARN-8800:
Summary: Updated documentation of Submarine with latest examples.
Key: YARN-8800
URL: https://issues.apache.org/jira/browse/YARN-8800
Project: Hadoop YARN
Issue Type
Wangda Tan created YARN-8770:
Summary: [Submarine] Support using Submarine to submit Pytorch job
Key: YARN-8770
URL: https://issues.apache.org/jira/browse/YARN-8770
Project: Hadoop YARN
Issue Typ
Wangda Tan created YARN-8769:
Summary: [Submarine] Allow user to specify customized quicklink(s)
when submit Submarine job
Key: YARN-8769
URL: https://issues.apache.org/jira/browse/YARN-8769
Project: Hado
Wangda Tan created YARN-8757:
Summary: [Submarine] Add Tensorboard component when --tensorboard
is specified
Key: YARN-8757
URL: https://issues.apache.org/jira/browse/YARN-8757
Project: Hadoop YARN
Wangda Tan created YARN-8756:
Summary: [Submarine] Properly handle relative path for staging area
Key: YARN-8756
URL: https://issues.apache.org/jira/browse/YARN-8756
Project: Hadoop YARN
Issue Ty
Wangda Tan created YARN-8716:
Summary: [Submarine] Support passing Kerberos principle tokens
when launch training jobs.
Key: YARN-8716
URL: https://issues.apache.org/jira/browse/YARN-8716
Project: Hadoop
Wangda Tan created YARN-8713:
Summary: [Submarine] Support deploy model serving for existing
models
Key: YARN-8713
URL: https://issues.apache.org/jira/browse/YARN-8713
Project: Hadoop YARN
Issue
Wangda Tan created YARN-8714:
Summary: [Submarine] Support files/tarballs to be localized for a
training job.
Key: YARN-8714
URL: https://issues.apache.org/jira/browse/YARN-8714
Project: Hadoop YARN
Wangda Tan created YARN-8712:
Summary: [Submarine] Support create models / versions for training
result.
Key: YARN-8712
URL: https://issues.apache.org/jira/browse/YARN-8712
Project: Hadoop YARN
Wangda Tan created YARN-8657:
Summary: User limit calculation should be read-lock-protected
within LeafQueue
Key: YARN-8657
URL: https://issues.apache.org/jira/browse/YARN-8657
Project: Hadoop YARN
Wangda Tan created YARN-8563:
Summary: Support users to specify Python/TF
package/version/dependencies for training job.
Key: YARN-8563
URL: https://issues.apache.org/jira/browse/YARN-8563
Project: Hadoop
Wangda Tan created YARN-8561:
Summary: Add submarine initial implementation: training job
submission and job history retrieve.
Key: YARN-8561
URL: https://issues.apache.org/jira/browse/YARN-8561
Project:
Wangda Tan created YARN-8545:
Summary: YARN native service should return container if launch
failed
Key: YARN-8545
URL: https://issues.apache.org/jira/browse/YARN-8545
Project: Hadoop YARN
Issue
Wangda Tan created YARN-8506:
Summary: Make GetApplicationsRequestPBImpl thread safe
Key: YARN-8506
URL: https://issues.apache.org/jira/browse/YARN-8506
Project: Hadoop YARN
Issue Type: Task
Wangda Tan created YARN-8489:
Summary: Need to support customer termination policy for native
services
Key: YARN-8489
URL: https://issues.apache.org/jira/browse/YARN-8489
Project: Hadoop YARN
Is
Wangda Tan created YARN-8488:
Summary: Need to add "SUCCEED" state to YARN service
Key: YARN-8488
URL: https://issues.apache.org/jira/browse/YARN-8488
Project: Hadoop YARN
Issue Type: Task
[
https://issues.apache.org/jira/browse/YARN-8478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8478.
--
Resolution: Duplicate
> The capacity scheduler logs too frequently seriously affecting performance
> ---
Wangda Tan created YARN-8466:
Summary: Add Chaos Monkey unit test framework for validation in
scale
Key: YARN-8466
URL: https://issues.apache.org/jira/browse/YARN-8466
Project: Hadoop YARN
Issue
Wangda Tan created YARN-8459:
Summary: Capacity Scheduler should properly handle container
allocation on app/node when app/node being removed by scheduler
Key: YARN-8459
URL: https://issues.apache.org/jira/browse/YARN
Wangda Tan created YARN-8417:
Summary: Should skip passing HDFS_HOME, HADOOP_CONF_DIR,
JAVA_HOME, etc. to Docker container.
Key: YARN-8417
URL: https://issues.apache.org/jira/browse/YARN-8417
Project: Had
[
https://issues.apache.org/jira/browse/YARN-8220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8220.
--
Resolution: Later
> Running Tensorflow on YARN with GPU and Docker - Examples
>
Wangda Tan created YARN-8379:
Summary: Add an option to allow Capacity Scheduler preemption to
balance satisfied queues
Key: YARN-8379
URL: https://issues.apache.org/jira/browse/YARN-8379
Project: Hadoop
Wangda Tan created YARN-8343:
Summary: YARN should have ability to run images only from a
whitelist docker registries
Key: YARN-8343
URL: https://issues.apache.org/jira/browse/YARN-8343
Project: Hadoop YA
Wangda Tan created YARN-8342:
Summary: Using docker image from a non-privileged registry, the
launch_command is not honored
Key: YARN-8342
URL: https://issues.apache.org/jira/browse/YARN-8342
Project: Had
Wangda Tan created YARN-8340:
Summary: Capacity Scheduler Intra Queue Preemption Should Work
When 3rd or more resources enabled.
Key: YARN-8340
URL: https://issues.apache.org/jira/browse/YARN-8340
Project
[
https://issues.apache.org/jira/browse/YARN-8272?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-8272.
--
Resolution: Duplicate
Closing as dup of HADOOP-15374
> Several items are missing from Hadoop 3.1.0 docum
Wangda Tan created YARN-8272:
Summary: Several items are missing from Hadoop 3.1.0 documentation
Key: YARN-8272
URL: https://issues.apache.org/jira/browse/YARN-8272
Project: Hadoop YARN
Issue Typ
Wangda Tan created YARN-8257:
Summary: Native service should automatically adding escapes for
environment/launch cmd before sending to YARN
Key: YARN-8257
URL: https://issues.apache.org/jira/browse/YARN-8257
Wangda Tan created YARN-8149:
Summary: Revisit behavior of Re-Reservation in Capacity Scheduler
Key: YARN-8149
URL: https://issues.apache.org/jira/browse/YARN-8149
Project: Hadoop YARN
Issue Type
Wangda Tan created YARN-8141:
Summary: YARN Native Service: Respect
YARN_CONTAINER_RUNTIME_DOCKER_LOCAL_RESOURCE_MOUNTS specified in service spec
Key: YARN-8141
URL: https://issues.apache.org/jira/browse/YARN-8141
Wangda Tan created YARN-8135:
Summary: Hadoop {Submarine} Project: Simple and scalable
deployment of deep learning training / serving jobs on Hadoop
Key: YARN-8135
URL: https://issues.apache.org/jira/browse/YARN-8135
Wangda Tan created YARN-8109:
Summary: Resource Manager WebApps fails to start due to
ConcurrentModificationException
Key: YARN-8109
URL: https://issues.apache.org/jira/browse/YARN-8109
Project: Hadoop Y
Wangda Tan created YARN-8091:
Summary: Revisit checkUserAccessToQueue RM REST API
Key: YARN-8091
URL: https://issues.apache.org/jira/browse/YARN-8091
Project: Hadoop YARN
Issue Type: Task
[
https://issues.apache.org/jira/browse/YARN-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-5881.
--
Resolution: Done
> Enable configuration of queue capacity in terms of absolute resources
> --
Wangda Tan created YARN-8084:
Summary: Yarn native service rename for easier development?
Key: YARN-8084
URL: https://issues.apache.org/jira/browse/YARN-8084
Project: Hadoop YARN
Issue Type: Task
Wangda Tan created YARN-8080:
Summary: YARN native service should support component restart
policy
Key: YARN-8080
URL: https://issues.apache.org/jira/browse/YARN-8080
Project: Hadoop YARN
Issue
Wangda Tan created YARN-8079:
Summary: YARN native service should respect source file of
ConfigFile inside Service/Component spec
Key: YARN-8079
URL: https://issues.apache.org/jira/browse/YARN-8079
Projec
[
https://issues.apache.org/jira/browse/YARN-5983?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-5983.
--
Resolution: Done
Fix Version/s: 3.1.0
Since this feature works end to end and landed in 3.1.0, clo
[
https://issues.apache.org/jira/browse/YARN-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-6223.
--
Resolution: Done
Fix Version/s: 3.1.0
Closing as done since all sub tasks are done.
> [Umbrella]
[
https://issues.apache.org/jira/browse/YARN-5326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-5326.
--
Resolution: Done
> Support for recurring reservations in the YARN ReservationSystem
> ---
[
https://issues.apache.org/jira/browse/YARN-7303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7303.
--
Resolution: Done
Closing as "done" since there's no patch committed with the Jira.
> Merge YARN-5734 bra
[
https://issues.apache.org/jira/browse/YARN-7873?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7873.
--
Resolution: Invalid
> Revert YARN-6078
>
>
> Key: YARN-7873
>
Wangda Tan created YARN-8046:
Summary: Revisit RMWebServiceProtocol implementations
Key: YARN-8046
URL: https://issues.apache.org/jira/browse/YARN-8046
Project: Hadoop YARN
Issue Type: Improvemen
Wangda Tan created YARN-8028:
Summary: Support authorizeUserAccessToQueue in RMWebServices
Key: YARN-8028
URL: https://issues.apache.org/jira/browse/YARN-8028
Project: Hadoop YARN
Issue Type: Imp
Wangda Tan created YARN-7920:
Summary: Cleanup configuration of PlacementConstraints
Key: YARN-7920
URL: https://issues.apache.org/jira/browse/YARN-7920
Project: Hadoop YARN
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/YARN-7854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7854.
--
Resolution: Later
> Attach prefixes to different type of node attributes
> --
[
https://issues.apache.org/jira/browse/YARN-7759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7759.
--
Resolution: Duplicate
Duplicated by YARN-7817
> [UI2]GPU chart shows as "Available: 0" even though GPU i
Wangda Tan created YARN-7817:
Summary: Add Resource reference to RM's NodeInfo object so REST
API can get non memory/vcore resource usages.
Key: YARN-7817
URL: https://issues.apache.org/jira/browse/YARN-7817
Wangda Tan created YARN-7807:
Summary: By default do intra-app anti-affinity for scheduling
request inside app placement allocator
Key: YARN-7807
URL: https://issues.apache.org/jira/browse/YARN-7807
Proje
Wangda Tan created YARN-7801:
Summary: AmFilterInitializer should addFilter after fill all
parameters
Key: YARN-7801
URL: https://issues.apache.org/jira/browse/YARN-7801
Project: Hadoop YARN
Iss
Wangda Tan created YARN-7790:
Summary: Improve Capacity Scheduler Async Scheduling to better
handle node failures
Key: YARN-7790
URL: https://issues.apache.org/jira/browse/YARN-7790
Project: Hadoop YARN
Wangda Tan created YARN-7789:
Summary: Should fail RM if 3rd resource type is configured but RM
uses DefaultResourceCalculator
Key: YARN-7789
URL: https://issues.apache.org/jira/browse/YARN-7789
Project:
Wangda Tan created YARN-7763:
Summary: Refactoring PlacementConstraintUtils APIs so
PlacementProcessor/Scheduler can use the same API and implementation
Key: YARN-7763
URL: https://issues.apache.org/jira/browse/YARN-7
Wangda Tan created YARN-7739:
Summary: Revisit scheduler resource normalization behavior for max
allocation
Key: YARN-7739
URL: https://issues.apache.org/jira/browse/YARN-7739
Project: Hadoop YARN
Wangda Tan created YARN-7723:
Summary: Avoid using docker volume --format option to compatible
to older docker releases
Key: YARN-7723
URL: https://issues.apache.org/jira/browse/YARN-7723
Project: Hadoop
Wangda Tan created YARN-7718:
Summary: DistributedShell failed to specify resource other than
memory/vcores from container_resources
Key: YARN-7718
URL: https://issues.apache.org/jira/browse/YARN-7718
Pro
Wangda Tan created YARN-7709:
Summary: Remove SELF from TargetExpression type .
Key: YARN-7709
URL: https://issues.apache.org/jira/browse/YARN-7709
Project: Hadoop YARN
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/YARN-7416?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7416.
--
Resolution: Duplicate
Duplicated by YARN-7487.
> Use "docker volume inspect" to make sure that volumes f
[
https://issues.apache.org/jira/browse/YARN-7509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7509.
--
Resolution: Fixed
Fix Version/s: (was: 3.0.1)
3.0.0
> AsyncScheduleThread a
Wangda Tan created YARN-7555:
Summary: Support multiple resource types in YARN native services
Key: YARN-7555
URL: https://issues.apache.org/jira/browse/YARN-7555
Project: Hadoop YARN
Issue Type:
Wangda Tan created YARN-7522:
Summary: Add application tags manager implementation
Key: YARN-7522
URL: https://issues.apache.org/jira/browse/YARN-7522
Project: Hadoop YARN
Issue Type: Sub-task
Wangda Tan created YARN-7487:
Summary: Make sure volume includes GPU base libraries exists after
created by plugin
Key: YARN-7487
URL: https://issues.apache.org/jira/browse/YARN-7487
Project: Hadoop YARN
Wangda Tan created YARN-7457:
Summary: Delay scheduling should be an individual policy instead
of part of scheduler implementation
Key: YARN-7457
URL: https://issues.apache.org/jira/browse/YARN-7457
Proje
Wangda Tan created YARN-7442:
Summary: [YARN-7069] Limit format of resource type name
Key: YARN-7442
URL: https://issues.apache.org/jira/browse/YARN-7442
Project: Hadoop YARN
Issue Type: Sub-task
Wangda Tan created YARN-7438:
Summary: Additional changes to make SchedulingPlacementSet
agnostic to ResourceRequest / placement algorithm
Key: YARN-7438
URL: https://issues.apache.org/jira/browse/YARN-7438
Wangda Tan created YARN-7437:
Summary: Give SchedulingPlacementSet to a better name.
Key: YARN-7437
URL: https://issues.apache.org/jira/browse/YARN-7437
Project: Hadoop YARN
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/YARN-5908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-5908.
--
Resolution: Duplicate
Duplicated to YARN-6952
> Add affinity/anti-affinity field to ResourceRequest API
Wangda Tan created YARN-7416:
Summary: Use "docker volume inspect" to make sure that volumes for
GPU drivers/libs are properly mounted.
Key: YARN-7416
URL: https://issues.apache.org/jira/browse/YARN-7416
Wangda Tan created YARN-7330:
Summary: Add support to show GPU on UI/metrics
Key: YARN-7330
URL: https://issues.apache.org/jira/browse/YARN-7330
Project: Hadoop YARN
Issue Type: Sub-task
Wangda Tan created YARN-7318:
Summary: Fix shell check warnings of SLS.
Key: YARN-7318
URL: https://issues.apache.org/jira/browse/YARN-7318
Project: Hadoop YARN
Issue Type: Bug
Report
[
https://issues.apache.org/jira/browse/YARN-4122?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-4122.
--
Resolution: Duplicate
This is duplicated by YARN-6620, closing as dup.
> Add support for GPU as a resour
Wangda Tan created YARN-7307:
Summary: Revisit resource-types.xml loading behaviors
Key: YARN-7307
URL: https://issues.apache.org/jira/browse/YARN-7307
Project: Hadoop YARN
Issue Type: Sub-task
Wangda Tan created YARN-7292:
Summary: Revisit Resource Profile Behavior
Key: YARN-7292
URL: https://issues.apache.org/jira/browse/YARN-7292
Project: Hadoop YARN
Issue Type: Sub-task
[
https://issues.apache.org/jira/browse/YARN-7249?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Wangda Tan resolved YARN-7249.
--
Resolution: Invalid
Sorry for the noise, it is not an issue for 2.8 as well. Closing as invalid.
> Fix C
Wangda Tan created YARN-7249:
Summary: Fix CapacityScheduler NPE issue when a container
preempted while the node is being removed
Key: YARN-7249
URL: https://issues.apache.org/jira/browse/YARN-7249
Projec
Wangda Tan created YARN-7242:
Summary: Support support specify values of different resource
types in DistributedShell for easier testing
Key: YARN-7242
URL: https://issues.apache.org/jira/browse/YARN-7242
Wangda Tan created YARN-7237:
Summary: Cleanup usages of ResourceProfiles
Key: YARN-7237
URL: https://issues.apache.org/jira/browse/YARN-7237
Project: Hadoop YARN
Issue Type: Sub-task