[jira] [Resolved] (YUNIKORN-2635) test coverage improvement: same priority case in sorter

2024-05-26 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2635. Fix Version/s: 1.6.0 Resolution: Fixed > test coverage improvement: s

[jira] [Resolved] (YUNIKORN-2633) Unnecessary warning from Partition when adding an application

2024-05-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2633?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2633. Fix Version/s: 1.6.0 Resolution: Fixed > Unnecessary warning from Partition w

[jira] [Created] (YUNIKORN-2642) Don't set resources on the recovery queue

2024-05-24 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2642: -- Summary: Don't set resources on the recovery queue Key: YUNIKORN-2642 URL: https://issues.apache.org/jira/browse/YUNIKORN-2642 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2566) Remove AllocationAsk reference from askEvents

2024-05-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2566. Fix Version/s: 1.6.0 Resolution: Fixed > Remove AllocationAsk reference f

[jira] [Resolved] (YUNIKORN-2565) Remove Node reference from nodeEvents

2024-05-23 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2565. Fix Version/s: 1.6.0 Resolution: Fixed > Remove Node reference from nodeEve

[jira] [Resolved] (YUNIKORN-2618) Streamline AsyncRMCallback UpdateAllocation

2024-05-22 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2618?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2618. Fix Version/s: 1.6.0 Resolution: Fixed > Streamline AsyncRMCallb

[jira] [Resolved] (YUNIKORN-2611) [UMBRELLA] YuniKorn 1.5.1 release efforts

2024-05-22 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2611. Fix Version/s: 1.5.1 Resolution: Fixed > [UMBRELLA] YuniKorn 1.5.1 rele

[jira] [Resolved] (YUNIKORN-2614) Update website for 1.5.1

2024-05-22 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2614?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2614. Fix Version/s: 1.5.1 Target Version: 1.5.1 Resolution: Fixed > Upd

[jira] [Created] (YUNIKORN-2639) Clarify release procedure for minor releases

2024-05-21 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2639: -- Summary: Clarify release procedure for minor releases Key: YUNIKORN-2639 URL: https://issues.apache.org/jira/browse/YUNIKORN-2639 Project: Apache YuniKorn

[ANNOUNCE] YuniKorn 1.5.1 released

2024-05-21 Thread Peter Bacsko
Hi all, It gives me great pleasure to announce that the Apache YuniKorn community has voted to release Apache YuniKorn v1.5.1. This a minor release which contains 18 fixes. The release details are on the v1.5.1 announcement page [1]. You can also download the release from the Downloads page

[jira] [Created] (YUNIKORN-2633) Unnecessary warning from Partition when adding an application

2024-05-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2633: -- Summary: Unnecessary warning from Partition when adding an application Key: YUNIKORN-2633 URL: https://issues.apache.org/jira/browse/YUNIKORN-2633 Project

[jira] [Resolved] (YUNIKORN-2613) Release notes for 1.5.1

2024-05-17 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2613?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2613. Fix Version/s: 1.5.1 Resolution: Fixed > Release notes for 1.

[jira] [Resolved] (YUNIKORN-2632) Data race in IncAllocatedResource

2024-05-17 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2632. Fix Version/s: 1.6.0 1.5.2 Resolution: Fixed > Data r

[jira] [Created] (YUNIKORN-2632) Data race in IncAllocatedResource

2024-05-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2632: -- Summary: Data race in IncAllocatedResource Key: YUNIKORN-2632 URL: https://issues.apache.org/jira/browse/YUNIKORN-2632 Project: Apache YuniKorn Issue

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-16 Thread Peter Bacsko
Yes, that is correct. On Thu, May 16, 2024 at 8:54 PM Desai, Mit wrote: > This issue could also be faced by non-autoscaled clusters who still gets a > node added at some point. Right? > > -Mit > > From: Peter Bacsko > Date: Thursday, May 16, 2024 at 11:23 AM > To:

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-16 Thread Peter Bacsko
e could probably push for 1.5.2 alone. But > either way, 1.5.1 is already baked. > > > Craig > > > > On May 16, 2024, at 1:06 PM, Peter Bacsko wrote: > > > > Dear community, > > > > I've been working together with Jacob Salway on an issue and

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-16 Thread Peter Bacsko
Peter Bacsko wrote: > +1 binding > > - Built images from source (amd64) on Ubuntu 22.04 > - Run make test && make image > - Run it on a local cluster > - Checked some REST API endpoints > - Ran sample jobs > > Thank you all for the voting on the RC1 for 1.5.1. &g

[jira] [Created] (YUNIKORN-2629) Adding a node can result in a deadlock

2024-05-16 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2629: -- Summary: Adding a node can result in a deadlock Key: YUNIKORN-2629 URL: https://issues.apache.org/jira/browse/YUNIKORN-2629 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2612) Tagging for 1.5.1

2024-05-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2612. Fix Version/s: 1.5.1 Resolution: Fixed > Tagging for 1.

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-16 Thread Peter Bacsko
, 2024 at 9:41 PM Desai, Mit > wrote: > > > +1 (non-binding) > > > > > > * Built release on MacOS Sonoma (arm64) > > * Installed locally on Kind Cluster (1.28) > > * Successfully ran make test > > * Ran sample sleep jobs > > &

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-14 Thread Peter Bacsko
> - Ran make test, all tests passed > > - Installed locally on Kind cluster (1.29) > > > > - REST interface checks: > > - verified the SHA references in the cluster detail > > - verified the build date is set correctly > > - checked REST endpoints and UI > &

[jira] [Resolved] (YUNIKORN-2623) Create unit tests for Clients

2024-05-14 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2623. Fix Version/s: 1.6.0 Resolution: Fixed > Create unit tests for Clie

Re: [VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-13 Thread Peter Bacsko
Thanks everyone for testing So far we have 3 non-binding +1s. I'll probably extend the voting for another 24 hours to get some binding feedbacks as well. Peter On Mon, May 13, 2024 at 8:15 PM 陳昱霖 wrote: > +1 (non-binding) > > - Verified signatures and checksums > - Built on Ubuntu

[jira] [Created] (YUNIKORN-2623) Create unit test coverage for Clients

2024-05-13 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2623: -- Summary: Create unit test coverage for Clients Key: YUNIKORN-2623 URL: https://issues.apache.org/jira/browse/YUNIKORN-2623 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2620) Remove redundant variable `errorExpected` from configvalidator_test.go

2024-05-11 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2620?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2620. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Remove redund

[VOTE] Release Apache YuniKorn 1.5.1 RC1

2024-05-10 Thread Peter Bacsko
Hello everyone, I would like to call a vote for releasing Apache YuniKorn 1.5.1 RC1. This is a minor release which contains only bugfixes. The release artefacts have been uploaded here: https://dist.apache.org/repos/dist/dev/yunikorn/1.5.1-RC1/ My public key is located in the KEYS file:

Re: [DISCUSSION] Yunikorn release 1.5.1

2024-05-08 Thread Peter Bacsko
o make > progress on it without blocking the 1.5.1 patch release as it has > considerable fixes already (re: deadlock) > > Shravan > > On 2024/04/29 15:20:27 Peter Bacsko wrote: > > Hey Wilfred, > > > > Yes, I'm taking the role of release manager. >

[jira] [Created] (YUNIKORN-2614) Update website for 1.5.1

2024-05-08 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2614: -- Summary: Update website for 1.5.1 Key: YUNIKORN-2614 URL: https://issues.apache.org/jira/browse/YUNIKORN-2614 Project: Apache YuniKorn Issue Type: Sub

[jira] [Created] (YUNIKORN-2613) Release notes for 1.5.1

2024-05-08 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2613: -- Summary: Release notes for 1.5.1 Key: YUNIKORN-2613 URL: https://issues.apache.org/jira/browse/YUNIKORN-2613 Project: Apache YuniKorn Issue Type: Sub

[jira] [Created] (YUNIKORN-2612) Tagging for 1.5.1

2024-05-08 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2612: -- Summary: Tagging for 1.5.1 Key: YUNIKORN-2612 URL: https://issues.apache.org/jira/browse/YUNIKORN-2612 Project: Apache YuniKorn Issue Type: Sub-task

[jira] [Created] (YUNIKORN-2611) [UMBRELLA] YuniKorn 1.5.1 release efforts

2024-05-08 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2611: -- Summary: [UMBRELLA] YuniKorn 1.5.1 release efforts Key: YUNIKORN-2611 URL: https://issues.apache.org/jira/browse/YUNIKORN-2611 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2600) Update K8s dependency to 1.29.4

2024-05-06 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2600?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2600. Fix Version/s: 1.6.0 1.5.1 Resolution: Fixed Merged to master

[jira] [Created] (YUNIKORN-2602) Fix spelling/grammar in configvalidator

2024-05-04 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2602: -- Summary: Fix spelling/grammar in configvalidator Key: YUNIKORN-2602 URL: https://issues.apache.org/jira/browse/YUNIKORN-2602 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2600) Update K8s dependency to 1.29.4

2024-05-03 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2600: -- Summary: Update K8s dependency to 1.29.4 Key: YUNIKORN-2600 URL: https://issues.apache.org/jira/browse/YUNIKORN-2600 Project: Apache YuniKorn Issue Type

[jira] [Resolved] (YUNIKORN-2472) REST API returns subtree by default

2024-05-03 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2472. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > REST API retu

[jira] [Resolved] (YUNIKORN-2573) Flaky test TestUpdateNodeCapacityWithMultipleNodes

2024-05-03 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2573?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2573. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Flaky t

[jira] [Created] (YUNIKORN-2599) AppStateChange/AppTaskCompleted event cannot be handled in many states

2024-05-02 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2599: -- Summary: AppStateChange/AppTaskCompleted event cannot be handled in many states Key: YUNIKORN-2599 URL: https://issues.apache.org/jira/browse/YUNIKORN-2599

[jira] [Resolved] (YUNIKORN-2597) Improve error messages in Context

2024-05-02 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2597?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2597. Fix Version/s: 1.6.0 1.5.1 Resolution: Fixed Merged to master

[jira] [Resolved] (YUNIKORN-2518) Allow recovery queue in REST requests

2024-05-02 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2518?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2518. Fix Version/s: 1.6.0 Resolution: Fixed > Allow recovery queue in REST reque

[jira] [Created] (YUNIKORN-2597) Fix error messages in Context

2024-04-30 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2597: -- Summary: Fix error messages in Context Key: YUNIKORN-2597 URL: https://issues.apache.org/jira/browse/YUNIKORN-2597 Project: Apache YuniKorn Issue Type

[jira] [Resolved] (YUNIKORN-2297) Update the unit test for CheckQueuesStructure

2024-04-30 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2297. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Update the unit t

Re: [DISCUSSION] Yunikorn release 1.5.1

2024-04-29 Thread Peter Bacsko
Hey Wilfred, Yes, I'm taking the role of release manager. I cherry-picked YUNIKORN-2520 to branch-1.5. Regarding the remaining JIRAs, I asked PoAn Yang on Slack to take a look at YUNIKORN-2057 as he originally volunteered to solve it. I told him that it was not urgent, but depending on how

[DISCUSSION] Yunikorn release 1.5.1

2024-04-28 Thread Peter Bacsko
Hi all, Due to the number of problems that we have discovered since the release of 1.5.0, I believe it makes sense to create a new Yunikorn release which consists of bug fixes only. If I'm not mistaken we haven't done this before (at least since leaving the ASF incubator), so this would be the

[jira] [Resolved] (YUNIKORN-2583) Possible log spew on DEBUG level from objects.Node

2024-04-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2583. Resolution: Won't Do > Possible log spew on DEBUG level from objects.N

[jira] [Created] (YUNIKORN-2583) Possible log spew on DEBUG level when predicates are evaluated

2024-04-24 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2583: -- Summary: Possible log spew on DEBUG level when predicates are evaluated Key: YUNIKORN-2583 URL: https://issues.apache.org/jira/browse/YUNIKORN-2583 Project

[jira] [Resolved] (YUNIKORN-2544) [UMBRELLA] Fix Yunikorn potential locking issues

2024-04-22 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2544. Resolution: Fixed All subtasks have been resolved, closing ticket. > [UMBRELLA]

[jira] [Created] (YUNIKORN-2574) totalPartitionResource should not be mutated with AddTo/SubFrom

2024-04-22 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2574: -- Summary: totalPartitionResource should not be mutated with AddTo/SubFrom Key: YUNIKORN-2574 URL: https://issues.apache.org/jira/browse/YUNIKORN-2574 Project

[jira] [Resolved] (YUNIKORN-2562) Nil pointer panic in Application.ReplaceAllocation()

2024-04-22 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2562. Fix Version/s: 1.6.0 1.5.1 Resolution: Fixed > Nil poin

[jira] [Created] (YUNIKORN-2568) Move all xxxEvents types to objects/events

2024-04-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2568: -- Summary: Move all xxxEvents types to objects/events Key: YUNIKORN-2568 URL: https://issues.apache.org/jira/browse/YUNIKORN-2568 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2566) Remove AllocationAsk reference from askEvents

2024-04-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2566: -- Summary: Remove AllocationAsk reference from askEvents Key: YUNIKORN-2566 URL: https://issues.apache.org/jira/browse/YUNIKORN-2566 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2567) Remove Application reference from applicationEvents

2024-04-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2567: -- Summary: Remove Application reference from applicationEvents Key: YUNIKORN-2567 URL: https://issues.apache.org/jira/browse/YUNIKORN-2567 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2564) [Umbrella] Move xxxEvents types to a different package

2024-04-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2564: -- Summary: [Umbrella] Move xxxEvents types to a different package Key: YUNIKORN-2564 URL: https://issues.apache.org/jira/browse/YUNIKORN-2564 Project: Apache

[jira] [Created] (YUNIKORN-2565) Remove Node reference from nodeEvents

2024-04-17 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2565: -- Summary: Remove Node reference from nodeEvents Key: YUNIKORN-2565 URL: https://issues.apache.org/jira/browse/YUNIKORN-2565 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2563) [shim] Enable deadlock detection during unit tests

2024-04-16 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2563: -- Summary: [shim] Enable deadlock detection during unit tests Key: YUNIKORN-2563 URL: https://issues.apache.org/jira/browse/YUNIKORN-2563 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2553) Enable deadlock detection during unit tests

2024-04-16 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2553?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2553. Fix Version/s: 1.6.0 1.5.1 Target Version: 1.6.0, 1.5.1

[jira] [Created] (YUNIKORN-2562) Nil pointer in Application.ReplaceAllocation()

2024-04-16 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2562: -- Summary: Nil pointer in Application.ReplaceAllocation() Key: YUNIKORN-2562 URL: https://issues.apache.org/jira/browse/YUNIKORN-2562 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2552) Recursive locking when sending remove queue event

2024-04-15 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2552. Fix Version/s: 1.6.0 1.5.1 Resolution: Fixed > Recurs

[jira] [Resolved] (YUNIKORN-2550) Fix locking in PartitionContext

2024-04-15 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2550. Fix Version/s: 1.6.0 1.5.1 Target Version: 1.6.0, 1.5.1

[jira] [Created] (YUNIKORN-2554) Remove "rules" field from PartitionContext

2024-04-12 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2554: -- Summary: Remove "rules" field from PartitionContext Key: YUNIKORN-2554 URL: https://issues.apache.org/jira/browse/YUNIKORN-2554 Project: Apach

[jira] [Resolved] (YUNIKORN-2549) Fixing lint issues

2024-04-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2549. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Fixing lint iss

[jira] [Created] (YUNIKORN-2553) Integrate deadlock detection with unit test

2024-04-11 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2553: -- Summary: Integrate deadlock detection with unit test Key: YUNIKORN-2553 URL: https://issues.apache.org/jira/browse/YUNIKORN-2553 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2552) Recursive locking when sending Queue events

2024-04-11 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2552: -- Summary: Recursive locking when sending Queue events Key: YUNIKORN-2552 URL: https://issues.apache.org/jira/browse/YUNIKORN-2552 Project: Apache YuniKorn

Re: [DISCUSSION] (Potential) Locking issues in Yunikorn

2024-04-11 Thread Peter Bacsko
equiring locks, so for RMProxy I’m +1 on that. The extra memory for an > RMProxy instance is irrelevant. > > > > The recursive locking case is a real problem, and I’m surprised that > hasn’t bitten us harder. It can cause all sorts of issues. > > > > Craig > > > >

[jira] [Created] (YUNIKORN-2550) Fix locking in PartitionContext

2024-04-11 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2550: -- Summary: Fix locking in PartitionContext Key: YUNIKORN-2550 URL: https://issues.apache.org/jira/browse/YUNIKORN-2550 Project: Apache YuniKorn Issue Type

[jira] [Resolved] (YUNIKORN-2543) Fix locking in RMProxy

2024-04-10 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2543?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2543. Fix Version/s: 1.6.0 Resolution: Fixed > Fix locking in RMPr

[jira] [Resolved] (YUNIKORN-2545) Eliminate multiple lock calls from Queue

2024-04-10 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2545?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2545. Fix Version/s: 1.6.0 Resolution: Fixed > Eliminate multiple lock calls f

Re: [DISCUSSION] Deprecation of YuniKorn plugin deployment mode

2024-04-09 Thread Peter Bacsko
Hi all, thanks Craig for writing this excellent, detailed summary, including historical context. As we already talked about it on Slack, I'm definitely +1 for removing the plugin. My main gripes are: 1. It overcomplicates the codebase. Two branches for plugin vs non-plugin mode, scheduler cache

[jira] [Created] (YUNIKORN-2547) Queue: refactor quota/guranteed resource parsing logic when adding application

2024-04-08 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2547: -- Summary: Queue: refactor quota/guranteed resource parsing logic when adding application Key: YUNIKORN-2547 URL: https://issues.apache.org/jira/browse/YUNIKORN-2547

[jira] [Resolved] (YUNIKORN-2513) Fix event system to use event.requestCapacity

2024-04-06 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2513?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2513. Fix Version/s: 1.6.0 Resolution: Fixed > Fix event system to

[jira] [Created] (YUNIKORN-2545) Eliminate double read lock calls from Queue.GetPartitionQueueDAOInfo()

2024-04-06 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2545: -- Summary: Eliminate double read lock calls from Queue.GetPartitionQueueDAOInfo() Key: YUNIKORN-2545 URL: https://issues.apache.org/jira/browse/YUNIKORN-2545

[jira] [Created] (YUNIKORN-2544) [UMBRELLA] Fix Yunikorn potential locking issues

2024-04-06 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2544: -- Summary: [UMBRELLA] Fix Yunikorn potential locking issues Key: YUNIKORN-2544 URL: https://issues.apache.org/jira/browse/YUNIKORN-2544 Project: Apache YuniKorn

[DISCUSSION] (Potential) Locking issues in Yunikorn

2024-04-06 Thread Peter Bacsko
Hi all, after YUNIKORN-2539 got merged, we identified some potential deadlocks. These are false positives now, but a small change can cause Yunikorn to fall apart, so the term "potential deadlock" describes them properly. Thoughs, opinions are welcome. IMO we should handle these with priority to

[jira] [Created] (YUNIKORN-2543) Examine locking in RMProxy

2024-04-05 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2543: -- Summary: Examine locking in RMProxy Key: YUNIKORN-2543 URL: https://issues.apache.org/jira/browse/YUNIKORN-2543 Project: Apache YuniKorn Issue Type

[jira] [Resolved] (YUNIKORN-2423) Remove unnecessary boolean return value from the tracking code

2024-04-05 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2423. Fix Version/s: 1.6.0 Resolution: Fixed > Remove unnecessary boolean return va

[jira] [Created] (YUNIKORN-2542) Consistent logging and tracker handling for increment/decrement

2024-04-05 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2542: -- Summary: Consistent logging and tracker handling for increment/decrement Key: YUNIKORN-2542 URL: https://issues.apache.org/jira/browse/YUNIKORN-2542 Project

[jira] [Created] (YUNIKORN-2541) Fix CVE-2023-45288

2024-04-05 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2541: -- Summary: Fix CVE-2023-45288 Key: YUNIKORN-2541 URL: https://issues.apache.org/jira/browse/YUNIKORN-2541 Project: Apache YuniKorn Issue Type: Improvement

[jira] [Resolved] (YUNIKORN-2525) dispatcher.Stop() waits an extra second unnecessarily

2024-04-04 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2525. Fix Version/s: 1.6.0 Resolution: Fixed > dispatcher.Stop() waits an extra sec

[jira] [Resolved] (YUNIKORN-2474) Unused variable and methods

2024-04-03 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2474?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2474. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Unused varia

[jira] [Created] (YUNIKORN-2531) Create unit tests for AsyncRMCallback

2024-04-02 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2531: -- Summary: Create unit tests for AsyncRMCallback Key: YUNIKORN-2531 URL: https://issues.apache.org/jira/browse/YUNIKORN-2531 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2528) Increase coverage for UGM code

2024-04-02 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2528: -- Summary: Increase coverage for UGM code Key: YUNIKORN-2528 URL: https://issues.apache.org/jira/browse/YUNIKORN-2528 Project: Apache YuniKorn Issue Type

[jira] [Created] (YUNIKORN-2525) Make dispatcher.Stop() shut down quicker

2024-04-01 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2525: -- Summary: Make dispatcher.Stop() shut down quicker Key: YUNIKORN-2525 URL: https://issues.apache.org/jira/browse/YUNIKORN-2525 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2520) PVC errors in AssumePod() is not handled properly

2024-03-27 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2520: -- Summary: PVC errors in AssumePod() is not handled properly Key: YUNIKORN-2520 URL: https://issues.apache.org/jira/browse/YUNIKORN-2520 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2516) Update documentation about event.RESTResponseSize

2024-03-25 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2516: -- Summary: Update documentation about event.RESTResponseSize Key: YUNIKORN-2516 URL: https://issues.apache.org/jira/browse/YUNIKORN-2516 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2515) Add property event.RESTresponseSize to the batch event handler

2024-03-25 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2515: -- Summary: Add property event.RESTresponseSize to the batch event handler Key: YUNIKORN-2515 URL: https://issues.apache.org/jira/browse/YUNIKORN-2515 Project

[jira] [Created] (YUNIKORN-2513) Fix event system to use request.eventCapacity

2024-03-25 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2513: -- Summary: Fix event system to use request.eventCapacity Key: YUNIKORN-2513 URL: https://issues.apache.org/jira/browse/YUNIKORN-2513 Project: Apache YuniKorn

[jira] [Created] (YUNIKORN-2514) Update documentation about event.requestCapacity

2024-03-25 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2514: -- Summary: Update documentation about event.requestCapacity Key: YUNIKORN-2514 URL: https://issues.apache.org/jira/browse/YUNIKORN-2514 Project: Apache YuniKorn

[jira] [Resolved] (YUNIKORN-2442) Documentation update about the event system

2024-03-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2442?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2442. Fix Version/s: 1.6.0 Resolution: Fixed > Documentation update about the ev

[jira] [Resolved] (YUNIKORN-2444) Create user guide for the event system

2024-03-25 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2444?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2444. Fix Version/s: 1.6.0 Resolution: Fixed > Create user guide for the event sys

[jira] [Created] (YUNIKORN-2512) event.requestCapacity is not used

2024-03-25 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2512: -- Summary: event.requestCapacity is not used Key: YUNIKORN-2512 URL: https://issues.apache.org/jira/browse/YUNIKORN-2512 Project: Apache YuniKorn Issue

[jira] [Created] (YUNIKORN-2510) Placeholder processing starts immediately despite maxApplications limit

2024-03-21 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2510: -- Summary: Placeholder processing starts immediately despite maxApplications limit Key: YUNIKORN-2510 URL: https://issues.apache.org/jira/browse/YUNIKORN-2510

[jira] [Resolved] (YUNIKORN-2475) Add support for K8s native sidecars (restartable init containers)

2024-03-20 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2475. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. > Add support for

[jira] [Resolved] (YUNIKORN-2478) make image does not work when Minikube docker env is set

2024-03-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2478. Resolution: Duplicate I created the JIRA multiple times due to network problems

[jira] [Resolved] (YUNIKORN-2477) make image does not work when Minikube docker env is set

2024-03-12 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2477?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2477. Resolution: Duplicate > make image does not work when Minikube docker env is

[jira] [Created] (YUNIKORN-2478) make image does not work when Minikube docker env is set

2024-03-12 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2478: -- Summary: make image does not work when Minikube docker env is set Key: YUNIKORN-2478 URL: https://issues.apache.org/jira/browse/YUNIKORN-2478 Project: Apache

[jira] [Created] (YUNIKORN-2476) make image does not work when Minikube docker env is set

2024-03-12 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2476: -- Summary: make image does not work when Minikube docker env is set Key: YUNIKORN-2476 URL: https://issues.apache.org/jira/browse/YUNIKORN-2476 Project: Apache

[jira] [Created] (YUNIKORN-2477) make image does not work when Minikube docker env is set

2024-03-12 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2477: -- Summary: make image does not work when Minikube docker env is set Key: YUNIKORN-2477 URL: https://issues.apache.org/jira/browse/YUNIKORN-2477 Project: Apache

Re: [VOTE]Release Apache YuniKorn 1.5.0 RC2

2024-03-11 Thread Peter Bacsko
+1 Environment: Ubuntu 22.04 amd64 - Checked signatures and checksums - Built from source - Installed on Minikube - Checked some API endpoints (batch API) - Ran sleep jobs - Checked the web UI I just found a minor thing regarding Minikube which I'll report soon. On Mon, Mar 11, 2024 at 8:22 AM

[jira] [Created] (YUNIKORN-2475) Add support for K8s native sidecars (restartable init containers)

2024-03-11 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2475: -- Summary: Add support for K8s native sidecars (restartable init containers) Key: YUNIKORN-2475 URL: https://issues.apache.org/jira/browse/YUNIKORN-2475 Project

[jira] [Created] (YUNIKORN-2467) Remove AllocationAsk from the core when a pod is completed

2024-03-05 Thread Peter Bacsko (Jira)
Peter Bacsko created YUNIKORN-2467: -- Summary: Remove AllocationAsk from the core when a pod is completed Key: YUNIKORN-2467 URL: https://issues.apache.org/jira/browse/YUNIKORN-2467 Project: Apache

[jira] [Resolved] (YUNIKORN-2453) Add `EventRecord_APP_NEW` to event of created/submitted application

2024-02-27 Thread Peter Bacsko (Jira)
[ https://issues.apache.org/jira/browse/YUNIKORN-2453?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Bacsko resolved YUNIKORN-2453. Fix Version/s: 1.6.0 Resolution: Fixed Merged to master. >

  1   2   3   4   5   >