Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-21 Thread Weiwei Yang
Hi all, I do not think we should get 0.12.2 out, as we know we need to fix this bug. Having 0.12.2 out with a known issue doesn't sound better than just withdrawing it and re-releasing 0.12.2, using 0.12.2-RC3. Can we just withdraw the IPMC vote and start 0.12.2-RC3 right away?
On Fri, Jan 21, 2022 a

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-21 Thread Craig Condit
Chaoran, nice catch on this one. Unfortunate that we didn’t find it before cutting 0.12.2. I agree with Wilfred that we can add to the release notes on the website, but that we should backport to 0.12.3 as well. I can RM that release as well, unless someone else wants to volunteer. - Craig

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Wilfred Spiegelenburg
We have seen large numbers of people running and deploying. I have opened a PR with the fix. The scheduler should not get deleted unless scaled down on purpose. It should not get evicted either; it should run as a high-priority pod unless we missed that. Crashing of the scheduler is a bug. We sho

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Weiwei Yang
Agree, this needs to be fixed. Likely we need to revoke 0.12.2 and get out a 0.12.3.
On Thu, Jan 20, 2022 at 9:56 PM Chaoran Yu wrote:
> Yes, Helm install and upgrade both work.
> The failure scenario is as follows:
>
> 1. Both the admission controller and the scheduler pods are running
> 2. The

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Chaoran Yu
Yes, Helm install and upgrade both work. The failure scenario is as follows:
1. Both the admission controller and the scheduler pods are running
2. The scheduler pod is restarted for some reason (e.g. deleted, evicted, or crashed)
3. The new scheduler pod will be stuck in the pending state becaus

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Weiwei Yang
Hmmm, that is a bug. But during the release verification, I tried the Helm install, and that works as expected. I am guessing that is because the scheduler always gets started first. Maybe the same for the upgrade? In this case, maybe this can work as long as people are using Helm charts to de

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Chaoran Yu
I just spotted a bug, https://issues.apache.org/jira/browse/YUNIKORN-1038, which is critical and worth porting back into branch 0.12.
On Thu, Jan 20, 2022 at 12:12 PM Sunil Govindan wrote:
> A late +1 (binding) from me.
>
> I build this from source
> - Ran basic spark job
> - Verified UI
> - Check

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Sunil Govindan
A late +1 (binding) from me. I built this from source:
- Ran basic Spark job
- Verified UI
- Checked signature
- Checked the images
Thanks, Sunil
On Wed, Jan 19, 2022 at 8:44 AM Craig Condit wrote:
> Hi all,
>
> The vote to Release Apache YuniKorn (incubating) 0.12.2 RC2 has passed
> with 3 bi

[RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-19 Thread Craig Condit
Hi all, The vote to Release Apache YuniKorn (incubating) 0.12.2 RC2 has passed with 3 binding +1 votes and 3 non-binding +1 votes. Vote thread: https://lists.apache.org/thread/1gw0k0g5fy86r8ljnjttdco04w7z5j4j Thank you to all t