Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-21 Thread Weiwei Yang
Hi all I do not think we should get 0.12.2 out as we know we need to fix this bug. Having 0.12.2 out with a known issue doesn't sound better than just withdrawing it and re-release 0.12.2, using 0.12.2-RC3. Can we just withdraw the IPMC vote and start 0.12.2-RC3 right away? On Fri, Jan 21, 2022

[WITHDRAWN] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-21 Thread Craig Condit
All, I was discussing this a bit more with Weiwei offline, and I think we are better off withdrawing 0.12.2 RC2 and releasing 0.12.2 RC3 instead. I will get a build together shortly and issue a new vote. Thanks, Craig > On Jan 21, 2022, at 9:39 AM, Craig Condit wrote: > > Chaoran, nice

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-21 Thread Craig Condit
Chaoran, nice catch on this one. Unfortunate that we didn’t find it before cutting 0.12.2. I agree with Wilfred that we can add to the release notes on the website, but that we should back port to 0.12.3 as well. I can RM that release as well, unless someone else wants to volunteer. - Craig

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Wilfred Spiegelenburg
We have seen large numbers of people running and deploying. I have opened a PR with the fix. The scheduler should not get deleted, unless scaled down on purpose. It should not get evicted either, it should run as a high priority pod unless we missed that. Crashing of the scheduler is a bug, We

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Weiwei Yang
Agree, this needs to be fixed. Likely we need to revoke 0.12.2 and get out a 0.12.3. On Thu, Jan 20, 2022 at 9:56 PM Chaoran Yu wrote: > Yes, Helm install and upgrade both work. > The failure scenario is as follows: > > 1. Both the admission controller and the scheduler pods are running > 2.

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Chaoran Yu
Yes, Helm install and upgrade both work. The failure scenario is as follows: 1. Both the admission controller and the scheduler pods are running 2. The scheduler pod is restarted for some reason (e.g. deleted, evicted, or crashed) 3. The new scheduler pod will be stuck in the pending state

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Weiwei Yang
Hmmm. that is a bug. But during the release verification, I have tried the helm install, and that works as expected. I am guessing that is because the scheduler always gets started first. Maybe the same for the upgrade? In this case, maybe this can work as long as people are using helm charts to

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Chaoran Yu
I just spotted a bug https://issues.apache.org/jira/browse/YUNIKORN-1038. which is critical and worth porting back into branch 0.12 On Thu, Jan 20, 2022 at 12:12 PM Sunil Govindan wrote: > A late +1 (binding) from me. > > I build this from source > - Ran basic spark job > - Verified UI > -

Re: [RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-20 Thread Sunil Govindan
A late +1 (binding) from me. I build this from source - Ran basic spark job - Verified UI - Checked signature. - Checked the images. Thanks Sunil On Wed, Jan 19, 2022 at 8:44 AM Craig Condit wrote: > Hi all, > > The vote to Release Apache YuniKorn (incubating) 0.12.2 RC2 has passed > with 3

[RESULT] [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-19 Thread Craig Condit
Hi all, The vote to Release Apache YuniKorn (incubating) 0.12.2 RC2 has passed with 3 binding +1 votes and 3 non-binding +1 votes. Vote thread: https://lists.apache.org/thread/1gw0k0g5fy86r8ljnjttdco04w7z5j4j Thank you to all

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-16 Thread Chaoran Yu
+1 (binding) Checked the following: - Ran unit tests - Built images from source - Installed using the Helm chart - Ran Spark jobs - Checked scheduler logs - Checked UI On Sat, Jan 15, 2022 at 11:38 AM 陳宥騰 wrote: > +1 > - Run unit test(core 、 k8shim) > - Make image(core、k8shim、web) > - Deploy

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-15 Thread 陳宥騰
+1 - Run unit test(core 、 k8shim) - Make image(core、k8shim、web) - Deploy yunikorn 0.12.2 on k8s 1.20.10 - Check web UI - Check log - Run some examples in k8shim/deployments like sleep、namespace、nginx - Check pprof function like debug/pprof/profile、heap Chenya Zhang 於 2022年1月15日 週六 上午1:51寫道: >

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-14 Thread Chenya Zhang
+1 - Build from source - Run unit tests - Run sample workloads - Checked logs and K8s messages - Verified the Web UI - Verified REST endpoints - Checked no unexpected files Best, Chenya On Fri, Jan 14, 2022 at 1:00 AM Peter Bacsko wrote: > +1 > > - Verified signature & hash > - Built images

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-14 Thread Peter Bacsko
+1 - Verified signature & hash - Built images with make - Installed to KIND cluster - Executed batch sleep job - Inspected logs - Checked some REST endpoints Cheers, Peter On Fri, Jan 14, 2022 at 7:49 AM Weiwei Yang wrote: > +1 > > Verified the following things: > >- Checked NOTICE,

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-13 Thread Weiwei Yang
+1 Verified the following things: - Checked NOTICE, DISCLAIMER, README, and CHANGELOG - Verified signature - Build from source using make - Run unit test using make test - Run some sample workloads, roughly went over the logs and k8s messages - Verified the basic UI

Re: [VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-13 Thread Wilfred Spiegelenburg
+1 Checked the following: * no unexpected binaries or files * signature confirmed * hash confirmed * build from source * startup and run the tests * deployed into kind 1.22.4 using helm charts and local build * checked REST calls: logLevel, healthcheck, and statedump * checked Web UI On Fri, 14

[VOTE] Release Apache YuniKorn (incubating) 0.12.2 RC2

2022-01-13 Thread Craig Condit
Hello everyone, I’d like to call a vote for releasing Apache YuniKorn (incubating) 0.12.2 RC2. The release artifacts have been uploaded here: https://dist.apache.org/repos/dist/dev/incubator/yunikorn/0.12.2-rc2/ My public