Hi Cheng,

Thanks for testing on K8s! I agree that we should fix the remaining `rss.`-prefixed configurations before releasing 0.2.0.

Thanks,
Keyong Zhou

Cheng Pan <pan3...@gmail.com> wrote on Tue, Dec 20, 2022 at 15:48:

> Hi community,
>
> I tested branch-0.2 on K8s w/ CELEBORN-151 and some internal modifications by
> running some TPC-DS queries, and it works as expected.
>
> I noticed that some configurations still use the `rss.` prefix
> instead of `celeborn.`; we should fix them before releasing 0.2.0.
>
> Thanks,
> Cheng Pan
>
>
> On Dec 19, 2022 at 18:25:44, keyong zhou <waitinfut...@gmail.com> wrote:
>
> > Hi Celeborn (Incubating) community,
> >
> > I have tested the core path and fixed one bug [1]. After fixing this, my
> > test seems OK.
> >
> > The way I tested is as follows:
> > I ran 1T TPC-DS with Celeborn on, then randomly killed workers during the
> > test, taking care not to trigger data loss. In the end I checked whether
> > the results were correct.
> >
> > [1] https://github.com/apache/incubator-celeborn/pull/1101
> >
> > Thanks,
> > Keyong Zhou
> >
> > Keyong Zhou <zho...@apache.org> wrote on Wed, Dec 14, 2022 at 19:41:
> >
> > > Hi Celeborn (Incubating) community,
> > >
> > > Currently we are preparing for the first release (branch-0.2). To ensure
> > > code quality, I would like to test for core-path correctness and
> > > stability. Could Angerszhuuuu <angers....@gmail.com> and nafiyaix
> > > <nafiyai...@gmail.com> help test graceful shutdown and rolling upgrade?
> > > And could Ethan Feng <ethan.aquarius....@gmail.com> help test load-aware
> > > slots allocation?
> > >
> > > And we would be very happy if anyone could help test other modules
> > > (K8s, HA, etc.).
> > >
> > > Thanks,
> > > Keyong