Thank you guys for the valuable tests!

angers zhu <angers....@gmail.com> 于2022年12月21日周三 10:33写道:

> Hi all,
>
> I have tested quota management and make some minor improvements. It can
> work well.
> later I will test worker quick restart.
>
> apache <ethanf...@apache.org> 于2022年12月21日周三 10:00写道:
>
>> Hi community:
>> I’ve tested branch-0.2 about load-aware slots allocation modules with a
>> fusion cluster of HDDs, SSDs, and ESSDs. As expected, workers with faster
>> disk drives have more slots on them.
>>
>> The way I tested is as follows:
>> I ran 1T TPC-DS while 1TB Terasort was running repeatedly and tested
>> different parameter combinations on three node groups. The group core has
>> 10x NVME SSD drives. Group task 1 has 4x ESSD drives. Group task 2 has 12x
>> SATA HDD drivers.
>>
>> Here are some snapshots:
>>
>> Graph 1: Only NVME SSD and HDD workers are involved and
>> "celeborn.slots.assign.loadAware.numDiskGroups" is set to 2.
>>
>>
>> Graph 2: All nodes are involved and
>> "celeborn.slots.assign.loadAware.numDiskGroups" is set to 3.
>>
>>
>> Thanks,
>> Ethan Feng
>> 在 2022年12月14日 +0800 19:41,Keyong Zhou <zho...@apache.org>,写道:
>>
>> Hi celeborn (-incubating) community:
>>
>> Currently we are preparing for the first release (branch-0.2). To ensure
>> code quality, I would like to test for core-path correctness and
>> stability,
>> could Angerszhuuuu <angers....@gmail.com> and nafiyaix
>> <nafiyai...@gmail.com> help test graceful shutdown and rolling upgrade?
>> And
>> could Ethan Feng <ethan.aquarius....@gmail.com> help test load-aware
>> slots
>> allocation?
>>
>> And we would be rather happy if anyone can help test for other modules
>> (k8s, HA, etc.).
>>
>> Thanks,
>> Keyong
>>
>>

Reply via email to