Hi Yimin, Welcome to make contribution to MXNet project!
From <https://github.com/bytedance/byteps/blob/master/README.md> https://github.com/bytedance/byteps/blob/master/README.md I found another limitation that isn't shown in your proposal: BytePS does not support pure CPU training for now. One reason is that the cheap PS assumption<https://github.com/bytedance/byteps/blob/master/docs/rationale.md> of BytePS do not hold for CPU training. Consequently, you need CUDA and NCCL to build and run BytePS. I have a couple of question for this: How's the status of CPU training support? If CPU training isn't supported yet, what's the challenge to support it? Do you have a plan to support it? Thanks, Zhennan On Wed, 2019-11-06 at 12:14 +0800, Yimin Jiang wrote: Hi MXNet Community, BytePS (https://github.com/bytedance/byteps) is a high-performance, cross-framework architecture for distributed training. BytePS developers are planning to integrate a part of BytePS into MXNet. The link below is the proposal. Feedbacks are welcome. https://cwiki.apache.org/confluence/display/MXNET/BytePS-MXNet+Integration Thank you, Yimin Jiang