Hi Yimin,

Thank you for contributing to the MXNet project!

From the BytePS README
<https://github.com/bytedance/byteps/blob/master/README.md>, I found another
limitation that isn't mentioned in your proposal:

BytePS does not support pure CPU training for now. One reason is that the cheap
PS assumption
<https://github.com/bytedance/byteps/blob/master/docs/rationale.md>
of BytePS do not hold for CPU training. Consequently, you need CUDA and NCCL to
build and run BytePS.

I have a couple of questions about this: What is the status of CPU training
support? If CPU training isn't supported yet, what are the challenges in
supporting it? Do you have a plan to support it?

Thanks,
Zhennan

On Wed, 2019-11-06 at 12:14 +0800, Yimin Jiang wrote:

Hi MXNet Community,

BytePS (https://github.com/bytedance/byteps) is a high-performance,
cross-framework architecture for distributed training. BytePS developers
are planning to integrate a part of BytePS into MXNet. The link below is
the proposal. Feedback is welcome.

https://cwiki.apache.org/confluence/display/MXNET/BytePS-MXNet+Integration

Thank you,
Yimin Jiang
