nihui commented on issue #10804: Use depthwise convolution(group convolution) 
by cuDNNv7 if available
URL: https://github.com/apache/incubator-mxnet/pull/10804#issuecomment-389411880
 
 
   Hello,
   
   Some feedback on the speed:
   
   hardware: Tesla M40 24GB x 2
   system: CentOS 7
   driver: nvidia-387.26
   cuda: 9.1
   cudnn: v7.1
   
   model: mobilenet-v2
   batch size: 256 (128 per GPU)
   
   mxnet implementation: 68 s per 10 iterations
   cudnnv7 implementation: 9.5 s per 10 iterations
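
To put the two timings in perspective, here is a minimal sketch of the arithmetic, assuming the reported batch size of 256 is the total across both GPUs and that each figure covers exactly 10 iterations. The helper name `images_per_second` is hypothetical and not part of MXNet.

```python
# Timings and batch size as reported in the comment above.
BATCH_SIZE = 256   # assumed to be the total across both GPUs
ITERS = 10         # each timing covers 10 iterations


def images_per_second(seconds_per_run, batch_size=BATCH_SIZE, iters=ITERS):
    """Throughput for a run of `iters` iterations (hypothetical helper)."""
    return batch_size * iters / seconds_per_run


mxnet_seconds = 68.0   # mxnet implementation
cudnn_seconds = 9.5    # cudnnv7 implementation

speedup = mxnet_seconds / cudnn_seconds

print(f"mxnet:   {images_per_second(mxnet_seconds):.1f} images/s")
print(f"cudnnv7: {images_per_second(cudnn_seconds):.1f} images/s")
print(f"speedup: {speedup:.1f}x")
```

Under these assumptions the cudnnv7 path comes out roughly 7x faster on this setup; the numbers are from a single reported run, so treat them as indicative only.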
   
   PS: you need to comment out the mxnet DepthwiseConvolutionOp path in 
src/operator/nn/convolution.cu to enable the cudnn one.
   

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services
