Chris Yeung created SINGA-512:
---------------------------------

             Summary: Support of fused ops to increase throughput performance
                 Key: SINGA-512
                 URL: https://issues.apache.org/jira/browse/SINGA-512
             Project: Singa
          Issue Type: Improvement
          Components: Core
            Reporter: Chris Yeung


cuDNN 7.6 introduces a new API for fused ops, which can accelerate many 
use cases in ResNet-like networks. With this API it is possible to execute 
several operations as one fused operation: applying per-channel scale and 
bias, performing activation, computing convolution, and generating batchnorm 
statistics. 
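
To make the intent concrete, here is a minimal reference sketch in plain 
Python of what such a fused operation computes. It does not call cuDNN and 
is not the SINGA implementation. The function names, the x[channel][pixel] 
layout, and the use of a 1x1 convolution are assumptions chosen only to keep 
the example short. It compares three separate passes with one fused pass 
that produces the same output without materialising intermediate tensors.

```python
# Reference semantics only: plain Python, no cuDNN calls.
# Assumed layout (hypothetical): tensors are nested lists x[channel][pixel].

def scale_bias_relu(x, scale, bias):
    """Unfused step 1: y[c][i] = max(0, scale[c] * x[c][i] + bias[c])."""
    return [[max(0.0, scale[c] * v + bias[c]) for v in row]
            for c, row in enumerate(x)]

def conv1x1(x, weight):
    """Unfused step 2: 1x1 convolution; weight[k][c] mixes input channels."""
    n_pix = len(x[0])
    return [[sum(w_kc * x[c][i] for c, w_kc in enumerate(w_k))
             for i in range(n_pix)]
            for w_k in weight]

def channel_stats(y):
    """Unfused step 3: per-channel sum and sum of squares (batchnorm stats)."""
    return [sum(row) for row in y], [sum(v * v for v in row) for row in y]

def fused(x, scale, bias, weight):
    """One pass over the pixels doing all three steps, no intermediates."""
    n_in, n_out, n_pix = len(x), len(weight), len(x[0])
    out = [[0.0] * n_pix for _ in range(n_out)]
    sums = [0.0] * n_out
    sq_sums = [0.0] * n_out
    for i in range(n_pix):
        # Scale, bias and ReLU for this pixel across all input channels.
        act = [max(0.0, scale[c] * x[c][i] + bias[c]) for c in range(n_in)]
        for k in range(n_out):
            v = sum(weight[k][c] * act[c] for c in range(n_in))
            out[k][i] = v
            sums[k] += v         # running statistics, accumulated in place
            sq_sums[k] += v * v
    return out, sums, sq_sums

# Small example: 2 input channels, 3 pixels, 2 output channels.
x = [[1.0, -2.0, 3.0], [0.5, 4.0, -1.0]]
scale, bias = [2.0, 1.0], [0.0, -1.0]
weight = [[1.0, 1.0], [0.5, -1.0]]

unfused_out = conv1x1(scale_bias_relu(x, scale, bias), weight)
unfused_sums, unfused_sq = channel_stats(unfused_out)
fused_out, fused_sums, fused_sq = fused(x, scale, bias, weight)
```

The unfused path reads and writes the full tensor three times, while the 
fused path touches each pixel once. That reduction in memory traffic is the 
expected source of the throughput gain.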

Reference:
https://docs.nvidia.com/deeplearning/cudnn/release-notes/rel_7xx.html#rel_760
 
The goal is to increase the image throughput of ResNet. Currently, this task is 
assigned to Naili.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
