MoisesHer opened a new pull request #16355: Pr embedding gradient
URL: https://github.com/apache/incubator-mxnet/pull/16355
 
 
   ## Description ##
   This PR includes a specific Embedding-backward operator for GPU.
   Two new CUDA kernels have been implemented for improving the performance of 
the operator when using GPU.
   According to our measurements on Volta GPUs, the previous version was taken 
2.2ms,
   whereas the new implementation takes 0.3ms, i.e. more than 7x speedup.
   ## Checklist ##
   ### Essentials ###
   - [X ] Changes are complete (i.e. I finished coding on this PR)
   - [X] All changes have test coverage:
   - Unit tests already existed 
(tests/python/gpu/test_operator_gpu:test_embedding_with_type) and changes do 
not affect correctness.
   - [X] Code is well-documented:
   - For new C++ functions in header files, their functionalities and arguments 
are documented.
   ### Changes ###
   - [x ] Embedding-backward operator for GPU, test: 
tests/python/gpu/test_operator_gpu:test_embedding_with_type

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to