This is an automated email from the ASF dual-hosted git repository. apeforest pushed a change to branch benchmark in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git.
from e007dcd adding API doc for Lamb Phase 1 and 2 add 169ed69 Speed fused_op compilation by caching ptx and jit-compiled functions (#16783) No new revisions were added by this update. Summary of changes: src/operator/fusion/fused_op-inl.h | 4 +- src/operator/fusion/fused_op.cc | 50 ++++----- src/operator/fusion/fused_op.cu | 225 ++++++++++++++++++++++--------------- src/operator/fusion/fused_op.h | 43 +++---- tests/python/gpu/test_fusion.py | 25 ++++- 5 files changed, 205 insertions(+), 142 deletions(-)