wyc-ruiker opened a new pull request #8402:
URL: https://github.com/apache/tvm/pull/8402


   Let dense_tensorcore and batch_matmul_tensorcore support int8/int4.
   Before this pr, the [vision transform 
(vit)](https://arxiv.org/abs/2010.11929) latency 
(https://github.com/apache/tvm/pull/7814) in Tesla T4 is: 
   vit int4: 4.71 ms
   vit int8: 3.48 ms
   After this pr:
   vit int4: 2.93 ms
   vit int8: 2.97 ms


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to