wyc-ruiker opened a new pull request #8402: URL: https://github.com/apache/tvm/pull/8402
Let dense_tensorcore and batch_matmul_tensorcore support int8/int4. Before this pr, the [vision transform (vit)](https://arxiv.org/abs/2010.11929) latency (https://github.com/apache/tvm/pull/7814) in Tesla T4 is: vit int4: 4.71 ms vit int8: 3.48 ms After this pr: vit int4: 2.93 ms vit int8: 2.97 ms -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org