dsbarinov1 opened a new pull request, #14209:
URL: https://github.com/apache/tvm/pull/14209
Motivation:
The purpose of this PR is to add support for intrinsics to optimize matrix
multiplication operations (e.g. matmul, convolution) during tuning with
MetaScheduler.
Information about PR:
The present PR integrates the existing neon and dotprod (namely, sdot and
udot) ARM CPU intrinsics into MetaScheduler, introduces a new "hybrid" dotprod
intrinsic ("hdot") working with uint8, uint8 -> int32 data types, and changes
the intrinsic selection and application processes for the ARM CPU case, since
we operate with multiple intrinsics, rather than with a specific one.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]