dsbarinov1 opened a new pull request, #14209:
URL: https://github.com/apache/tvm/pull/14209

   Motivation:
   The purpose of this PR is to add support for intrinsics to optimize matrix 
multiplication operations (e.g. matmul, convolution) during tuning with 
MetaScheduler.
   
   Information about PR:
   The present PR integrates the existing neon and dotprod (namely, sdot and 
udot) ARM CPU intrinsics into MetaScheduler, introduces a new "hybrid" dotprod 
intrinsic ("hdot") working with uint8, uint8 -> int32 data types, and changes 
the intrinsic selection and application processes for the ARM CPU case, since 
we operate with multiple intrinsics, rather than with a specific one.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to