lhutton1 opened a new pull request, #15470:
URL: https://github.com/apache/tvm/pull/15470

   Currently, the fallback used when compiling a dense operation with targets 
such as `llvm -device=arm_cpu` is `dense.generic`, which results in very poor 
performance. Although https://github.com/apache/tvm/pull/13775 meant that x86 
schedules are used in cases where no strategy is provided by arm_cpu, this does 
not help dense: a dense strategy is registered for arm_cpu due to the existence 
of specialized schedules, e.g. a schedule for embedded devices. This commit 
ensures x86 schedules are used in place of the generic schedule, which yields 
much better performance.
   
   The commit also handles the `dense.generic` schedule in the same way as the 
x86 strategy: it will only be used when the auto-scheduler is enabled.
   
   A test has been added to check the intended schedules are picked when 
compiling with `arm_cpu`.
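
   The selection logic described above can be sketched as a toy model in plain 
Python. This is not TVM's strategy API: the `Impl` class, the `dense_candidates` 
and `pick` helpers, the `has_dsp` flag, and the priority values are all 
hypothetical, and the schedule names other than `dense.generic` are only 
illustrative. It shows the intended outcome, not the actual implementation.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class Impl:
    """A candidate implementation (hypothetical stand-in, not a TVM class)."""
    name: str
    plevel: int  # assumed convention: the highest value wins


def dense_candidates(has_dsp: bool, auto_scheduler: bool) -> List[Impl]:
    """Toy model of the dense candidates for an arm_cpu target."""
    if has_dsp:
        # Specialized schedule, e.g. for embedded devices (illustrative name).
        return [Impl("dense_dsp.arm_cpu", 10)]

    candidates = []
    if auto_scheduler:
        # The generic schedule is only offered when the auto-scheduler is on.
        candidates.append(Impl("dense.generic", 11))
    # Otherwise fall back to an x86 schedule rather than the generic one.
    candidates.append(Impl("dense_pack.x86", 10))
    return candidates


def pick(candidates: List[Impl]) -> str:
    """Return the name of the highest-priority candidate."""
    return max(candidates, key=lambda impl: impl.plevel).name
```

   In this model, a plain `arm_cpu` target without the auto-scheduler picks the 
x86 schedule, and `dense.generic` is only picked when the auto-scheduler is 
enabled.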
   
   cc @ekalda @neildhickey 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

