lhutton1 opened a new pull request, #15470: URL: https://github.com/apache/tvm/pull/15470
Currently, the fallback used when compiling a dense operation for targets such as `llvm -device=arm_cpu` is `dense.generic`, which results in very poor performance. Although https://github.com/apache/tvm/pull/13775 meant that x86 schedules are used in cases where no strategy is provided by arm_cpu, that does not apply to dense: arm_cpu registers its own dense strategy because specialized schedules exist for it, e.g. a schedule for embedded devices.

This commit ensures x86 schedules are used in place of the generic schedule, which yields much better performance. The commit also follows the same approach for the `dense.generic` schedule as the x86 strategy: it will only be used when auto-scheduler is enabled.

A test has been added to check that the intended schedules are picked when compiling with `arm_cpu`.

cc @ekalda @neildhickey
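For readers unfamiliar with how a strategy picks a schedule, the fallback order described above can be sketched as a toy model in plain Python. This is not TVM code: the `OpStrategy` class here is a hypothetical stand-in, the implementation names `dense_nopack.x86` and `dense_pack.x86` and all priority values are assumptions for illustration, and the model ignores tuning logs, which can also influence the choice in practice.

```python
from dataclasses import dataclass, field


@dataclass
class Implementation:
    name: str
    plevel: int  # priority level: the highest value wins in this toy model


@dataclass
class OpStrategy:
    """Hypothetical stand-in for a strategy object; not the TVM class."""

    implementations: list = field(default_factory=list)

    def add_implementation(self, name: str, plevel: int) -> None:
        self.implementations.append(Implementation(name, plevel))

    def select(self) -> str:
        # Pick the registered implementation with the highest priority.
        return max(self.implementations, key=lambda impl: impl.plevel).name


def dense_strategy_arm_cpu(auto_scheduler_enabled: bool = False) -> OpStrategy:
    """Toy version of the arm_cpu dense fallback described in this PR."""
    strategy = OpStrategy()
    # x86 schedules act as the fallback (names and priorities are assumed).
    strategy.add_implementation("dense_nopack.x86", plevel=5)
    strategy.add_implementation("dense_pack.x86", plevel=10)
    # The generic schedule is registered only when auto-scheduler is enabled.
    if auto_scheduler_enabled:
        strategy.add_implementation("dense.generic", plevel=11)
    return strategy


if __name__ == "__main__":
    print(dense_strategy_arm_cpu().select())
    print(dense_strategy_arm_cpu(auto_scheduler_enabled=True).select())
```

In this sketch, `dense.generic` is never a candidate unless auto-scheduler is enabled, so an untuned build always lands on one of the x86 schedules.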
