lhutton1 opened a new pull request, #13092:
URL: https://github.com/apache/tvm/pull/13092

   Adds a pass that analyzes functions partitioned for the NPU and inlines 
those that are deemed "non-compute-intensive" back to the main function so that 
they can be considered for other backends. The current heuristic for deciding a 
non-compute-intensive function is to collectively check all of the operations 
in the function have no multiply accumulate operations. This heuristic is not 
optimal; optimization is left for future exploration.
   
   This pass is inspired by the "IsComputeIntensiveGraph" pass in the TensorRT 
integration.
   
   cc @ashutosh-arm @leandron 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to