guan404ming opened a new pull request, #18764:
URL: https://github.com/apache/tvm/pull/18764
## Why
Folding large creation ops (zeros, ones, full, arange) that have no tensor
inputs unnecessarily materializes large constants in the binary, even though
these values are cheap to compute at runtime.
## How
- Add a ShouldBeFolded heuristic that skips folding when the output has more
than 1024 elements and the op has no tensor inputs
- Check call arguments for tensor inputs, including tuple arguments for
call_tir
- Add tests for large creation ops, small creation ops, and large ops with
tensor inputs
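
As a rough illustration of the heuristic described above, here is a minimal standalone sketch in Python. It is not the code in this PR and does not use any TVM API: `Tensor`, `NonTensorArg`, `has_tensor_input`, and `should_be_folded` are hypothetical names invented for this example, and the strict "greater than 1024" comparison is an assumption based on the word "exceeds" in the description.

```python
from dataclasses import dataclass
from math import prod
from typing import Sequence, Tuple

# Hypothetical stand-ins for IR arguments; these are NOT TVM classes.
@dataclass(frozen=True)
class Tensor:
    shape: Tuple[int, ...]

@dataclass(frozen=True)
class NonTensorArg:
    value: object  # e.g. a shape or a primitive value

# Threshold taken from the PR description (assumed to be a strict bound).
MAX_FOLDED_ELEMENTS = 1024

def has_tensor_input(args: Sequence[object]) -> bool:
    """Return True if any argument, or any element of a nested tuple, is a Tensor."""
    for arg in args:
        if isinstance(arg, Tensor):
            return True
        # Tuples are searched recursively, mirroring the tuple check for call_tir.
        if isinstance(arg, tuple) and has_tensor_input(arg):
            return True
    return False

def should_be_folded(output_shape: Tuple[int, ...], args: Sequence[object]) -> bool:
    """Skip folding only when the output is large AND no argument is a tensor."""
    num_elements = prod(output_shape)  # prod(()) == 1 for a scalar output
    if num_elements > MAX_FOLDED_ELEMENTS and not has_tensor_input(args):
        return False
    return True
```

Under these assumptions, a 2048-element `zeros` call with only a shape argument would be left unfolded, while the same output size produced from a tensor input, or a 16-element creation op, would still be folded.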