lhutton1 opened a new pull request, #12160: URL: https://github.com/apache/tvm/pull/12160
The NPU support library compiler sometimes adds padding to input tensors which means the buffer sizes calculated at runtime can sometimes be smaller than necessary. Instead, buffer sizes are now collected at compile time and passed to the runtime so that they match the sizes expected by the compiled network. This was seen when running a fully connected operation with an input that is not a multiple of 1024, so testing has been added to cover this case. Additionally changed the fully connected test case to use pytest parameterization as part of a general cleanup, and fixed the fully connected testing to support output channels > 1. cc @Leo-arm @manupa-arm @leandron -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: commits-unsubscr...@tvm.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org