arina-grovety commented on PR #13212:
URL: https://github.com/apache/tvm/pull/13212#issuecomment-1308761777
> I agree with @lhutton1 here. The knob `--dump-code="relay"` provides a way
> to visualize the post-partition relay model. `main` function in this relay
> model lists sequence of calls to partitioned functions with appropriate target
> annotations. Does the new knob `print_operators_offloading` serve any
> additional purpose that I might have missed @sergey-grovety ? To be fair, I
> have only read the PR description 😅
Hi @ashutosh-arm, sorry for the late reply. As I see it, the main purpose of
the new option is to show the correspondence between the operators in the
original graph and the operations that end up offloaded to the target. This is
displayed as a sequential printout of the source Relay operations, with the
composites from which they are derived and the target to which they are
offloaded.
Another point worth highlighting is that the partitioned Relay, which is the
output of `--dump-code="relay"`, has Relay operation numbers (%...) that differ
from those in the initial Relay. The new knob keeps the initial Relay's
numbers, so it can be handy.
Here is an example output with the new option:
'ethos-u <- ethos-u.qnn_conv2d'
'ethos-u <- %204 = qnn.conv2d(%203, %v_param_105, -128, 0,
0.0235294f, ...'
'ethos-u <- %205 = nn.bias_add(%204, %v_param_106, axis=3);'
'ethos-u <- %206 = qnn.requantize(%205,
meta[relay.Constant][105], 0, ...'
'ethos-u <- ethos-u.reshape'
'ethos-u <- %207 = reshape(%206, newshape=[1, 1001]);'
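To make the structure of that printout concrete, here is a minimal Python sketch that groups such lines by composite. It is only an illustration: `group_offloaded_ops` is a hypothetical helper and not part of TVM or this PR, and it assumes every line has the form `target <- body`, that a body starting with `%` is an operator line, and that any other body names a composite. The sample lines are the ones above with the mail line-wrapping undone.

```python
import re

# Hypothetical helper for illustration only; not part of TVM.
# Assumed line format (taken from the example output): "<target> <- <body>".
LINE_RE = re.compile(r"^(?P<target>\S+) <- (?P<body>.+)$")


def group_offloaded_ops(lines):
    """Return a list of (target, composite, [operator lines]) in print order."""
    groups = []
    for raw in lines:
        # Drop surrounding whitespace and the quotes around each printed line.
        line = raw.strip().strip("'")
        match = LINE_RE.match(line)
        if match is None:
            continue  # Ignore anything that does not look like "target <- body".
        target, body = match.group("target"), match.group("body")
        if body.startswith("%"):
            # Operator line: attach it to the most recent composite.
            if not groups:
                groups.append((target, None, []))
            groups[-1][2].append(body)
        else:
            # Assumption: a body that is not numbered names a composite.
            groups.append((target, body, []))
    return groups


sample = [
    "'ethos-u <- ethos-u.qnn_conv2d'",
    "'ethos-u <- %204 = qnn.conv2d(%203, %v_param_105, -128, 0, 0.0235294f, ...'",
    "'ethos-u <- %205 = nn.bias_add(%204, %v_param_106, axis=3);'",
    "'ethos-u <- %206 = qnn.requantize(%205, meta[relay.Constant][105], 0, ...'",
    "'ethos-u <- ethos-u.reshape'",
    "'ethos-u <- %207 = reshape(%206, newshape=[1, 1001]);'",
]

groups = group_offloaded_ops(sample)
for target, composite, ops in groups:
    print(f"{composite} on {target}: {len(ops)} operator(s)")
```

Because the `%` numbers are those of the initial Relay, each grouped operator line can be matched directly against the original model text.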
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.