[
https://issues.apache.org/jira/browse/ARROW-16716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17545087#comment-17545087
]
Li Jin commented on ARROW-16716:
--------------------------------
[~ichauster] is an intern who will be working on Acero benchmarks. I talked
to [~westonpace] offline, and it seems Projection is a good place to start.
Pasting [~westonpace]'s comments on this: "
* Presumably, for a complex expression, with a large enough batch
size, the majority of time will be spent in the kernel functions.
* How far can you shrink the exec batch before the overhead of the
node impacts runtime?
* How complex does the expression need to be?
* I expect the results to be very similar to
ExecuteScalarExpressionOverhead in expression_benchmark.cc. Are they?
If not, what is the difference?
* What is the data rate of the project node (in bytes/second) for all
of the above?
* For all of the above, run with both 1 thread and 1 thread per core.
The ExecuteScalarExpressionOverhead benchmarks would be a good
existing example that should be pretty similar to how we would
benchmark the project node.
"
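To make the shape of the batch-size sweep in the questions above concrete, here is a rough sketch in plain Python. It is not Acero or Arrow code: project() is a hypothetical stand-in for a projection expression, all function names are made up for illustration, and it only covers the single-threaded case. The real benchmark would presumably be written in C++ alongside expression_benchmark.cc.

```python
import time
from array import array


def make_batches(total_rows, batch_size):
    """Split total_rows of synthetic int64 data into batches of at most batch_size rows."""
    batches = []
    for start in range(0, total_rows, batch_size):
        n = min(batch_size, total_rows - start)
        batches.append(array("q", range(start, start + n)))
    return batches


def project(batch):
    """Hypothetical stand-in for a projection expression: x * 2 + 1 per row."""
    return array("q", (x * 2 + 1 for x in batch))


def data_rate(total_bytes, seconds):
    """Input bytes processed per second."""
    return total_bytes / seconds


def run_sweep(total_rows, batch_sizes):
    """Time the stand-in projection per batch size; return (batch_size, bytes/s) pairs."""
    results = []
    for batch_size in batch_sizes:
        batches = make_batches(total_rows, batch_size)
        total_bytes = sum(len(b) * b.itemsize for b in batches)
        start = time.perf_counter()
        for b in batches:
            project(b)
        elapsed = time.perf_counter() - start
        results.append((batch_size, data_rate(total_bytes, elapsed)))
    return results


if __name__ == "__main__":
    # Same total row count for every run; only the batch size changes.
    for batch_size, rate in run_sweep(1 << 16, [16, 256, 4096, 65536]):
        print(f"batch_size={batch_size:>6}  {rate / 1e6:8.1f} MB/s")
```

Holding the total row count fixed while shrinking the batch size means any drop in bytes/second can be attributed to per-batch overhead rather than to the amount of data processed.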
> [Benchmarks] Create Projection benchmark for Acero
> --------------------------------------------------
>
> Key: ARROW-16716
> URL: https://issues.apache.org/jira/browse/ARROW-16716
> Project: Apache Arrow
> Issue Type: Improvement
> Reporter: Li Jin
> Priority: Major
>
--
This message was sent by Atlassian Jira
(v8.20.7#820007)