[ 
https://issues.apache.org/jira/browse/ARROW-16716?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17545087#comment-17545087
 ] 

Li Jin commented on ARROW-16716:
--------------------------------

[~ichauster] is an intern who is going to work on Acero benchmarks. I talked 
to @Weston Pace offline and it seems Projection is a good place to start.

Pasting [~westonpace]'s comments on this: "
 * Presumably, for a complex expression, with a large enough batch
size, the majority of time will be spent in the kernel functions.
    * How far can you shrink the exec batch before the overhead of the
node impacts runtime?
    * How complex does the expression need to be?
 * I expect the results to be very similar to
ExecuteScalarExpressionOverhead in expression_benchmark.cc.  Are they?
If not, what is the difference?
 * What is the data rate of the project node (in bytes/second) for all
of the above?
 * For all of the above, run with both 1 thread and 1 thread per core.

The ExecuteScalarExpressionOverhead benchmarks would be a good
existing example that should be pretty similar to how we would
benchmark the project node.
"
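
To make the batch-size and data-rate questions above concrete, here is a minimal sketch of the measurement loop. It is an illustration only: it uses plain Python with a stand-in projection rather than Acero or the real benchmark harness, and the names `project` and `data_rate` are hypothetical, not taken from the Arrow code base. The idea is to keep the total row count fixed, sweep the batch size, and report bytes/second, so that per-batch overhead shows up as a drop in data rate at small batch sizes.

```python
import time
from array import array


def project(batch):
    # Stand-in for a projection expression (assumption): (x + 1) * 2 per value.
    return array("d", ((x + 1.0) * 2.0 for x in batch))


def data_rate(total_rows, batch_size, repeats=3):
    """Return the best observed input bytes/second over `repeats` runs."""
    data = array("d", range(total_rows))
    total_bytes = total_rows * data.itemsize
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        # Feed the same rows through the projection one batch at a time.
        for offset in range(0, total_rows, batch_size):
            project(data[offset:offset + batch_size])
        best = min(best, time.perf_counter() - start)
    return total_bytes / best


if __name__ == "__main__":
    # Same total work each time; only the batch size changes.
    for batch_size in (64, 1024, 16384):
        rate = data_rate(1 << 16, batch_size)
        print(f"batch_size={batch_size:>6}  {rate / 1e6:.1f} MB/s")
```

A real benchmark would replace `project` with the project node (or a direct expression execution, for the comparison against ExecuteScalarExpressionOverhead) and repeat the sweep for each expression complexity and thread count.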

> [Benchmarks] Create Projection benchmark for Acero
> --------------------------------------------------
>
>                 Key: ARROW-16716
>                 URL: https://issues.apache.org/jira/browse/ARROW-16716
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Li Jin
>            Priority: Major
>




--
This message was sent by Atlassian Jira
(v8.20.7#820007)
