[
https://issues.apache.org/jira/browse/IMPALA-11185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Quanlong Huang resolved IMPALA-11185.
-------------------------------------
Fix Version/s: Impala 4.1.0
Resolution: Fixed
> Reuse orc::ColumnVectorBatch in the scanner life-cycle
> ------------------------------------------------------
>
> Key: IMPALA-11185
> URL: https://issues.apache.org/jira/browse/IMPALA-11185
> Project: IMPALA
> Issue Type: Improvement
> Components: Backend
> Reporter: Quanlong Huang
> Assignee: Quanlong Huang
> Priority: Major
> Fix For: Impala 4.1.0
>
> Attachments: tpch-q1-scanner-flame-graph.jpg
>
>
> In HdfsOrcScanner::AssembleRows(), we always re-create an
> orc::ColumnVectorBatch. The ideal pattern is to reuse the batch and only
> destroy it when the scanner is closed.
> In the flame graph of TPC-H Q1 collected by [~drorke], createRowBatch
> and the batch destructors occupy almost half of the scanner time.
> !tpch-q1-scanner-flame-graph.jpg|width=979,height=426!
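
The reuse pattern described in the issue can be illustrated with a minimal, self-contained sketch. It does not use the real ORC library or Impala code: FakeBatch, FakeRowReader, FakeScanner, ScanResult and ScanAll are hypothetical stand-ins invented for this example, loosely modelled on a reader that creates a batch once and then refills it on each read. The scanner allocates the batch lazily on the first AssembleRows() call, refills the same object on every later call, and frees it only in Close().

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for a column batch (NOT orc::ColumnVectorBatch).
// It counts how many times a batch has been constructed.
struct FakeBatch {
  explicit FakeBatch(std::size_t capacity) : values(capacity), num_elements(0) {
    ++allocations;
  }
  std::vector<int64_t> values;
  std::size_t num_elements;
  static int allocations;
};
int FakeBatch::allocations = 0;

// Hypothetical stand-in for a row reader that yields 0, 1, 2, ...
class FakeRowReader {
 public:
  explicit FakeRowReader(std::size_t total_rows) : remaining_(total_rows) {}

  std::unique_ptr<FakeBatch> CreateBatch(std::size_t capacity) const {
    return std::make_unique<FakeBatch>(capacity);
  }

  // Refills the caller's batch in place; returns false when no rows are left.
  bool Next(FakeBatch& batch) {
    if (remaining_ == 0 || batch.values.empty()) return false;
    std::size_t n = std::min(remaining_, batch.values.size());
    for (std::size_t i = 0; i < n; ++i) batch.values[i] = next_value_++;
    batch.num_elements = n;
    remaining_ -= n;
    return true;
  }

 private:
  std::size_t remaining_;
  int64_t next_value_ = 0;
};

// Hypothetical scanner: the batch lives as long as the scanner does.
class FakeScanner {
 public:
  FakeScanner(std::size_t total_rows, std::size_t batch_size)
      : reader_(total_rows), batch_size_(batch_size) {}

  // Processes one batch per call; returns false once the input is exhausted.
  bool AssembleRows(int64_t* sum) {
    // Allocate only on the first call instead of on every call.
    if (!batch_) batch_ = reader_.CreateBatch(batch_size_);
    if (!reader_.Next(*batch_)) return false;
    for (std::size_t i = 0; i < batch_->num_elements; ++i) *sum += batch_->values[i];
    return true;
  }

  // The batch is destroyed only when the scanner is closed.
  void Close() { batch_.reset(); }

 private:
  FakeRowReader reader_;
  std::size_t batch_size_;
  std::unique_ptr<FakeBatch> batch_;
};

struct ScanResult {
  int64_t sum;
  int allocations;
};

// Runs a full scan and reports the row sum and the number of batch allocations.
ScanResult ScanAll(std::size_t total_rows, std::size_t batch_size) {
  FakeBatch::allocations = 0;
  FakeScanner scanner(total_rows, batch_size);
  int64_t sum = 0;
  while (scanner.AssembleRows(&sum)) {
  }
  scanner.Close();
  return ScanResult{sum, FakeBatch::allocations};
}
```

With this structure, a call such as ScanAll(10000, 1024) goes through ten AssembleRows() calls that return data but constructs the batch only once. Moving the CreateBatch() call out from behind the null check so it runs unconditionally would reproduce the per-call allocation and destruction cost that the issue describes.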