[ 
https://issues.apache.org/jira/browse/IMPALA-11185?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Quanlong Huang resolved IMPALA-11185.
-------------------------------------
    Fix Version/s: Impala 4.1.0
       Resolution: Fixed

> Reuse orc::ColumnVectorBatch in the scanner life-cycle
> ------------------------------------------------------
>
>                 Key: IMPALA-11185
>                 URL: https://issues.apache.org/jira/browse/IMPALA-11185
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Quanlong Huang
>            Assignee: Quanlong Huang
>            Priority: Major
>             Fix For: Impala 4.1.0
>
>         Attachments: tpch-q1-scanner-flame-graph.jpg
>
>
> In HdfsOrcScanner::AssembleRows(), we always re-create an 
> orc::ColumnVectorBatch. The ideal pattern is to reuse the batch and only 
> destroy it when the scanner is closed.
> In the flame graph of TPC-H Q1 collected by [~drorke], createRowBatch 
> and the destructors occupy almost half of the scanner time.
> !tpch-q1-scanner-flame-graph.jpg|width=979,height=426!
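
The reuse pattern described in the issue can be sketched as follows. This is a minimal, self-contained illustration and not Impala's actual change: FakeBatch, FakeRowReader, FakeScanner and ScanAll are hypothetical stand-ins that only mirror the shape of orc::ColumnVectorBatch, orc::RowReader (createRowBatch() and next()) and the scanner, so the allocation count can be observed without linking against the ORC library.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>
#include <memory>
#include <vector>

// Hypothetical stand-in for orc::ColumnVectorBatch. Counts constructions so
// the cost of re-creating the batch is visible.
struct FakeBatch {
  explicit FakeBatch(uint64_t capacity) : data(capacity), numElements(0) {
    ++allocations;
  }
  std::vector<int64_t> data;
  uint64_t numElements;
  static int allocations;
};
int FakeBatch::allocations = 0;

// Hypothetical stand-in for orc::RowReader: hands out batches and fills them.
class FakeRowReader {
 public:
  explicit FakeRowReader(uint64_t totalRows) : remaining_(totalRows) {}

  std::unique_ptr<FakeBatch> createRowBatch(uint64_t capacity) const {
    return std::make_unique<FakeBatch>(capacity);
  }

  // Fills the caller-owned batch in place; returns false when no rows remain.
  bool next(FakeBatch& batch) {
    uint64_t n = std::min<uint64_t>(remaining_, batch.data.size());
    for (uint64_t i = 0; i < n; ++i) batch.data[i] = nextValue_++;
    batch.numElements = n;
    remaining_ -= n;
    return n > 0;
  }

 private:
  uint64_t remaining_;
  int64_t nextValue_ = 0;
};

// Hypothetical scanner: the batch is a member, created lazily on the first
// call and released only in Close(), instead of once per AssembleRows() call.
class FakeScanner {
 public:
  FakeScanner(uint64_t totalRows, uint64_t batchCapacity)
      : reader_(totalRows), capacity_(batchCapacity) {}

  // Returns the number of rows read into the reused batch (0 at end of data).
  uint64_t AssembleRows() {
    if (!batch_) batch_ = reader_.createRowBatch(capacity_);
    assert(batch_ != nullptr);
    if (!reader_.next(*batch_)) return 0;
    return batch_->numElements;
  }

  void Close() { batch_.reset(); }

 private:
  FakeRowReader reader_;
  uint64_t capacity_;
  std::unique_ptr<FakeBatch> batch_;
};

// Drives a scanner to completion and returns the total number of rows seen.
uint64_t ScanAll(uint64_t totalRows, uint64_t batchCapacity) {
  FakeScanner scanner(totalRows, batchCapacity);
  uint64_t total = 0;
  while (uint64_t n = scanner.AssembleRows()) total += n;
  scanner.Close();
  return total;
}
```

With this structure, the number of batch constructions per scanner stays at one regardless of how many times AssembleRows() is called, which is the property the issue asks for.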



--
This message was sent by Atlassian Jira
(v8.20.1#820001)
