[ 
https://issues.apache.org/jira/browse/FLINK-14135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961663#comment-16961663
 ] 

Jingsong Lee commented on FLINK-14135:
--------------------------------------

[~jark] Can you assign this ticket to me?

> Introduce vectorized orc InputFormat for blink runtime
> -------------------------------------------------------
>
>                 Key: FLINK-14135
>                 URL: https://issues.apache.org/jira/browse/FLINK-14135
>             Project: Flink
>          Issue Type: Sub-task
>          Components: Connectors / ORC
>            Reporter: Jingsong Lee
>            Priority: Major
>
> VectorizedOrcInputFormat is introduced to read orc data in batches.
> When returning each row of data, instead of actually retrieving each field, 
> we use BaseRow's abstraction to return a Columnar Row-like view.
> This will greatly improve the downstream filtered scenarios, so that there is 
> no need to access redundant fields on the filtered data.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to