[ https://issues.apache.org/jira/browse/FLINK-14135?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16961663#comment-16961663 ]
Jingsong Lee commented on FLINK-14135: -------------------------------------- [~jark] Can you assign this ticket to me? > Introduce vectorized orc InputFormat for blink runtime > ------------------------------------------------------- > > Key: FLINK-14135 > URL: https://issues.apache.org/jira/browse/FLINK-14135 > Project: Flink > Issue Type: Sub-task > Components: Connectors / ORC > Reporter: Jingsong Lee > Priority: Major > > VectorizedOrcInputFormat is introduced to read orc data in batches. > When returning each row of data, instead of actually retrieving each field, > we use BaseRow's abstraction to return a Columnar Row-like view. > This will greatly improve the downstream filtered scenarios, so that there is > no need to access redundant fields on the filtered data. -- This message was sent by Atlassian Jira (v8.3.4#803005)