Jingsong Lee created FLINK-11899: ------------------------------------ Summary: Introduce VectorizedColumnRowInputParquetFormat for blink runtime Key: FLINK-11899 URL: https://issues.apache.org/jira/browse/FLINK-11899 Project: Flink Issue Type: New Feature Reporter: Jingsong Lee Assignee: Jingsong Lee
Vectorized Column Row Input Parquet Format is introduced to read parquet data in batches. When returning each row of data, instead of actually retrieving each field, we use BaseRow's abstraction to return a Columnar Row-like view. This will greatly improve the downstream filtered scenarios, so that there is no need to access redundant fields on the filtered data. -- This message was sent by Atlassian JIRA (v7.6.3#76005)