[
https://issues.apache.org/jira/browse/ARROW-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16834881#comment-16834881
]
Wes McKinney edited comment on ARROW-1012 at 5/7/19 3:32 PM:
-------------------------------------------------------------
[~hatem] yes -- the objective is to be able to configure a maximum record batch
length (e.g. 64K, as an example) that obtain a stream of record batches no
larger than that size. The {{GetRecordBatchReader}} returns an object that
performs entire-row-group reads without configuration. This should be clarified
in the JIRA description
was (Author: wesmckinn):
[~hatem] yes -- the objective is to be able to configure a maximum record batch
length (e.g. 64K, as an example) that obtain a stream of record batches no
larger than that size. The {{GetRecordBatchReader}} returns an object that
performs entire-row-group reads without configuration
> [C++] Create implementation of RecordBatchReader that reads from Apache
> Parquet files
> -------------------------------------------------------------------------------------
>
> Key: ARROW-1012
> URL: https://issues.apache.org/jira/browse/ARROW-1012
> Project: Apache Arrow
> Issue Type: New Feature
> Components: C++
> Reporter: Wes McKinney
> Priority: Major
> Labels: parquet
> Fix For: 0.14.0
>
>
> This will be enabled by ARROW-1008
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)