[jira] [Comment Edited] (ARROW-1012) [C++] Create implementation of RecordBatchReader that reads from Apache Parquet files

Wes McKinney (JIRA) Tue, 07 May 2019 08:33:15 -0700


    [ 
https://issues.apache.org/jira/browse/ARROW-1012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16834881#comment-16834881
 ]


Wes McKinney edited comment on ARROW-1012 at 5/7/19 3:32 PM:
-------------------------------------------------------------

[~hatem] yes -- the objective is to be able to configure a maximum record batch 
length (e.g. 64K, as an example) that obtain a stream of record batches no 
larger than that size. The {{GetRecordBatchReader}} returns an object that 
performs entire-row-group reads without configuration. This should be clarified 
in the JIRA description


was (Author: wesmckinn):
[~hatem] yes -- the objective is to be able to configure a maximum record batch 
length (e.g. 64K, as an example) that obtain a stream of record batches no 
larger than that size. The {{GetRecordBatchReader}} returns an object that 
performs entire-row-group reads without configuration

> [C++] Create implementation of RecordBatchReader that reads from Apache 
> Parquet files
> -------------------------------------------------------------------------------------
>
>                 Key: ARROW-1012
>                 URL: https://issues.apache.org/jira/browse/ARROW-1012
>             Project: Apache Arrow
>          Issue Type: New Feature
>          Components: C++
>            Reporter: Wes McKinney
>            Priority: Major
>              Labels: parquet
>             Fix For: 0.14.0
>
>
> This will be enabled by ARROW-1008



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Comment Edited] (ARROW-1012) [C++] Create implementation of RecordBatchReader that reads from Apache Parquet files

Reply via email to