[ 
https://issues.apache.org/jira/browse/ARROW-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-6969:
----------------------------------
    Labels: dataset pull-request-available  (was: dataset)

> [C++][Dataset] ParquetScanTask eagerly load file 
> -------------------------------------------------
>
>                 Key: ARROW-6969
>                 URL: https://issues.apache.org/jira/browse/ARROW-6969
>             Project: Apache Arrow
>          Issue Type: Improvement
>            Reporter: Francois Saint-Jacques
>            Assignee: Francois Saint-Jacques
>            Priority: Major
>              Labels: dataset, pull-request-available
>
> The file content should only be read when invoking ParquetScanTask::Scan, not 
> on construction. This blocks reading in a true streaming fashion with memory 
> constraints.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to