[ https://issues.apache.org/jira/browse/ARROW-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
ASF GitHub Bot updated ARROW-6969: ---------------------------------- Labels: dataset pull-request-available (was: dataset) > [C++][Dataset] ParquetScanTask eagerly load file > ------------------------------------------------- > > Key: ARROW-6969 > URL: https://issues.apache.org/jira/browse/ARROW-6969 > Project: Apache Arrow > Issue Type: Improvement > Reporter: Francois Saint-Jacques > Assignee: Francois Saint-Jacques > Priority: Major > Labels: dataset, pull-request-available > > The file content should only be read when invoking ParquetScanTask::Scan, not > on construction. This blocks reading in a true streaming fashion with memory > constraints. -- This message was sent by Atlassian Jira (v8.3.4#803005)