[ https://issues.apache.org/jira/browse/ARROW-6969?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Francois Saint-Jacques resolved ARROW-6969. ------------------------------------------- Fix Version/s: 1.0.0 Resolution: Fixed Issue resolved by pull request 5725 [https://github.com/apache/arrow/pull/5725] > [C++][Dataset] ParquetScanTask eagerly load file > ------------------------------------------------- > > Key: ARROW-6969 > URL: https://issues.apache.org/jira/browse/ARROW-6969 > Project: Apache Arrow > Issue Type: Improvement > Components: C++, Dataset > Reporter: Francois Saint-Jacques > Assignee: Francois Saint-Jacques > Priority: Major > Labels: dataset, pull-request-available > Fix For: 1.0.0 > > Time Spent: 0.5h > Remaining Estimate: 0h > > The file content should only be read when invoking ParquetScanTask::Scan, not > on construction. This blocks reading in a true streaming fashion with memory > constraints. -- This message was sent by Atlassian Jira (v8.3.4#803005)