[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17582080#comment-17582080 ]
Aldrin Montana commented on ARROW-8163: --------------------------------------- made a comment in ARROW-17306 about a minor PR, but I didn't think it needed to be re-opened. Just wanted to ping here so that it has a bit more visibility > [C++][Dataset] Allow FileSystemDataset's file list to be lazy > ------------------------------------------------------------- > > Key: ARROW-8163 > URL: https://issues.apache.org/jira/browse/ARROW-8163 > Project: Apache Arrow > Issue Type: Improvement > Components: C++ > Affects Versions: 0.16.0 > Reporter: Ben Kietzman > Assignee: Pavel Solodovnikov > Priority: Major > Labels: dataset > > A FileSystemDataset currently requires a full listing of files it contains on > construction, so a scan cannot start until all files in the dataset are > discovered. Instead it would be ideal if a large dataset could be constructed > with a lazy file listing so that scans can start immediately. -- This message was sent by Atlassian Jira (v8.20.10#820010)