[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635972#comment-17635972 ]

Apache Arrow JIRA Bot commented on ARROW-8163:
----------------------------------------------

This issue was last updated over 90 days ago, which may be an indication it is no longer being actively worked. To better reflect the current state, the issue is being unassigned per [project policy|https://arrow.apache.org/docs/dev/developers/bug_reports.html#issue-assignment]. Please feel free to re-take assignment of the issue if it is being actively worked, or if you plan to start that work soon.

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -------------------------------------------------------------
>
>                 Key: ARROW-8163
>                 URL: https://issues.apache.org/jira/browse/ARROW-8163
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>    Affects Versions: 0.16.0
>            Reporter: Ben Kietzman
>            Assignee: Pavel Solodovnikov
>            Priority: Major
>              Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on
> construction, so a scan cannot start until all files in the dataset are
> discovered. Instead it would be ideal if a large dataset could be constructed
> with a lazy file listing so that scans can start immediately.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
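The difference between the eager listing the issue describes and the lazy listing it asks for can be illustrated with a small sketch. This is plain Python over a local directory tree, not the Arrow C++ Dataset API; the names `list_files_eager`, `list_files_lazy`, and `scan` are hypothetical and exist only for this illustration.

```python
import os
import tempfile
from typing import Iterable, Iterator, List, Tuple


def list_files_eager(root: str) -> List[str]:
    """Discover every file under root before returning anything."""
    paths = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            paths.append(os.path.join(dirpath, name))
    return paths


def list_files_lazy(root: str) -> Iterator[str]:
    """Yield each file path as soon as its directory has been listed."""
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in filenames:
            yield os.path.join(dirpath, name)


def scan(paths: Iterable[str]) -> Iterator[Tuple[str, int]]:
    """Stand-in for a scan: report the size of each file as it arrives."""
    for path in paths:
        yield path, os.path.getsize(path)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as root:
        for i in range(3):
            with open(os.path.join(root, f"part-{i}.bin"), "wb") as f:
                f.write(b"x" * (i + 1))
        # With the lazy listing, the first result is available after
        # listing only the first directory, not the whole tree.
        first_path, first_size = next(scan(list_files_lazy(root)))
        print(first_path, first_size)
```

With `list_files_eager`, `scan` cannot produce its first result until the whole tree has been walked. With `list_files_lazy`, discovery and scanning are interleaved, which is the behavior the issue requests for large datasets.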
[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582080#comment-17582080 ]

Aldrin Montana commented on ARROW-8163:
---------------------------------------

I made a comment in ARROW-17306 about a minor PR, but I didn't think it needed to be re-opened. I just wanted to ping here so that it has a bit more visibility.
[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568181#comment-17568181 ]

Pavel Solodovnikov commented on ARROW-8163:
-------------------------------------------

[~westonpace] Thanks!
[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568144#comment-17568144 ]

Weston Pace commented on ARROW-8163:
------------------------------------

[~psolodovnikov] I have assigned the issue to you. You should also now have the permission to assign issues to yourself.
[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568021#comment-17568021 ]

Pavel Solodovnikov commented on ARROW-8163:
-------------------------------------------

I plan to start working on this item soon, can you please assign it to me?

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -------------------------------------------------------------
>
>                 Key: ARROW-8163
>                 URL: https://issues.apache.org/jira/browse/ARROW-8163
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>    Affects Versions: 0.16.0
>            Reporter: Ben Kietzman
>            Priority: Major
>              Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on
> construction, so a scan cannot start until all files in the dataset are
> discovered. Instead it would be ideal if a large dataset could be constructed
> with a lazy file listing so that scans can start immediately.

--
This message was sent by Atlassian Jira
(v8.20.10#820010)
[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17290623#comment-17290623 ]

Ben Kietzman commented on ARROW-8163:
-------------------------------------

cc [~westonpace]

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -------------------------------------------------------------
>
>                 Key: ARROW-8163
>                 URL: https://issues.apache.org/jira/browse/ARROW-8163
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: C++
>    Affects Versions: 0.16.0
>            Reporter: Ben Kietzman
>            Assignee: Ben Kietzman
>            Priority: Major
>              Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on
> construction, so a scan cannot start until all files in the dataset are
> discovered. Instead it would be ideal if a large dataset could be constructed
> with a lazy file listing so that scans can start immediately.

--
This message was sent by Atlassian Jira
(v8.3.4#803005)