[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-11-18 Thread Apache Arrow JIRA Bot (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635972#comment-17635972
 ] 

Apache Arrow JIRA Bot commented on ARROW-8163:
--

This issue was last updated over 90 days ago, which may be an indication it is 
no longer being actively worked. To better reflect the current state, the issue 
is being unassigned per [project 
policy|https://arrow.apache.org/docs/dev/developers/bug_reports.html#issue-assignment].
 Please feel free to re-take assignment of the issue if it is being actively 
worked, or if you plan to start that work soon.

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -
>
> Key: ARROW-8163
> URL: https://issues.apache.org/jira/browse/ARROW-8163
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Affects Versions: 0.16.0
> Reporter: Ben Kietzman
> Assignee: Pavel Solodovnikov
> Priority: Major
> Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on 
> construction, so a scan cannot start until all files in the dataset are 
> discovered. Instead it would be ideal if a large dataset could be constructed 
> with a lazy file listing so that scans can start immediately.
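
To make the difference described in the issue concrete, here is a minimal, library-agnostic sketch in Python. It does not use Arrow's API: the functions list_files_eager, list_files_lazy, scan, and make_demo_tree are hypothetical names invented for illustration, and the ".parquet" files are empty placeholders. The eager variant walks the whole directory tree before returning anything, while the lazy variant is a generator that hands out each path as soon as its directory has been read, so a consumer can begin work on the first file immediately.

```python
import os
import tempfile
from typing import Iterable, Iterator, List


def list_files_eager(root: str) -> List[str]:
    """Discover every file under root before returning (full listing up front)."""
    return sorted(
        os.path.join(dirpath, name)
        for dirpath, _dirs, names in os.walk(root)
        for name in names
    )


def list_files_lazy(root: str) -> Iterator[str]:
    """Yield each file path as soon as its directory has been read."""
    for dirpath, _dirs, names in os.walk(root):
        for name in sorted(names):
            yield os.path.join(dirpath, name)


def scan(paths: Iterable[str]) -> Iterator[str]:
    """Hypothetical stand-in for a scan: handles each path as it arrives."""
    for path in paths:
        yield os.path.basename(path)


def make_demo_tree() -> str:
    """Create a small temporary tree: two directories with two empty files each."""
    root = tempfile.mkdtemp()
    for part in ("part=0", "part=1"):
        os.makedirs(os.path.join(root, part))
        for name in ("a.parquet", "b.parquet"):
            # Empty placeholder files; nothing ever reads their contents.
            open(os.path.join(root, part, name), "w").close()
    return root
```

With the lazy variant, next(scan(list_files_lazy(root))) produces a result after reading only the directories needed to find the first file, whereas scan(list_files_eager(root)) cannot produce anything until the entire tree has been walked. How this would map onto FileSystemDataset itself is left open by the issue.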



--
This message was sent by Atlassian Jira
(v8.20.10#820010)


[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-08-19 Thread Aldrin Montana (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582080#comment-17582080
 ] 

Aldrin Montana commented on ARROW-8163:
---

I made a comment in ARROW-17306 about a minor PR, but I didn't think it needed 
to be re-opened. I just wanted to ping here so that it has a bit more visibility.



[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-07-18 Thread Pavel Solodovnikov (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568181#comment-17568181
 ] 

Pavel Solodovnikov commented on ARROW-8163:
---

[~westonpace] Thanks!



[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-07-18 Thread Weston Pace (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568144#comment-17568144
 ] 

Weston Pace commented on ARROW-8163:


[~psolodovnikov] I have assigned the issue to you.  You should also now have 
the permission to assign issues to yourself.



[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-07-18 Thread Pavel Solodovnikov (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17568021#comment-17568021
 ] 

Pavel Solodovnikov commented on ARROW-8163:
---

I plan to start working on this item soon. Could you please assign it to me?

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -
>
> Key: ARROW-8163
> URL: https://issues.apache.org/jira/browse/ARROW-8163
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Affects Versions: 0.16.0
> Reporter: Ben Kietzman
> Priority: Major
> Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on 
> construction, so a scan cannot start until all files in the dataset are 
> discovered. Instead it would be ideal if a large dataset could be constructed 
> with a lazy file listing so that scans can start immediately.





[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2021-02-24 Thread Ben Kietzman (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17290623#comment-17290623
 ] 

Ben Kietzman commented on ARROW-8163:
-

cc [~westonpace]

> [C++][Dataset] Allow FileSystemDataset's file list to be lazy
> -
>
> Key: ARROW-8163
> URL: https://issues.apache.org/jira/browse/ARROW-8163
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Affects Versions: 0.16.0
> Reporter: Ben Kietzman
> Assignee: Ben Kietzman
> Priority: Major
> Labels: dataset
>
> A FileSystemDataset currently requires a full listing of files it contains on 
> construction, so a scan cannot start until all files in the dataset are 
> discovered. Instead it would be ideal if a large dataset could be constructed 
> with a lazy file listing so that scans can start immediately.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)