[ 
https://issues.apache.org/jira/browse/ARROW-11782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17291419#comment-17291419
 ] 

Weston Pace commented on ARROW-11782:
-------------------------------------

Thanks for pointing this out.  I think we can't get rid of ScanTask.  I will 
create a ScanTaskInternal (or some similar name) for the work I'm doing and 
leave ScanTask's interface as is.

In the future though it might be better to change Scanner::Scan to return a 
RecordBatchIterator instead of ScanTaskIterator.  Otherwise we're putting the 
burden on the user to manage how many files are loaded concurrently.  I think 
that should either be hidden completely or controlled by some option 
(MaxConcurrentFiles or, better yet, MaxAllocatedBytes)

> [GLib][Dataset] Remove bindings for internal classes
> ----------------------------------------------------
>
>                 Key: ARROW-11782
>                 URL: https://issues.apache.org/jira/browse/ARROW-11782
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: GLib
>    Affects Versions: 3.0.0
>            Reporter: Ben Kietzman
>            Priority: Major
>             Fix For: 4.0.0
>
>
> GLib and ruby include bindings for internal classes such as ScanOptions, 
> ScanContext, InMemoryScanTask, ScanTask, ... These are probably unnecessary 
> and should be removed to present a simpler interface less prone to breakage 
> under refactoring of the wrapped classes 
> https://github.com/apache/arrow/pull/9532/checks?check_run_id=1974229719#step:8:2071



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to