Ben Kietzman created ARROW-11797:
------------------------------------

Summary: [C++][Dataset] Provide Scanner methods to yield/visit scanned batches
Key: ARROW-11797
URL: https://issues.apache.org/jira/browse/ARROW-11797
Project: Apache Arrow
Issue Type: Improvement
Components: C++
Affects Versions: 3.0.0
Reporter: Ben Kietzman
Assignee: Ben Kietzman
Fix For: 4.0.0

From discussion in https://issues.apache.org/jira/browse/ARROW-11782

It would be useful for consumers of Scanner to receive an iterator of scanned record batches, or to apply a visitor to batches as they are scanned, without handling ScanTasks directly. For example, this could enable aggregations or other computations that do not require the entire table to be materialized.
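
To make the two requested access styles concrete, here is a minimal, self-contained sketch of a streaming sum computed once with a visitor and once with a pull-style iterator. It does not use the Arrow API: `Batch`, `ToyScanner`, `VisitBatches`, `ScanBatches`, and `BatchIterator` are hypothetical names invented for illustration, and the methods eventually added to Scanner may look quite different.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <functional>
#include <utility>
#include <vector>

// Hypothetical stand-in for a record batch: one column of int64 values.
struct Batch {
  std::vector<int64_t> values;
};

// Hypothetical stand-in for a scanner. This is NOT arrow::dataset::Scanner;
// it only illustrates the visitor and iterator shapes described above.
class ToyScanner {
 public:
  explicit ToyScanner(std::vector<Batch> batches) : batches_(std::move(batches)) {}

  // Visitor style: invoke `visit` on each batch as it is "scanned".
  void VisitBatches(const std::function<void(const Batch&)>& visit) const {
    assert(visit && "visitor must be callable");
    for (const Batch& batch : batches_) visit(batch);
  }

  // Iterator style: a pull-based cursor over the scanned batches.
  class BatchIterator {
   public:
    explicit BatchIterator(const std::vector<Batch>& batches) : batches_(&batches) {}

    // Returns the next batch, or nullptr once the scan is exhausted.
    const Batch* Next() {
      if (pos_ >= batches_->size()) return nullptr;
      return &(*batches_)[pos_++];
    }

   private:
    const std::vector<Batch>* batches_;
    std::size_t pos_ = 0;
  };

  BatchIterator ScanBatches() const { return BatchIterator(batches_); }

 private:
  std::vector<Batch> batches_;
};

// Aggregate with a visitor: only the running total is kept.
int64_t SumWithVisitor(const ToyScanner& scanner) {
  int64_t total = 0;
  scanner.VisitBatches([&total](const Batch& batch) {
    for (int64_t v : batch.values) total += v;
  });
  return total;
}

// Aggregate with an iterator: the caller pulls one batch at a time.
int64_t SumWithIterator(const ToyScanner& scanner) {
  int64_t total = 0;
  ToyScanner::BatchIterator it = scanner.ScanBatches();
  while (const Batch* batch = it.Next()) {
    for (int64_t v : batch->values) total += v;
  }
  return total;
}
```

In both variants the consumer keeps only a running total, so no table is ever assembled. The toy scanner holds its batches in memory purely to stay self-contained; a real scanner would produce them lazily.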