Mike Seddon created ARROW-11616: ----------------------------------- Summary: [Rust][DataFusion] Expose collect_partitioned for DataFrame Key: ARROW-11616 URL: https://issues.apache.org/jira/browse/ARROW-11616 Project: Apache Arrow Issue Type: Improvement Components: Rust - DataFusion Reporter: Mike Seddon Assignee: Mike Seddon
The DataFrame API has a `collect` method which invokes the `collect(plan: Arc<dyn ExecutionPlan>) -> Result<Vec<RecordBatch>>` function which will collect records into a single vector of RecordBatches removing the partitioning via `MergeExec`. The DataFrame should also expose the `collect_partitioned` method so that partitions can be maintained. ``` collect_partitioned( plan: Arc<dyn ExecutionPlan>, ) -> Result<Vec<Vec<RecordBatch>>> ``` -- This message was sent by Atlassian Jira (v8.3.4#803005)