[GitHub] [arrow] seddonm1 opened a new pull request #9485: ARROW-11616: [Rust][DataFusion] Add collect_partitioned on DataFrame

GitBox Fri, 12 Feb 2021 18:46:44 -0800


seddonm1 opened a new pull request #9485:
URL: https://github.com/apache/arrow/pull/9485



   The DataFrame API has a `collect` method, which invokes the `collect(plan: 
Arc<dyn ExecutionPlan>) -> Result<Vec<RecordBatch>>` function. That function 
collects the records into a single vector of RecordBatches, removing any 
partitioning via `MergeExec`.
   
   This PR adds a `collect_partitioned` method to DataFrame so that the 
partitioning is preserved. The partitioned result can then be passed easily 
into a new `MemTable`.
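
   To illustrate the difference in result shape, here is a minimal std-only 
sketch. It does not use the DataFusion API: `Batch`, `PartitionedOutput` and 
`sample_output` are hypothetical stand-ins invented for this example, and the 
real methods return `Result`s of `RecordBatch` values rather than these toy 
types.

```rust
// Hypothetical stand-in for Arrow's RecordBatch: just a batch of row values.
// (The real type holds typed columns and a schema.)
#[derive(Debug, Clone, PartialEq)]
struct Batch(Vec<i32>);

// Hypothetical stand-in for an executed plan: one Vec<Batch> per partition.
struct PartitionedOutput {
    partitions: Vec<Vec<Batch>>,
}

impl PartitionedOutput {
    // Mirrors the shape of `collect`: every partition is merged into a
    // single vector, so the partition boundaries are lost.
    fn collect(&self) -> Vec<Batch> {
        self.partitions.iter().flatten().cloned().collect()
    }

    // Mirrors the shape of `collect_partitioned`: one inner vector per
    // partition, so the boundaries are preserved.
    fn collect_partitioned(&self) -> Vec<Vec<Batch>> {
        self.partitions.clone()
    }
}

// Builds a small two-partition input for illustration.
fn sample_output() -> PartitionedOutput {
    PartitionedOutput {
        partitions: vec![
            vec![Batch(vec![1, 2]), Batch(vec![3])],
            vec![Batch(vec![4, 5, 6])],
        ],
    }
}
```

   With `sample_output()`, the flattened form contains three batches in one 
vector, while the partitioned form keeps two inner vectors (of two batches and 
one batch). The nested form is the one that carries the partitioning along 
when the data is handed on.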
   
   @andygrove 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org
