seddonm1 opened a new pull request #9485:
URL: https://github.com/apache/arrow/pull/9485


   The DataFrame API has a `collect` method which invokes the `collect(plan: 
Arc<dyn ExecutionPlan>) -> Result<Vec<RecordBatch>>` function which will 
collect records into a single vector of RecordBatches removing any partitioning 
via `MergeExec`.
   
   This PR adds the DataFrame `collect_partitioned` method so that partitioning 
can be maintained. This allows easy passing into a new `MemTable`.
   
   @andygrove 


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to