andygrove opened a new issue, #15:
URL: https://github.com/apache/arrow-ballista/issues/15

   **Is your feature request related to a problem or challenge? Please describe 
what you are trying to do.**
   We have Python bindings for DataFusion's ExecutionContext. It would be good 
to also support Ballista's BallistaContext so that we can use Python to run 
distributed queries.
   
   **Describe the solution you'd like**
   Probably something like this?
   
   ```python
   import ballista
   
   ctx = ballista.BallistaContext
   df = ctx.read_parquet(...)
   ```
   
   **Describe alternatives you've considered**
   Another approach might be to have ballista be an optional feature of 
DataFusion and then enable new methods on the DataFusion ExecutionContext 
instead but that would probably result in tons of additional dependencies and 
blur the lines between DataFusion and Ballista and I think there is a strong 
case for DataFusion=lib/embedded and Ballista=distributed.
   
   
   **Additional context**
   N/A


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to