wjones127 commented on issue #414:
URL: 
https://github.com/apache/arrow-datafusion-python/issues/414#issuecomment-1602003194

   > I think Delta rust is using Datafusion internally
   
   There are three senses in which we integrate with DataFusion:
   
   1. We use DataFusion components inside our own functions.
   2. We have a plugin for Rust DataFusion, but it can only be used from Rust.
   3. We can export PyArrow datasets, which datafusion-python can read.
   
   It's only the third one that applies to this library.
   
   > I could not find any documentation though how to use Delta table with Python datafusion
   
   Our integration with Python DataFusion is similar to our DuckDB integration: create a PyArrow dataset, register it with DataFusion, and query as desired.
   
   ```python
   from datafusion import SessionContext
   from deltalake import DeltaTable
   
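   # Create a DataFusion context
   ctx = SessionContext()

   # Expose the Delta table as a PyArrow dataset and register it under a table name
   delta_table = DeltaTable("path/to/your/table")
   ctx.register_dataset("my_table", delta_table.to_pyarrow_dataset())

   # Query the registered table with SQL
   df = ctx.sql("SELECT * FROM my_table")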
   ```

