iammax commented on issue #541:
URL: 
https://github.com/apache/arrow-datafusion-python/issues/541#issuecomment-1849969601

   I was having a similar issue with version 31.0.0. Small example snip:
   ``` 
   import datafusion as dfu
   ctx = dfu.SessionContext()
   ctx.register_csv('delta', 'test.csv')
   result = ctx.sql('SELECT col1, COUNT(DISTINCT col2) FROM delta GROUP BY 
col1')
   ```
   This assigns result to a ```datafusion.Dataframe``` object as expected. I 
can see it has the correct values in it by printing it out in terminal/jupyter. 
However, if I do ```result.to_polars()``` (or to_anything else) I get the same 
error as the original post.
   
   ```
   ArrowInvalid: Schema at index 0 was different: 
   col1: int64
   COUNT(DISTINCT delta.col2): int64
   vs
   delta.col1: int64
   COUNT(DISTINCT delta.col2): int64
   ```
   
   However it works in version 33.0,0 (I think that's the current version), so 
I assume there was a fix. 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to