TomScheffers commented on issue #2189: URL: https://github.com/apache/arrow/issues/2189#issuecomment-778802864
For my own projects, I wrote the [pyarrow_ops package](https://github.com/TomScheffers/pyarrow_ops), which performs pandas-like operations directly on a pyarrow.Table. It currently supports join, groupby, filters, drop_duplicates and head operations, and it can easily be extended. Maybe people are interested in extending this work or optimizing it (for example, by calling more arrow.compute operations directly)? Let me know!
