TomScheffers commented on issue #2631: URL: https://github.com/apache/arrow/issues/2631#issuecomment-778799406
Hi @hugo-pires, while we are waiting for the pyarrow package to mature, I made my own [package](https://github.com/TomScheffers/pyarrow_ops) to perform pandas-like operations directly on a pyarrow.Table. Currently it supports (inner) join, groupby (iterables and aggregations), filters (using multiple predicates), drop_duplicates and head (for printing). Although it is not yet as extensive as pandas, performance is actually very similar. Feel free to contact me if you need anything!
