Hi,

I saw really nice features like groupby and join developed recently.
I like how Dataset is supported for joins and how streamed processing is
gaining momentum in Arrow.

Does Apache Arrow have the concept of remote datasets eg using Arrow
Flight? Or will this happen directly using S3 and other protocols only? I
know some work has started in Substrait, but that might be a whole new
level of integration, hence my question focusing on data first.

I was trying to browse the JIRA issues, but the future picture wasn't clear
based on that

Best regards,
Adam Lippai

Reply via email to