Hi, I saw really nice features like groupby and join developed recently. I like how Dataset is supported for joins and how streamed processing is gaining momentum in Arrow.
Does Apache Arrow have the concept of remote datasets eg using Arrow Flight? Or will this happen directly using S3 and other protocols only? I know some work has started in Substrait, but that might be a whole new level of integration, hence my question focusing on data first. I was trying to browse the JIRA issues, but the future picture wasn't clear based on that Best regards, Adam Lippai