jcalvarezj commented on issue #24148:
URL: https://github.com/apache/beam/issues/24148#issuecomment-1314803183

> I believe the main way you can do this in Python is directly using [`CoGroupByKey`](https://beam.apache.org/releases/pydoc/2.42.0/apache_beam.transforms.util.html?highlight=cogroupbykey#apache_beam.transforms.util.CoGroupByKey)
>
> Your request is to have a friendly wrapper for an equijoin on a particular schema field of two schema-aware PCollections? This seems like a possible project as a first contribution. A bit advanced, but doable.
>
> CC @tvalentyn

That's right. I've worked with `CoGroupByKey` successfully using a couple of workarounds, but I feel that Join operations on schema-aware PCollections would save a lot of repetitive work in many use cases, since that is a more direct and friendly approach, as far as I've seen in the Java implementations.

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
