GroupByKey is generating fake duplicates on Dataflow

2020-12-18 Thread bits horoscope
Hi Apache Beam community, I have been dealing with a bug in a GroupByKey step. I'm reading an XML file with many info, something like this. ABC-37717 First Listing ABC-37718 Second listing ABC-37719 Third listing I want to work only with the listings with unique code and discard the dupl

Handshake_failure running a Dataflow pipeline

2020-08-27 Thread bits horoscope
Hello, Apache Beam community! Hope everything goes ok at this time. I write to you asking for your guide and help. I have been facing some problems accessing HTTPS resources from a pipeline deployed in Dataflow. The problem occurs only when I run with DataflowRunner, the DirectRunner is working o