Re: FlatMapGroupsWithStateFunction is called thrice - Production use case.

2021-03-18 Thread Kuttaiah Robin
Hi Jungtaek, Thanks for looking into it. We use spark-2.4.3. I removed most of our code and pasted here just to understand the flow. Sorry for the delay. I would try to provide a simple reproducer when I find time, but this is really hurting us. Another observation I see is basically only if I

Re: FlatMapGroupsWithStateFunction is called thrice - Production use case.

2021-03-11 Thread Jungtaek Lim
Hi, Could you please provide the Spark version? Also it would be pretty much helpful if you could provide a simple reproducer, like placing your reproducer which can simply be built (mvn or gradle or sbt) into your Github repository, plus the set of input data to see the behavior. Worth to know

FlatMapGroupsWithStateFunction is called thrice - Production use case.

2021-03-10 Thread Kuttaiah Robin
Hello, I have a use case where I need to read events(non correlated) from a source kafka topic, then correlate and push forward to another target topic. I use spark structured streaming with FlatMapGroupsWithStateFunction along with GroupStateTimeout.ProcessingTimeTimeout() . After each