Re: Sync two DataStreams

2020-04-04 Thread David Anderson
There are a few ways to pre-ingest data from a side input before beginning to process another stream. One is to use the State Processor API [1] to create a savepoint that has the data from that side input in its state. For a simple example of bootstrapping state into a savepoint, see [2]. Another

Sync two DataStreams

2020-04-03 Thread Georgi Stoyanov
Hi, I want to implement a flow where the data from one stream is needed to validate data for the second stream when the job is started without a savepoint or checkpoint. Both streams read from Kafka. I want the data in the first one to be fully read and then to check the events from the