Hello guys, I have a job with multiple Kafka sources. They all contain certain historical data. If you use the events-time window, it will cause sources with less data to cover more sources through water mark.
I can think of a solution, Implement a scheduler in the source phase, But it is quite complicated to implement. Are ther otherbetter solutions? Any suggestions? Thanks!