Hello Flink Experts.


We have a Flink job that consumes data from Kafka and ingests it into a 
multi-site (Azure-east – Azure-west) replicated Cassandra cluster.

Now we need to aggregate the data hourly. The problem is that device X can 
report once to site A and once to site B. This means that some messages for 
that device will be processed by Flink in site A and others will be 
processed by Flink in site B.

I want an hourly aggregation result that reflects all messages transmitted by 
a specific device X, regardless of which site received them.
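
To make the requirement concrete, here is a minimal sketch in plain Python (not Flink code) of the result I am after. It assumes each site could produce a partial hourly aggregate per device, and that the partials are then combined. The message shape (device id, epoch seconds, value), the count/sum aggregate, and the function names are all hypothetical, chosen only for illustration.

```python
from collections import defaultdict

def hour_bucket(epoch_seconds):
    """Truncate an event timestamp (epoch seconds) to the start of its hour."""
    return epoch_seconds - (epoch_seconds % 3600)

def partial_aggregate(messages):
    """Per-site step: fold messages into (device_id, hour) -> (count, sum)."""
    acc = defaultdict(lambda: (0, 0.0))
    for device_id, ts, value in messages:
        key = (device_id, hour_bucket(ts))
        count, total = acc[key]
        acc[key] = (count + 1, total + value)
    return dict(acc)

def merge_partials(*partials):
    """Cross-site step: add up the partial (count, sum) pairs per key."""
    merged = defaultdict(lambda: (0, 0.0))
    for partial in partials:
        for key, (count, total) in partial.items():
            c, t = merged[key]
            merged[key] = (c + count, t + total)
    return dict(merged)

# Hypothetical sample: device "X" reports twice to site A and once to site B
# within the same hour.
site_a = [("X", 7200, 10.0), ("X", 7260, 20.0)]
site_b = [("X", 9000, 30.0)]

combined = merge_partials(partial_aggregate(site_a), partial_aggregate(site_b))
```

Here `combined` holds a single entry for device X covering all three messages. Count and sum merge cleanly because they are additive; whether this can be done safely across two sites with replicated Cassandra is exactly what I am unsure about.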

Are there any best practices to handle multi-site ingestion?

Any idea how to handle the scenario above?

Thanks in advance.
