In the future, these kinds of questions are more appropriate for the user
mailing list (u...@flink.apache.org). Dev is for internal Flink development.

On Wed, May 15, 2019 at 12:06 PM Seth Wiesman <s...@ververica.com> wrote:

> Hi Gregory,
>
> The easiest solution would be to include the site in your key so that at
> query time the rows from each site can be aggregated together.
>
> Instead of <Key, Value>, the table would be <Key, Site, Value> and your
> query would become Select sum(value) FROM table GROUP BY key;
>
> Otherwise, you will need to get all that data into a single site to
> perform a final aggregation prior to writing to Cassandra.
>
> On Wed, May 15, 2019 at 3:45 AM Melekh, Gregory <
> gregory.mel...@intl.att.com> wrote:
>
>> Hello Flink Experts.
>>
>>
>>
>> We have Flink job consuming data from Kafka and ingest it to multi-site
>> (Azure-east – Azure-west) replicated Cassandra.
>>
>> Now we have to aggregate data hourly. The problem is that device X can
>> report once on site A and once on site B. This means that some messages for
>> that device, will be processed by Flink in site A and some messages will be
>> processed on site B.
>>
>> I want an aggregation result that will reflect all messages transmitted
>> by specific device X.
>>
>> Are there any best practices to handle multi-site ingestion?
>>
>> Any idea how to handle the scenario above?
>>
>> Thanks in advance.
>>
>>
>
> --
>
> Seth Wiesman | Solutions Architect
>
> +1 314 387 1463
>
> <https://www.ververica.com/>
>
> Follow us @VervericaData
>
> --
>
> Join Flink Forward <https://flink-forward.org/> - The Apache Flink
> Conference
>
> Stream Processing | Event Driven | Real Time
>
> --
>
> Data Artisans GmbH | Invalidenstrasse 115, 10115 Berlin, Germany
>
> --
> Data Artisans GmbH
> Registered at Amtsgericht Charlottenburg: HRB 158244 B
> Managing Directors: Dr. Kostas Tzoumas, Dr. Stephan Ewen
>


-- 

Seth Wiesman | Solutions Architect

+1 314 387 1463

<https://www.ververica.com/>

Follow us @VervericaData

--

Join Flink Forward <https://flink-forward.org/> - The Apache Flink
Conference

Stream Processing | Event Driven | Real Time

--

Data Artisans GmbH | Invalidenstrasse 115, 10115 Berlin, Germany

--
Data Artisans GmbH
Registered at Amtsgericht Charlottenburg: HRB 158244 B
Managing Directors: Dr. Kostas Tzoumas, Dr. Stephan Ewen

Reply via email to