​Hello all,


We have a kafka topic with lots of partitions where data is partitioned by an 
upstream publisher on "session".



In flink we read this topic and another single partition topic which contains 
configuration definitions for a little flatMap based operation.  We also do a 
little bit of processing on the incoming data before combining it with the 
configuration in a soft  join-like operation.



forgive the ascii art:

​
N-partition Kafka "Data" source -> Map -----\
                                             |-> connect -> CoFlatMap -> ...
1-partition Kafka "Config" source -> global-/

What we *think* are seeing is that this is broken up into 3 tasks and that the 
ordering of events in the kafka "Data" source isn't maintained when we see it 
in the CoFlatMap.  I have tried to add a custom partitioner on "Session" before 
the connect and it seemed to not help.  I can't use KeyBy because the Config 
stream has no "Session key"

Should we be able to assume anything about ordering of events without 
explicitly windowing/sorting/chaining?

-Bart




________________________________
This e-mail may contain CONFIDENTIAL AND PROPRIETARY INFORMATION and/or 
PRIVILEGED AND CONFIDENTIAL COMMUNICATION intended solely for the recipient 
and, therefore, may not be retransmitted to any party outside of the 
recipient's organization without the prior written consent of the sender. If 
you have received this e-mail in error please notify the sender immediately by 
telephone or reply e-mail and destroy the original message without making a 
copy. Deep Silver, Inc. accepts no liability for any losses or damages 
resulting from infected e-mail transmissions and viruses in e-mail attachments.

Reply via email to