Hi everyone,

I would like to start a discussion on FLINK-25132: KafkaSource cannot work
with object-reusing DeserializationSchema.

The implementation of FLINK-25132 moves the deserialization of kafka
records from SplitFetcher thread to thread that runs KafkaRecordEmitter,
namely flink's main task thread. In our experience, even without triggering
this object-reusing bug in KafkaDeserializerSchema , it still caused a
performance regression of 20% to 30%.

I tend to add a parameter to flink conf to indicate where the
deserialization process needs to be placed, that is, in the fetcher thread
or the main thread, since not all deserializer will encounter the mentioned
object-reusing bug.

Looking forward to your reply.

Best,
Jun He

Reply via email to