Re: [DISCUSS] KIP-1033: Add Kafka Streams exception handler for exceptions occuring during processing

Matthias J. Sax Wed, 10 Apr 2024 15:06:41 -0700

Thanks for the KIP. Great discussion.

I am not sure if I understand the proposal from Bruno to hand in theprocessor node id? Isn't this internal (could not even find it quickly).We do have a processor name, right? Or do I mix up something?

Another question is about `ProcessingContext` -- it contains a lot of(potentially irrelevant?) metadata. We should think carefully about whatwe want to pass in and what not -- removing stuff is hard, but addingstuff is easy. It's always an option to create a new interface that onlyexposes stuff we find useful, and allows us to evolve this interfaceindependent of others. Re-using an existing interface always has thedanger to introduce an undesired coupling that could bite us in thefuture. -- It make total sense to pass in `RecordMetadata`, but`ProcessingContext` (even if already limited compared to`ProcessorContext`) still seems to be too broad? For example, there is`getStateStore()` and `schedule()` methods which I think we should notexpose.

The other interesting question is about "what record gets passed in".For the PAPI, passing in the Processor's input record make a lot ofsense. However, for DSL operators, I am not 100% sure? The DSL oftenuses internal types not exposed to the user, and thus I am not sure ifusers could write useful code for this case? -- In general, I stillagree that the handler should be implement with a try-catch around`Processor.process()` but it might not be too useful for DSL processor.Hence, I am wondering if we need to so something more in the DSL? Idon't have a concrete proposal (a few high level ideas only) and if wedon't do anything special for the DSL I am ok with moving forward withthis KIP as-is, but we should be aware of potential limitations for DSLusers. We can always do a follow up KIP to close gaps when we understandthe impact better -- covering the DSL would also expand the scope ofthis KIP significantly...

About the metric: just to double check. Do we think it's worth to add anew metric? Or could we re-use the existing "dropped record metric"?




-Matthias


On 4/10/24 5:11 AM, Sebastien Viale wrote:

Hi,

You are right, it will simplify types.

We update the KIP

regards

Sébastien *VIALE***

*MICHELIN GROUP* - InfORMATION Technology
*Technical Expert Kafka*

  Carmes / Bâtiment A17 4e / 63100 Clermont-Ferrand

------------------------------------------------------------------------
*De :* Bruno Cadonna <cado...@apache.org>
*Envoyé :* mercredi 10 avril 2024 10:38
*À :* dev@kafka.apache.org <dev@kafka.apache.org>

*Objet :* [EXT] Re: [DISCUSS] KIP-1033: Add Kafka Streams exceptionhandler for exceptions occuring during processingWarning External sender Do not click on any links or open anyattachments unless you trust the sender and know the content is safe.


Hi Loïc, Damien, and Sébastien,

Great that we are converging!


3.
Damien and Loïc, I think in your examples the handler will receive
Record<Object, Object> because an Record<Object, Object> is passed to
the processor in the following code line:
https://github.com/apache/kafka/blob/044d058e03bde8179ed57994d0a77ae9bff9fc10/streams/src/main/java/org/apache/kafka/streams/processor/internals/ProcessorNode.java#L152
 
<https://github.com/apache/kafka/blob/044d058e03bde8179ed57994d0a77ae9bff9fc10/streams/src/main/java/org/apache/kafka/streams/processor/internals/ProcessorNode.java#L152>

I see that we do not need to pass into the the handler a Record<byte[],
byte[]> just because we do that for the DeserializationExceptionHandler
and the ProductionExceptionHandler. When those two handlers are called,
the record is already serialized. This is not the case for the
ProcessingExceptionHandler. However, I would propose to use Record<?, ?>
for the record that is passed to the ProcessingExceptionHandler because
it makes the handler API more flexible.


Best,
Bruno

This email was screened for spam and malicious content but exercisecaution anyway.





On 4/9/24 9:09 PM, Loic Greffier wrote:
 > Hi Bruno and Bill,
 >
 > To complete the Damien's purposes about the point 3.
 >

> Processing errors are caught and handled by theProcessingErrorHandler, at the precise moment when records are processedby processor nodes. The handling will be performed in the "process"method of the ProcessorNode, such as:

 >
 > public void process(final Record<KIn, VIn> record) {
 > ...
 >
 > try {
 > ...
 > } catch (final ClassCastException e) {
 > ...
 > } catch (Exception e) {

> ProcessingExceptionHandler.ProcessingHandlerResponse response =this.processingExceptionHandler

 > .handle(internalProcessorContext, (Record<Object, Object>) record, e);
 >

> if (response ==ProcessingExceptionHandler.ProcessingHandlerResponse.FAIL) {> throw new StreamsException("Processing exception handler is set tofail upon" +

 > " a processing error. If you would rather have the streaming pipeline" +
 > " continue after a processing error, please set the " +
 > DEFAULT_PROCESSING_EXCEPTION_HANDLER_CLASS_CONFIG + " appropriately.",
 > e);
 > }
 > }
 > }

> As you can see, the record is transmitted to theProcessingExceptionHandler as a Record<Object,Object>, as we are dealingwith the input record of the processor at this point. It can be anytype, including non-serializable types, as suggested by the Damien'sexample. As the ProcessingErrorHandler is not intended to perform anyserialization, there should be no issue for the users to handle aRecord<Object,Object>.

 >
 > I follow Damien on the other points.
 >
 > For point 6, underlying public interfaces are renamed as well:
 > - The ProcessingHandlerResponse

> - TheProcessingLogAndContinueExceptionHandler/ProcessingLogAndFailExceptionHandler> - The configuration DEFAULT_PROCESSING_EXCEPTION_HANDLER_CLASS_CONFIG(default.processing.exception.handler)

 >
 > Regards,
 >
 > Loïc
 >
 > De : Damien Gasparina <d.gaspar...@gmail.com>
 > Envoyé : mardi 9 avril 2024 20:08
 > À : dev@kafka.apache.org

> Objet : Re: [EXT] Re: [DISCUSS] KIP-1033: Add Kafka Streams exceptionhandler for exceptions occuring during processing

> Warning External sender Do not click on any links or open anyattachments unless you trust the sender and know the content is safe.

 >
 > Hi Bruno, Bill,
 >
 > First of all, thanks a lot for all your useful comments.
 >
 >> 1. and 2.
 >> I am wondering whether we should expose the processor node ID -- which
 >> basically is the processor node name -- in the ProcessingContext
 >> interface. I think the processor node ID fits well in the
 >> ProcessingContext interface since it already contains application ID and
 >> task ID and it would make the API for the handler cleaner.
 >

> That's a good point, the actual ProcessorContextImpl is alreadyholding the> current node in an attribute (currentNode), thus exposing the node IDshould

 > not be a problem. Let me sleep on it and get back to you regarding this
 > point.
 >
 >> 3.
 >> Could you elaborate -- maybe with an example -- when a record is in a
 >> state in which it cannot be serialized? This is not completely clear to
 > me.
 >
 > The Record passed to the handler is the input record to the processor. In
 > the Kafka Streams API, it could be any POJO.
 > e.g. with the following topology `
 > streamsBuilder.stream("x")
 > .map((k, v) -> new KeyValue("foo", Pair.of("hello",
 > "world")))
 > .forEach((k, v) -> throw new RuntimeException())
 > I would expect the handler to receive a Record<String, Pair<String,
 > String>>.
 >
 >> 4.
 >> Regarding the metrics, it is not entirely clear to me what the metric
 >> measures. Is it the number of calls to the process handler or is it the
 >> number of calls to process handler that returned FAIL?
 >> If it is the former, I was also wondering whether it would be better to
 >> put the task-level metrics to INFO reporting level and remove the
 >> thread-level metric, similar to the dropped-records metric. You can
 >> always roll-up the metrics to the thread level in your preferred
 >> monitoring system. Or do you think we end up with to many metrics?
 >
 > We were thinking of the former, measuring the number of calls to the
 > process handler. That's a good point, having the information at the task
 > level could be beneficial. I updated the KIP to change the metric level
 > and to clarify the wording.
 >
 >> 5.
 >> What do you think about naming the handler ProcessingExceptionHandler
 >> instead of ProcessExceptionHandler?
 >> The DeserializationExceptionHanlder and the ProductionExceptionHandler
 >> also use the noun of the action in their name and not the verb.
 >
 > Good catch, I updated the KIP to rename it ProcessingExceptionHandler.
 >
 >> 6.
 >> What record is exactly passed to the handler?
 >> Is it the input record to the task? Is it the input record to the
 >> processor node? Is it the input record to the processor?
 >
 > The input record of the processor. I assume that is the most user
 > friendly record in this context.
 >
 >> 7.
 >> Could you please add the packages of the Java classes/interfaces/enums
 >> you want to add?
 >
 > Done, without any surprises: package org.apache.kafka.streams.errors;
 >
 >
 > Thanks a lot for your reviews! Cheers,
 > Damien

> This email was screened for spam and malicious content but exercisecaution anyway.

 >
 >
 >
 >

> On Tue, 9 Apr 2024 at 18:04, Bill Bejeck<bbej...@gmail.com<mailto:bbej...@gmail.com>> wrote:

 >
 >> Hi Damien, Sebastien and Loic,
 >>
 >> Thanks for the KIP, this is a much-needed addition.
 >> I like the approach of getting the plumbing in for handling processor
 >> errors, allowing users to implement more complex solutions as needed.
 >>
 >> Overall how where the KIP Is now LGTM, modulo outstanding comments. I

>> think adding the example you included in this thread to the KIP is agreat

 >> idea.
 >>
 >> Regarding the metrics, I'm thinking along the same lines as Bruno. I'm

>> wondering if we can make do with a task-level metric at the INFOlevel and>> the processor metric at DEBUG. IMHO, when it comes to trackingexceptions>> in processing, these two areas are where users will want to focus,higher

 >> level metrics wouldn't be as useful in this case.
 >>
 >> Thanks,
 >> Bill
 >>

>> On Tue, Apr 9, 2024 at 6:54 AM Bruno Cadonna<cado...@apache.org<mailto:cado...@apache.org>> wrote:

 >>
 >>> Hi again,
 >>>
 >>> I have additional questions/comments.
 >>>
 >>> 6.
 >>> What record is exactly passed to the handler?
 >>> Is it the input record to the task? Is it the input record to the
 >>> processor node? Is it the input record to the processor?
 >>>
 >>>
 >>> 7.
 >>> Could you please add the packages of the Java classes/interfaces/enums
 >>> you want to add?
 >>>
 >>>
 >>> Best,
 >>> Bruno
 >>>
 >>>
 >>> On 4/9/24 10:17 AM, Bruno Cadonna wrote:
 >>>> Hi Loïc, Damien, and Sébastien,
 >>>>
 >>>> Thanks for the KIP!
 >>>> I find it really great that you contribute back to Kafka Streams
 >>>> concepts you developed for kstreamplify so that everybody can take
 >>>> advantage from your improvements.
 >>>>
 >>>> I have a couple of questions/comments:
 >>>>
 >>>> 1. and 2.
 >>>> I am wondering whether we should expose the processor node ID -- which
 >>>> basically is the processor node name -- in the ProcessingContext
 >>>> interface. I think the processor node ID fits well in the
 >>>> ProcessingContext interface since it already contains application ID
 >> and
 >>>> task ID and it would make the API for the handler cleaner.
 >>>>
 >>>>
 >>>> 3.
 >>>> Could you elaborate -- maybe with an example -- when a record is in a

>>>> state in which it cannot be serialized? This is not completelyclear to

 >>> me.
 >>>>
 >>>>
 >>>> 4.
 >>>> Regarding the metrics, it is not entirely clear to me what the metric

>>>> measures. Is it the number of calls to the process handler or isit the

 >>>> number of calls to process handler that returned FAIL?

>>>> If it is the former, I was also wondering whether it would bebetter to

 >>>> put the task-level metrics to INFO reporting level and remove the
 >>>> thread-level metric, similar to the dropped-records metric. You can
 >>>> always roll-up the metrics to the thread level in your preferred
 >>>> monitoring system. Or do you think we end up with to many metrics?
 >>>>
 >>>>
 >>>> 5.
 >>>> What do you think about naming the handler ProcessingExceptionHandler
 >>>> instead of ProcessExceptionHandler?
 >>>> The DeserializationExceptionHanlder and the ProductionExceptionHandler
 >>>> also use the noun of the action in their name and not the verb.
 >>>>
 >>>>
 >>>> Best,
 >>>> Bruno
 >>>>
 >>>>
 >>>> On 4/8/24 3:48 PM, Sebastien Viale wrote:
 >>>>> Thanks for your review!
 >>>>>
 >>>>> All the points make sense for us!
 >>>>>
 >>>>>
 >>>>>
 >>>>> We updated the KIP for points 1 and 4.
 >>>>>
 >>>>>
 >>>>>
 >>>>> 2/ We followed the DeserializationExceptionHandler interface
 >>>>> signature, it was not on our mind that the record be forwarded with
 >>>>> the ProcessorContext.
 >>>>>
 >>>>> The ProcessingContext is sufficient, we do expect that most people
 >>>>> would need to access the RecordMetadata.
 >>>>>
 >>>>>
 >>>>>
 >>>>> 3/ The use of Record<Object, Object> is required, as the error could
 >>>>> occurred in the middle of a processor where records could be non
 >>>>> serializable objects
 >>>>>
 >>>>> As it is a global error catching, the user may need little
 >>>>> information about the faulty record.
 >>>>>
 >>>>> Assuming that users want to make some specific treatments to the
 >>>>> record, they can add a try / catch block in the topology.
 >>>>>
 >>>>> It is up to users to cast record value and key in the implementation
 >>>>> of the ProcessorExceptionHandler.
 >>>>>
 >>>>>
 >>>>>
 >>>>> Cheers
 >>>>>
 >>>>> Loïc, Damien and Sébastien
 >>>>>
 >>>>> ________________________________

>>>>> De : Sophie Blee-Goldman<sop...@responsive.dev<mailto:sop...@responsive.dev>>

 >>>>> Envoyé : samedi 6 avril 2024 01:08

>>>>> À : dev@kafka.apache.org<mailto:dev@kafka.apache.org><dev@kafka.apache.org<mailto:dev@kafka.apache.org>>

 >>>>> Objet : [EXT] Re: [DISCUSS] KIP-1033: Add Kafka Streams exception
 >>>>> handler for exceptions occuring during processing
 >>>>>
 >>>>> Warning External sender Do not click on any links or open any
 >>>>> attachments unless you trust the sender and know the content is safe.
 >>>>>
 >>>>> Hi Damien,
 >>>>>
 >>>>> First off thanks for the KIP, this is definitely a much needed
 >>>>> feature. On
 >>>>> the
 >>>>> whole it seems pretty straightforward and I am in favor of the
 >> proposal.
 >>>>> Just
 >>>>> a few questions and suggestions here and there:
 >>>>>

>>>>> 1. One of the #handle method's parameters is "ProcessorNodenode", but

 >>>>> ProcessorNode is an internal class (and would expose a lot of
 >> internals

>>>>> that we probably don't want to pass in to an exception handler).Would

 >>> it
 >>>>> be sufficient to just make this a String and pass in the processor
 >> name?
 >>>>>
 >>>>> 2. Another of the parameters in the ProcessorContext. This would
 >> enable
 >>>>> the handler to potentially forward records, which imo should not be
 >> done

>>>>> from the handler since it could only ever call #forward but notdirect

 >>>>> where
 >>>>> the record is actually forwarded to, and could cause confusion if
 >> users
 >>>>> aren't aware that the handler is effectively calling from the context
 >>>>> of the
 >>>>> processor that threw the exception.
 >>>>> 2a. If you don't explicitly want the ability to forward records, I
 >> would
 >>>>> suggest changing the type of this parameter to ProcessingContext,
 >> which
 >>>>> has all the metadata and useful info of the ProcessorContext but
 >> without
 >>>>> the

>>>>> forwarding APIs. This would also lets us sidestep the followingissue:

 >>>>> 2b. If you *do* want the ability to forward records, setting aside
 >>>>> whether
 >>>>> that
 >>>>> in of itself makes sense to do, we would need to pass in either a
 >>> regular

>>>>> ProcessorContext or a FixedKeyProcessorContext, depending on whatkind>>>>> of processor it is. I'm not quite sure how we could design aclean API

 >>>>> here,

>>>>> so I'll hold off until you clarify whether you even wantforwarding or

 >>>>> not.
 >>>>> We would also need to split the input record into a Record vs
 >>>>> FixedKeyRecord
 >>>>>
 >>>>> 3. One notable difference between this handler and the existing ones
 >> you
 >>>>> pointed out, the Deserialization/ProductionExceptionHandler, is that
 >> the

>>>>> records passed in to those are in serialized bytes, whereas therecord

 >>>>> here would be POJOs. You account for this by making the parameter
 >>>>> type a Record<Object, Object>, but I just wonder how users would be
 >>>>> able to read the key/value and figure out what type it should be. For
 >>>>> example, would they need to maintain a map from processor name to
 >>>>> input record types?
 >>>>>
 >>>>> If you could provide an example of this new feature in the KIP, it
 >>>>> would be

>>>>> very helpful in understanding whether we could do something tomake it

 >>>>> easier for users to use, for if it would be fine as-is
 >>>>>
 >>>>> 4. We should include all the relevant info for a new metric, such as
 >> the
 >>>>> metric
 >>>>> group and recording level. You can look at other metrics KIPs like
 >>>>> KIP-444

>>>>> and KIP-613 for an example. I suspect you intend for this to bein the

 >>>>> processor group and at the INFO level?
 >>>>>
 >>>>> Hope that all makes sense! Thanks again for the KIP
 >>>>>
 >>>>> -Sophie
 >>>>>
 >>>>> On Fri, Mar 29, 2024 at 6:16 AM Damien Gasparina <
 >> d.gaspar...@gmail.com<mailto:d.gaspar...@gmail.com>
 >>>>
 >>>>> wrote:
 >>>>>
 >>>>>> Hi everyone,
 >>>>>>
 >>>>>> After writing quite a few Kafka Streams applications, me and my
 >>>>>> colleagues
 >>>>>> just created KIP-1033 to introduce a new Exception Handler in Kafka
 >>>>>> Streams
 >>>>>> to simplify error handling.
 >>>>>> This feature would allow defining an exception handler to
 >> automatically
 >>>>>> catch exceptions occurring during the processing of a message.
 >>>>>>
 >>>>>> KIP link:
 >>>>>>
 >>>>>>
 >>>

>>https://cwiki.apache.org/confluence/display/KAFKA/KIP-1033%3A+Add+Kafka+Streams+exception+handler+for+exceptions+occuring+during+processing <https://cwiki.apache.org/confluence/display/KAFKA/KIP-1033%3A+Add+Kafka+Streams+exception+handler+for+exceptions+occuring+during+processing><https://cwiki.apache.org/confluence/display/KAFKA/KIP-1033%3A+Add+Kafka+Streams+exception+handler+for+exceptions+occuring+during+processing <https://cwiki.apache.org/confluence/display/KAFKA/KIP-1033%3A+Add+Kafka+Streams+exception+handler+for+exceptions+occuring+during+processing>>

 >>> <
 >>>

 >>>>
 >>>>>>
 >>>>>> Feedbacks and suggestions are welcome,
 >>>>>>
 >>>>>> Cheers,
 >>>>>> Damien, Sebastien and Loic
 >>>>>>
 >>>>>
 >>>>> This email was screened for spam and malicious content but exercise
 >>>>> caution anyway.
 >>>>>
 >>>>>
 >>>
 >>

Re: [DISCUSS] KIP-1033: Add Kafka Streams exception handler for exceptions occuring during processing

Reply via email to