[ https://issues.apache.org/jira/browse/METRON-1968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Ryan Merriman updated METRON-1968:
----------------------------------
    Fix Version/s: Next + 1

> Messages are lost when a parser produces multiple messages and batch size is greater than 1
> -------------------------------------------------------------------------------------------
>
>                 Key: METRON-1968
>                 URL: https://issues.apache.org/jira/browse/METRON-1968
>             Project: Metron
>          Issue Type: Bug
>            Reporter: Ryan Merriman
>            Assignee: Ryan Merriman
>            Priority: Major
>             Fix For: Next + 1
>
>          Time Spent: 15h 20m
>  Remaining Estimate: 0h
>
> A bug was discovered where messages are lost when a parser produces multiple
> messages. This happens whenever the batch size for that sensor is set to
> greater than 1. For example, consider a parser that produces 30 messages
> from a single input message. Assume the batch size for this sensor/parser is
> set to 10. The batch is currently flushed only after 10 tuples are received
> and only 10 messages are written out. I think the correct behavior would be
> for 3 batches of 10 messages to be flushed for every tuple and a total of 300
> messages written for every 10 tuples.
> This is happening because the various writer classes/interfaces
> (BulkWriterComponent, BulkMessageWriter, KafkaWriter, etc.) assume a 1-to-1
> relationship between messages and tuples. The root cause of this specific
> issue is
> [here|https://github.com/apache/metron/blob/master/metron-platform/metron-writer/src/main/java/org/apache/metron/writer/kafka/KafkaWriter.java#L236].

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
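
The arithmetic in the report (30 messages per tuple, batch size 10) can be checked with a small stand-alone model. This is only an illustrative sketch in Python, not Metron code: the function names are hypothetical, and the way messages get dropped here (a buffer keyed by tuple, so each tuple keeps a single message) is an assumption chosen to reproduce the 1-to-1 behaviour described above, not a reading of the actual KafkaWriter source.

```python
from typing import Dict, List

Message = str


def parse(tuple_id: int, messages_per_tuple: int) -> List[Message]:
    """Hypothetical parser: one input tuple yields several messages."""
    return [f"t{tuple_id}-m{i}" for i in range(messages_per_tuple)]


def write_one_to_one(num_tuples: int, messages_per_tuple: int,
                     batch_size: int) -> List[List[Message]]:
    """Model of the reported behaviour (assumed mechanism).

    The buffer holds one message per tuple and is flushed once
    batch_size tuples have arrived, so extra messages are dropped.
    """
    flushed: List[List[Message]] = []
    pending: Dict[int, Message] = {}  # tuple id -> single message
    for t in range(num_tuples):
        for msg in parse(t, messages_per_tuple):
            pending[t] = msg  # later messages replace earlier ones
        if len(pending) >= batch_size:
            flushed.append(list(pending.values()))
            pending = {}
    return flushed


def write_per_message(num_tuples: int, messages_per_tuple: int,
                      batch_size: int) -> List[List[Message]]:
    """Model of the behaviour the report asks for.

    Every message is buffered and a batch is flushed as soon as
    batch_size messages have accumulated, regardless of tuple count.
    """
    flushed: List[List[Message]] = []
    pending: List[Message] = []
    for t in range(num_tuples):
        for msg in parse(t, messages_per_tuple):
            pending.append(msg)
            if len(pending) >= batch_size:
                flushed.append(pending)
                pending = []
    return flushed


lost = write_one_to_one(10, 30, 10)
expected = write_per_message(10, 30, 10)
print(len(lost), sum(len(b) for b in lost))          # prints "1 10"
print(len(expected), sum(len(b) for b in expected))  # prints "30 300"
```

With 10 tuples, the 1-to-1 model flushes a single batch of 10 messages, while the per-message model flushes 30 batches (3 per tuple) for 300 messages in total, matching the numbers in the description.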