[ 
https://issues.apache.org/jira/browse/KAFKA-3534?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Eno Thereska updated KAFKA-3534:
--------------------------------
    Assignee:     (was: Guozhang Wang)

> Deserialize on demand when default time extractor used
> ------------------------------------------------------
>
>                 Key: KAFKA-3534
>                 URL: https://issues.apache.org/jira/browse/KAFKA-3534
>             Project: Kafka
>          Issue Type: Improvement
>          Components: streams
>    Affects Versions: 0.9.0.1
>            Reporter: Michael Coon
>            Priority: Minor
>              Labels: performance
>
> When records are added to the RecordQueue, they are deserialized at that time 
> in order to extract the timestamp. But for some data flows where large 
> messages are consumed (particularly compressed messages), this can result in 
> large spikes in memory as all messages must be deserialized prior to 
> processing (and getting out of memory). An optimization might be to only 
> require deserialization at this stage if a non-default timestamp extractor is 
> being used.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to