[jira] [Commented] (KAFKA-9450) Decouple inner state flushing from committing with EOS

Matthias J. Sax (Jira) Mon, 27 Jan 2020 15:21:34 -0800


    [ 
https://issues.apache.org/jira/browse/KAFKA-9450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17024742#comment-17024742
 ]


Matthias J. Sax commented on KAFKA-9450:
----------------------------------------

Yes, the ticket was created because the default EOS commit interval of 100ms 
make the situation much worse. But I agree, that fixing this can also benefit 
the non-EOS case. Hence, I don't think we need a separate ticket.

However, for non-EOS, increasing the commit interval to work against this issue 
is actually more viable than for the EOS case (as for the EOS case, it would 
increase the end-to-end processing latency significantly; what it the reason 
for the 100ms default commit interval to begin with).

> Decouple inner state flushing from committing with EOS
> ------------------------------------------------------
>
>                 Key: KAFKA-9450
>                 URL: https://issues.apache.org/jira/browse/KAFKA-9450
>             Project: Kafka
>          Issue Type: Improvement
>          Components: streams
>            Reporter: Sophie Blee-Goldman
>            Priority: Major
>
> When EOS is turned on, the commit interval is set quite low (100ms) and all 
> the store layers are flushed during a commit. This is necessary for 
> forwarding records in the cache to the changelog, but unfortunately also 
> forces rocksdb to flush the current memtable before it's full. The result is 
> a large number of small writes to disk, losing the benefits of batching, and 
> a large number of very small L0 files that are likely to slow compaction.
> Since we have to delete the stores to recreate from scratch anyways during an 
> unclean shutdown with EOS, we may as well skip flushing the innermost 
> StateStore during a commit and only do so during a graceful shutdown, before 
> a rebalance, etc. This is currently blocked on a refactoring of the state 
> store layers to allow decoupling the flush of the caching layer from the 
> actual state store.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

[jira] [Commented] (KAFKA-9450) Decouple inner state flushing from committing with EOS

Reply via email to