[
https://issues.apache.org/jira/browse/KAFKA-1952?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14322246#comment-14322246
]
Jun Rao commented on KAFKA-1952:
--------------------------------
Uploaded a new patch that addresses the review comments.
Some test results.
Create a 2 node cluster locally. Create a topic X with 3000 partitions and a
replication factor of 2 (wait until all leaders are elected; this can take 5
minutes). Without the patch, when there is no produce load on the cluster, each
broker uses 100% CPU and the localTime for each replica fetch request is about
1 sec. With the patch, each broker uses about 10% CPU and the localTime for
each replica fetch request is about 9ms.
Also tested the end to end latency with the patch. On the above 2 node cluster,
create a topic test with 1 partition and a replication factor of 2.
bin/kafka-run-class.sh kafka.tools.TestEndToEndLatency localhost:9092
localhost:2181 test 5000 500 -1
0 61.206
1000 26.766
2000 38.58
3000 25.647
4000 26.991
Avg latency: 28.5952 ms
Percentiles: 50th = 27, 99th = 41, 99.9th = 52
The latency is bounded by the message commit time. With 3000 partitions, each
replica fetch request take 9ms in localTime and another 9ms in sendTime. So the
time to complete a produce request (with ack=-1) is about 26ms on average. In
any case, the end to end latency is way less than the maxWaitTime in the
regular consumer and replica fetcher (both at 500ms), which is expected.
Also measured the localTime and sendTime for the same replica fetch requests on
0.8.1.1. The numbers are comparable with 0.8.2.0 with the patch.
> High CPU Usage in 0.8.2 release
> -------------------------------
>
> Key: KAFKA-1952
> URL: https://issues.apache.org/jira/browse/KAFKA-1952
> Project: Kafka
> Issue Type: Bug
> Affects Versions: 0.8.2.0
> Reporter: Jay Kreps
> Assignee: Jun Rao
> Priority: Critical
> Fix For: 0.8.2.0
>
> Attachments: kafka-1952.patch, kafka-1952_2015-02-15_15:26:33.patch
>
>
> Brokers with high partition count see increased CPU usage when migrating from
> 0.8.1.1 to 0.8.2.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)