Github user roshannaik commented on the issue:
https://github.com/apache/storm/pull/2502
@HeartSaVioR
I am not seeing the reported excessive CPU usage with 4 workers.
I first took some runs on my mid-2015 MacBook Pro (single CPU socket).
Since no issue showed up there, I reran the same test on a Linux server (2 sockets, 6
physical cores per socket).
I ran the following command:
{code}
bin/storm jar storm-loadgen-*.jar
org.apache.storm.loadgen.ThroughputVsLatency --rate 85000 --spouts 4
--splitters 4 --counters 4 -c topology.max.spout.pending=5000 -c
topology.workers=4
{code}
All runs were similar, so I am posting results from just one run for each build.
## MacBook Runs:
**Master without 2306** (#ab7b4ca) [**Latency from UI =240ms**]
start(s) | end(s) | rate(tuple/s) | mean(ms) | 99%ile(ms) | 99.9%ile(ms) | cores | mem(MB)
-- | -- | -- | -- | -- | -- | -- | --
0 | 30 | 19,319.467 | 8,411.912 | 13,153.337 | 13,254.001 | 5.275 | 340.719
30 | 60 | 72,613.500 | 16,113.830 | 17,750.295 | 17,884.512 | 6.849 | 364.807
60 | 90 | 75,519.067 | 19,671.741 | 21,474.836 | 21,558.723 | 6.755 | 344.906
90 | 120 | 79,225.167 | 22,121.255 | 23,823.647 | 23,941.087 | 6.878 | 222.115
120 | 150 | 80,967.567 | 23,784.484 | 25,585.254 | 25,652.363 | 6.889 | 365.383
150 | 180 | 80,062.067 | 25,269.534 | 27,665.629 | 27,866.956 | 6.895 | 343.674
180 | 210 | 78,306.300 | 27,560.823 | 30,467.424 | 30,551.310 | 7.025 | 436.814
210 | 240 | 78,586.667 | 29,673.204 | 32,883.343 | 33,000.784 | 6.945 | 293.014
240 | 270 | 80,500.667 | 31,513.715 | 34,795.946 | 34,829.500 | 6.818 | 416.121
270 | 300 | 82,625.667 | 32,883.453 | 35,903.242 | 35,970.351 | 6.899 | 364.058
**Master with 2306** (#09e0123) [**Latency from UI =89ms**]
start(s) | end(s) | rate(tuple/s) | mean(ms) | 99%ile(ms) | 99.9%ile(ms) | cores | mem(MB)
-- | -- | -- | -- | -- | -- | -- | --
0 | 30 | 19,607.100 | 8,127.475 | 14,277.411 | 14,772.339 | 4.861 | 330.486
30 | 60 | 85,946.533 | 12,737.708 | 18,471.715 | 18,589.155 | 6.587 | 418.567
60 | 90 | 91,256.133 | 12,276.112 | 17,901.289 | 18,001.953 | 6.527 | 229.531
90 | 120 | 95,317.967 | 9,204.098 | 14,529.069 | 14,612.955 | 6.552 | 432.803
120 | 150 | 97,220.233 | 5,221.476 | 9,865.003 | 10,125.050 | 6.551 | 169.757
150 | 180 | 92,499.200 | 984.904 | 4,213.178 | 4,546.626 | 6.746 | 280.883
180 | 210 | 79,557.700 | 1,059.939 | 2,619.343 | 2,766.143 | 6.155 | 430.853
210 | 240 | 79,766.467 | 2,336.027 | 5,158.994 | 5,347.738 | 6.238 | 288.708
240 | 270 | 81,595.800 | 4,284.723 | 7,377.781 | 7,528.776 | 6.258 | 315.524
270 | 300 | 88,294.067 | 5,024.767 | 8,422.162 | 8,493.466 | 6.412 | 263.682
## Linux Server Runs:
**Master without 2306** (#ab7b4ca) [**Latency from UI =15ms**]
start_time(s) | end_time(s) | rate(tuple/s) | mean(ms) | 99%ile(ms) | 99.9%ile(ms) | cores | mem(MB)
-- | -- | -- | -- | -- | -- | -- | --
0 | 30 | 56,704.80 | 1,194.20 | 3,867.15 | 3,978.30 | 11.294 | 978.701
30 | 60 | 85,001.70 | 12.018 | 21.479 | 26.837 | 11.931 | 897.105
60 | 90 | 85,007.17 | 11.84 | 21.152 | 26.345 | 11.575 | 950.129
90 | 120 | 85,008.13 | 11.781 | 20.447 | 24.986 | 11.688 | 882.768
120 | 150 | 85,009.57 | 11.841 | 21.021 | 25.592 | 11.735 | 921.339
150 | 180 | 84,997.60 | 11.763 | 20.791 | 25.199 | 11.476 | 961.395
180 | 210 | 85,005.57 | 11.824 | 20.66 | 25.281 | 11.798 | 1,000.03
210 | 240 | 85,008.00 | 11.75 | 20.611 | 25.706 | 11.354 | 1,120.33
240 | 271 | 82,259.16 | 11.916 | 20.955 | 25.117 | 11.517 | 939.918
271 | 301 | 85,003.57 | 11.719 | 20.398 | 24.822 | 11.322 | 980.654
**Master with 2306** (#09e0123) [**Latency from UI =7.8ms**]
start_time(s) | end_time(s) | rate(tuple/s) | mean(ms) | 99%ile(ms) | 99.9%ile(ms) | cores | mem(MB)
-- | -- | -- | -- | -- | -- | -- | --
0 | 30 | 56,701.03 | 426.534 | 2,673.87 | 2,793.41 | 7.693 | 290.006
30 | 60 | 85,004.40 | 3.387 | 8.294 | 14.655 | 5.89 | 231.328
60 | 90 | 85,002.20 | 3.332 | 7.66 | 10.945 | 5.791 | 247.683
90 | 121 | 82,259.87 | 3.33 | 7.516 | 9.839 | 5.591 | 264.324
121 | 151 | 85,004.37 | 3.349 | 7.737 | 10.969 | 5.819 | 272.916
151 | 181 | 85,001.07 | 3.323 | 7.434 | 9.961 | 5.747 | 203.54
181 | 211 | 85,002.57 | 3.335 | 7.586 | 10.281 | 5.794 | 304.567
211 | 241 | 85,005.03 | 3.326 | 7.475 | 9.921 | 5.836 | 317.026
241 | 271 | 85,003.27 | 3.339 | 7.565 | 10.527 | 5.723 | 251.676
271 | 301 | 85,002.00 | 3.351 | 7.799 | 12.394 | 5.716 | 377.182
## Summary:
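**On Linux:** Both CPU and memory usage were significantly lower with 2306: after the first interval, roughly 5.6-5.9 cores vs. 11.3-11.9, and roughly 200-380 MB vs. 880-1,120 MB.
The actual latency (taken from the UI) was also much better with 2306 (7.8 ms vs. 15 ms).
**On MacBook:** CPU and memory usage were relatively close, slightly
favoring 2306. Again, the actual latency was much better with 2306 (89 ms vs. 240 ms).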
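For anyone who wants to check the Linux comparison numerically, here is a minimal sketch that averages the `cores` and `mem(MB)` columns from the two Linux tables above. Skipping the first 0-30 s interval as warm-up is my own assumption, not part of how the benchmark reports its results, and the variable and function names are purely illustrative.

```python
from statistics import mean

# Values copied from the "cores" and "mem(MB)" columns of the Linux tables.
# Assumption: the first (0-30 s) interval is treated as warm-up and left out.
linux_without_2306 = {
    "cores": [11.931, 11.575, 11.688, 11.735, 11.476,
              11.798, 11.354, 11.517, 11.322],
    "mem_mb": [897.105, 950.129, 882.768, 921.339, 961.395,
               1000.03, 1120.33, 939.918, 980.654],
}
linux_with_2306 = {
    "cores": [5.89, 5.791, 5.591, 5.819, 5.747,
              5.794, 5.836, 5.723, 5.716],
    "mem_mb": [231.328, 247.683, 264.324, 272.916, 203.54,
               304.567, 317.026, 251.676, 377.182],
}


def summarize(samples):
    """Return the mean of each metric over the sampled intervals."""
    return {metric: mean(values) for metric, values in samples.items()}


before = summarize(linux_without_2306)
after = summarize(linux_with_2306)

for metric in ("cores", "mem_mb"):
    ratio = before[metric] / after[metric]
    print(f"{metric}: {before[metric]:.2f} -> {after[metric]:.2f} "
          f"({ratio:.1f}x lower with 2306)")
```

The same approach could be applied to the MacBook tables, though the memory column there is noisy enough that a mean over nine intervals should be read with caution.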
---