Hi,
we have a topology with 34 bolts and a single spout, running 101 tasks
in 101 executors (one task per executor).
When running the topology in a single worker, we reach a certain
throughput "x".
Since some of the bolts will be rather memory-intensive, we decided to
split the topology across 4 workers on four physically separate
server machines.
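For context, the setup is submitted roughly like this. This is a
simplified sketch: the class and component names are placeholders, and
I'm assuming the org.apache.storm package namespace (older releases use
backtype.storm):

```java
import org.apache.storm.Config;
import org.apache.storm.StormSubmitter;
import org.apache.storm.topology.TopologyBuilder;

public class MyTopology {  // hypothetical name
    public static void main(String[] args) throws Exception {
        TopologyBuilder builder = new TopologyBuilder();
        // single spout, parallelism 1
        builder.setSpout("spout", new MySpout(), 1);
        // 34 bolts; parallelism hints sum to 101 executors/tasks overall
        builder.setBolt("bolt-1", new MyBolt1(), 3).shuffleGrouping("spout");
        // ... remaining 33 bolts elided ...

        Config conf = new Config();
        conf.setNumWorkers(4);  // was 1 in the single-worker test
        StormSubmitter.submitTopology("my-topology", conf,
                builder.createTopology());
    }
}
```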
When running that 4-worker setup, we see only a slight increase in
throughput. None of the servers is under heavy CPU load, and all bolts
show a capacity below 0.5, according to the web UI.
The spout implementation could easily serve 100 times as many tuples,
so starvation at that level is not the problem.
Am I right in thinking that the bottleneck is not in the topology
implementation but in the inter-worker communication setup? Otherwise
I'd have expected some bolt to run at a capacity of 1.0 or higher.
(We had run the same test on older hardware, where the topology was
able to consume all available CPU. There, spreading across more
hardware led to a significant throughput increase, and "capacity" was
reported as >1 for some bolts.)
How would I proceed to identify the actual bottleneck(s)?
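For example, are settings like the following the first things to look
at? This is just a sketch of what I'd know to try; I'm assuming these
config methods and key names from the Storm releases I've seen, and
the values are made-up starting points, not recommendations:

```java
Config conf = new Config();
// Caps the number of un-acked tuples in flight per spout task; a low
// value can throttle throughput without any bolt showing high capacity.
conf.setMaxSpoutPending(1000);
// Number of acker executors; too few can become a hidden bottleneck.
conf.setNumAckers(4);
// Inter-worker transfer buffer size (key name may differ per release).
conf.put(Config.TOPOLOGY_TRANSFER_BUFFER_SIZE, 1024);
```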
Regards,
Jens