Hi,
My topology is like the following ->
*Spout -> Bolt A -> Bolt B -> Bolt C -> Bolt D*

the groupings between* Bolt C -> Bolt D* is a field grouping as Bolt D is
doing an aggregation while everything else is a shuffleGrouping.

Use Case:
If the spout emits 100K tuples such that the emits all are grouped on the
same field, then Bolt D will need to take all the load and hence becomes
very slow. IN this case, increasing the no. of instances of Bolt D will not
help as the grouping is for the instance of Bolt D.

Question: How can this be optimized?

Did anyone face such a use case? Please recommend.

Thanks
Kashyap

Reply via email to