Sachin Goyal created STORM-1686:
-----------------------------------

             Summary: Make local-grouping an independent option rather than 
just LOCAL_OR_SHUFFLE
                 Key: STORM-1686
                 URL: https://issues.apache.org/jira/browse/STORM-1686
             Project: Apache Storm
          Issue Type: Improvement
            Reporter: Sachin Goyal


Currently, Storm distributes the tuples to bolts irrespective of their locality.
So if 1 spout and 5 bolts are running in the same process, storm does not give 
any preference to the local 5 bolts (except for LOCAL_OR_SHUFFLE grouping).

Due to this, there is a lot of inter-machine/inter-process communication for 
sending out the tuples.

It would be really good if local-mode can be made as an independent option 
(*preferLocalBolts*) rather than having just a single mode LOCAL_OR_SHUFFLE 
that can distribute locally.

At least for fields-grouping, the local-mode would make a lot of sense.

Currently, the alternatives to the local-mode-with-fields-grouping is:
# Run several topologies, each with 1 worker so as to limit the spouts/bolts to 
one machine only.
# Have another bolt between a spout and bolt that can be used for directing the 
tuple traffic by directed streams.

But both of these seem to be a lot of work for something that should be made an 
independent option - *preferLocalBolts*.

Also see the following SO content:
# 
[way-to-apply-multiple-groupings-in-storm|http://stackoverflow.com/questions/36368224/is-there-a-way-to-apply-multiple-groupings-in-storm/36374837#36374837]
# 
[fields-grouping-for-same-machine|http://stackoverflow.com/questions/35132136/storm-fields-grouping-for-same-machine?lq=1]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to