Hello to all,
I am new to apache storm and have been working with it for the last month
or so. We are trying to design a topology wherein:
- A json message is broken up into multiple parts
- each of these parts is processed in a parallel manner
- the results are aggregated via a Grouping Bolt
This
Hello,
I'm writing a storm topology with a multilang bolt and see that worker is
crashing consistently. My data flow is like this,
KafkaSpout -> KafkaDecoderBolt ->
> RankingBolt(Multilang) -> ...
Problem seems to be with the KafkaDecoderBolt -> RankingBolt pipeline.
On
Hi Team,
I am trying to understand ParallelismHint in Trident Topology but somehow
not getting anywhere close.. It will be great if someone can help me
Also, kindly explain the difference between MasterCorrdinator and
SpoutCoordinator...
Things that i have tried so far:
a. I created a simple
Hello all,
I have been trying to calculate, as precisely as possible, how much time a
tuple spends waiting in the input queue of a Bolt until it is provided in
the BaseRichBolt.execute() function.
The way I try to get this wait time (latency) is by adding a timestamp
attribute in the tuple, the t
Here are additional logs. System is loosing messages about expected tuples
count (tracked.expectedTupleCount) and replays those that are lost only
after 30 seconds (on 35 _system tick). Max spout pending parameter has
vicarious influence on this delay. If I make it too small, batch would be
replaye