New to apache storm

2015-04-21 Thread Hasan Riaz
Hello to all, I am new to apache storm and have been working with it for the last month or so. We are trying to design a topology wherein: - A json message is broken up into multiple parts - each of these parts is processed in a parallel manner - the results are aggregated via a Grouping Bolt This

Storm Multilang -- Which queue am I blowing up?

2015-04-21 Thread Srikanth
Hello, I'm writing a storm topology with a multilang bolt and see that worker is crashing consistently. My data flow is like this, KafkaSpout -> KafkaDecoderBolt -> > RankingBolt(Multilang) -> ... Problem seems to be with the KafkaDecoderBolt -> RankingBolt pipeline. On

Fwd: Storm Trident Topology -- ParallelismHint

2015-04-21 Thread nitin sharma
Hi Team, I am trying to understand ParallelismHint in Trident Topology but somehow not getting anywhere close.. It will be great if someone can help me Also, kindly explain the difference between MasterCorrdinator and SpoutCoordinator... Things that i have tried so far: a. I created a simple

How to check the time a tuple spends waiting in an incoming Queue until it gets processed

2015-04-21 Thread Nick R. Katsipoulakis
Hello all, I have been trying to calculate, as precisely as possible, how much time a tuple spends waiting in the input queue of a Bolt until it is provided in the BaseRichBolt.execute() function. The way I try to get this wait time (latency) is by adding a timestamp attribute in the tuple, the t

Re: Delay at Trident topology startup.

2015-04-21 Thread Vladimir Protsenko
Here are additional logs. System is loosing messages about expected tuples count (tracked.expectedTupleCount) and replays those that are lost only after 30 seconds (on 35 _system tick). Max spout pending parameter has vicarious influence on this delay. If I make it too small, batch would be replaye