Reposting the reply that was posted to the dev mailing list:

For Storm core, windowed bolts would give you the tuples from the last minute, 
but you would have to do the grouping yourself. You could of course use a 
fields grouping to split the load across the windowed bolts. For Trident, you 
might want to take a look at the windowing APIs that were added recently and 
see if they fit your needs. You have to choose between Trident and core based 
on your use case, the guarantees you need, and whether you need batching or 
per-tuple processing.
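
To illustrate what "do the grouping yourself" could look like, here is a 
minimal sketch in plain Java with no Storm dependency. It only shows the 
per-window step: given the readings that fall into one window, add up the 
values per region. The Reading class, the WindowSums class and the 
sumByRegion method are made-up names for illustration, not part of Storm. In 
a real topology this loop would presumably sit inside the windowed bolt, with 
a fields grouping on the region field upstream so that all tuples of one 
region reach the same bolt instance.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

public class WindowSums {

    /** Hypothetical tuple shape (an assumption): one value reported by one region. */
    static final class Reading {
        final String region;
        final double value;

        Reading(String region, double value) {
            this.region = region;
            this.value = value;
        }
    }

    /** Adds up the values of all readings in one window, keyed by region. */
    static Map<String, Double> sumByRegion(List<Reading> window) {
        Map<String, Double> sums = new HashMap<>();
        for (Reading r : window) {
            // merge() inserts the value for a new region, otherwise adds it to the running sum
            sums.merge(r.region, r.value, Double::sum);
        }
        return sums;
    }

    public static void main(String[] args) {
        // Stand-in for the tuples a one-minute window would hand to the bolt
        List<Reading> lastMinute = Arrays.asList(
                new Reading("eu", 2.0),
                new Reading("us", 1.5),
                new Reading("eu", 3.0));

        // One sum per region; this map is what would be written to Redis
        System.out.println(sumByRegion(lastMinute));
    }
}
```

The resulting map holds one entry per region seen in the window, so the 
persistence step can write each entry under a per-region key.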

- Arun

From: Maria Musterfrau
Reply-To: user@storm.apache.org
Date: Wednesday, March 30, 2016 at 10:56 PM
To: user@storm.apache.org
Subject: Fw: Combining group by and time window

Does anyone have an idea?

Thank you in advance.

Regards,
Daniela

Sent: Monday, March 28, 2016 at 9:06 PM
From: "Maria Musterfrau" <daniela_4...@gmx.at>
To: user@storm.apache.org
Subject: Combining group by and time window

Hi,

I have a stream with time series data from different regions. I would like to 
group the stream by the different regions and to add up the values of the last 
minute (time window) per region. The sums should be persisted to Redis or 
something similar.

I already found out that Storm Trident provides a group by function to split 
the stream. I think this could be useful.
Storm core provides time windows, so I could use them for the aggregation.

But how can I combine these two components? Or is this not possible?

Would it be useful to do the grouping already in Kafka (with different topics), 
or is it better to do it in Storm?

Thank you in advance.

Regards,
Daniela
