Hi All,

Here is a problem and I am wondering if Spark Streaming is the right tool
for this ?

I have stream of messages m1, m2, m3....and each of those messages can be
in state s1, s2, s3,....sn (you can imagine the number of states are about
100) and I want to compute some metrics that visit all the states from s1
to sn but these state transitions can happen at indefinite amount of
time. A simple example of that would be count all messages that visited
state s1, s2, s3. Other words, the transition function should know that say
message m1 had visited state s1 and s2 but not s3 yet and once the message
m1 visits s3 increment the counter +=1 .

If it makes anything easier I can say a message has to visit s1 before
visiting s2 and s2 before visiting s3 and so on but would like to know both
with and without order.

Thanks!

Reply via email to