Hi, I am trying to build a real-time anomaly detection system using Spark, Kafka, Cassandra, and Akka. I have a network intrusion dataset (KDD Cup 1999). How can I build the system with it? My understanding is that I would treat one part of the data as historical data for model training and simulate the rest as a stream of data coming through Kafka.
Should I use Spark Streaming to retrain the model on the incoming stream? And how can I use Akka here for alerting?
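
To make the setup concrete, here is a minimal sketch of the historical/stream split I have in mind, in plain Python with no Kafka or Spark involved. The function names, the split fraction, and the batch size are placeholder choices of mine, and the sample rows are made-up stand-ins rather than real KDD records. In the real system I assume each batch would be published to a Kafka topic instead of being yielded from a generator.

```python
from typing import Iterator, List, Tuple


def split_historical_and_stream(
    records: List[str], train_fraction: float = 0.5
) -> Tuple[List[str], List[str]]:
    """Split records by position: the first part for training, the rest for replay.

    train_fraction is a hypothetical parameter; what a sensible value is
    depends on the dataset and the model.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = int(len(records) * train_fraction)
    return records[:cut], records[cut:]


def replay_in_batches(records: List[str], batch_size: int = 3) -> Iterator[List[str]]:
    """Yield the held-out records in small batches to imitate an incoming stream.

    In the actual pipeline this is where a Kafka producer would send each
    batch (or each record) to a topic; here the batches are only yielded.
    """
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    for start in range(0, len(records), batch_size):
        yield records[start:start + batch_size]


if __name__ == "__main__":
    # Placeholder rows, not real KDD Cup 1999 data.
    rows = [f"row-{i}" for i in range(8)]
    historical, to_stream = split_historical_and_stream(rows, train_fraction=0.5)
    print("train on:", historical)
    for batch in replay_in_batches(to_stream, batch_size=3):
        print("stream batch:", batch)
```

The open part of my question is what should happen on the consuming side of those batches: whether the model is only applied to them, or also retrained on them.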