Hi Saikat, Have you considered this approach,
http://spark.apache.org/docs/latest/streaming-flume-integration.html <http://spark.apache.org/docs/latest/streaming-flume-integration.htmlhttp://stdatalabs.blogspot.in/2016/09/spark-streaming-part-2-real-time_10.html> Thanks, Johny On Thu, Feb 2, 2017 at 5:27 PM, Saikat Kanjilal <sxk1...@hotmail.com> wrote: > Hi Flume community, > Would love to have inputs on this topic as this is a pertinent usecase > that I'm exploring at work. > Regards > > Sent from my iPhone > > On Feb 1, 2017, at 4:36 PM, Saikat Kanjilal <sxk1...@hotmail.com<mailto:sx > k1...@hotmail.com>> wrote: > > > Bump [😊] > > > ________________________________ > From: Saikat Kanjilal <sxk1...@hotmail.com<mailto:sxk1...@hotmail.com>> > Sent: Wednesday, February 1, 2017 8:46 AM > To: dev@flume.apache.org<mailto:dev@flume.apache.org> > Subject: Flume+ML [Discussion] > > Hi Folks, > > I've been a bit delayed working on the graph sink for flume ( > https://github.com/skanjila/flume-ng-graphstore-sink) , in the meantime I > was wondering if there's been any thought or interest in connecting flume > to spark, I have a potential use case where we need to extract data out of > multiple data sources, do a set of transformations on this data and then > dump this data to a columnar store for downstream processing through a > Revoscale R cluster which uses spark underneath. I'd be interested in > leading this effort if there's enough interest in the community around use > cases for this. > [https://avatars0.githubusercontent.com/u/674374?v=3&s=400]<https:// > github.com/skanjila/flume-ng-graphstore-sink> > > skanjila/flume-ng-graphstore-sink<https://github.com/ > skanjila/flume-ng-graphstore-sink> > github.com<http://github.com> > flume-ng-graphstore-sink - A flume sink that writes to a set of graph > databases > > > > > [https://avatars0.githubusercontent.com/u/674374?v=3&s=400]<https:// > github.com/skanjila/flume-ng-graphstore-sink> > > skanjila/flume-ng-graphstore-sink<https://github.com/ > skanjila/flume-ng-graphstore-sink> > github.com<http://github.com> > flume-ng-graphstore-sink - A flume sink that writes to a set of graph > databases > > Look forward to hearing from folks. >