Hi Folks,

I've been a bit delayed working on the graph sink for flume 
(https://github.com/skanjila/flume-ng-graphstore-sink) , in the meantime I was 
wondering if there's been any thought or interest in connecting flume to spark, 
I have a potential use case where we need to extract data out of multiple data 
sources, do a set of transformations on this data and then dump this data to a 
columnar store for downstream processing through a Revoscale R cluster which 
uses spark underneath.  I'd be interested in leading this effort if there's 
enough interest in the community around use cases for this.

[https://avatars0.githubusercontent.com/u/674374?v=3&s=400]<https://github.com/skanjila/flume-ng-graphstore-sink>

skanjila/flume-ng-graphstore-sink<https://github.com/skanjila/flume-ng-graphstore-sink>
github.com
flume-ng-graphstore-sink - A flume sink that writes to a set of graph databases

Look forward to hearing from folks.

Reply via email to