Hi Folks, I've been a bit delayed working on the graph sink for flume (https://github.com/skanjila/flume-ng-graphstore-sink) , in the meantime I was wondering if there's been any thought or interest in connecting flume to spark, I have a potential use case where we need to extract data out of multiple data sources, do a set of transformations on this data and then dump this data to a columnar store for downstream processing through a Revoscale R cluster which uses spark underneath. I'd be interested in leading this effort if there's enough interest in the community around use cases for this.
[https://avatars0.githubusercontent.com/u/674374?v=3&s=400]<https://github.com/skanjila/flume-ng-graphstore-sink> skanjila/flume-ng-graphstore-sink<https://github.com/skanjila/flume-ng-graphstore-sink> github.com flume-ng-graphstore-sink - A flume sink that writes to a set of graph databases Look forward to hearing from folks.
