[jira] [Commented] (FLUME-3044) KafkaSink should avoid to call method without timeout param

2017-02-03 Thread Jeff Holoman (JIRA)
[ https://issues.apache.org/jira/browse/FLUME-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15852565#comment-15852565 ] Jeff Holoman commented on FLUME-3044: - Thanks. I will take a look this weekend. Jeff

[jira] [Commented] (FLUME-3044) KafkaSink should avoid to call method without timeout param

2017-02-03 Thread dengkai (JIRA)
[ https://issues.apache.org/jira/browse/FLUME-3044?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15852544#comment-15852544 ] dengkai commented on FLUME-3044: [~jholoman] New review board request created, then who ca

Re: Flume+ML [Discussion]

2017-02-03 Thread Saikat Kanjilal
+1, my only additions would to expand this to make this work with spark sql and provide spark compute context (https://docs.microsoft.com/en-us/azure/hdinsight/hdinsight-hadoop-r-server-compute-contexts) accessibility to the data from the sink, I'd love to take these other bits on if there's en

Re: Flume+ML [Discussion]

2017-02-03 Thread Tristan Stevens
Johny, This is definitely the right way to do this. There’s a Sink available already (from the docs that you provided) at https://github.com/apache/spark/blob/master/external/flume-sink/src/main/scala/org/apache/spark/streaming/flume/sink/SparkSink.scala There’s no reason that we couldn’t distrib