Re: Akka Stream as the source for Spark Streaming. Please advice...

2016-11-12 Thread Sean McKibben
I think one of the advantages of using akka-streams within Spark is the fact that it is a general purpose stream processing toolset with backpressure, not necessarily specific to kafka. If things work out with the approach, Spark could be a great benefit to use as a coordination framework for

Re: Instability issues with Spark 2.0.1 and Kafka 0.10

2016-11-12 Thread Sean McKibben
poll.ms/> to 30 seconds and > > still get the issue. > > Something is not right here and just not seem right. As I mentioned with the > > streaming application, with Spark 1.6 and Kafka 0.8.x we never saw this > > issue. We have been running the same basic logic for over

Re: Instability issues with Spark 2.0.1 and Kafka 0.10

2016-11-07 Thread Sean McKibben
I've been encountering the same kinds of timeout issues as Ivan, using the "Kafka Stream" approach that he is using, except I'm storing my offsets manually from the driver to Zookeeper in the Kafka 8 format. I haven't yet implemented the KafkaRDD approach, and therefore don't have the