Starting from the checkpoint using getOrCreate should be sufficient if all
you need is at-least-once semantics

http://spark.apache.org/docs/latest/streaming-programming-guide.html#checkpointing

On Mon, Nov 30, 2015 at 9:38 AM, Guillermo Ortiz <konstt2...@gmail.com>
wrote:

> Hello,
>
> I have Spark and Kafka with directStream. I'm trying that if Spark dies it
> could process all those messages when it starts.  The offsets are stored in
> chekpoints but I don't know how I could say to Spark to start in that point.
> I saw that there's another createDirectStream method with a fromOffsets
> parameter but, how could I access to the offsets?
>
> val ssc = new StreamingContext(sparkConf, Seconds(5))
> ssc.checkpoint(checkpoint)
> val directKafkaStream = KafkaUtils.createDirectStream[String, String, 
> StringDecoder, StringDecoder](ssc, kafkaBrokers, topic)
>
>

Reply via email to