Offset checkpoints (partition, offset) when using kafka direct streaming approach
On Friday, October 2, 2015, Tathagata Das <t...@databricks.com> wrote: > Which checkpointing are you talking about? DStream checkpoints (which > saves the DAG of DStreams, that is, only metadata), or RDD checkpointing > (which saves the actual intermediate RDD data) > > TD > > On Fri, Oct 2, 2015 at 2:56 PM, Sourabh Chandak <sourabh3...@gmail.com > <javascript:_e(%7B%7D,'cvml','sourabh3...@gmail.com');>> wrote: > >> Tried using local checkpointing as well, and even that becomes slow after >> sometime. Any idea what can be wrong? >> >> Thanks, >> Sourabh >> >> On Fri, Oct 2, 2015 at 9:35 AM, Sourabh Chandak <sourabh3...@gmail.com >> <javascript:_e(%7B%7D,'cvml','sourabh3...@gmail.com');>> wrote: >> >>> I can see the entries processed in the table very fast but after that it >>> takes a long time for the checkpoint update. >>> >>> Haven't tried other methods of checkpointing yet, we are using DSE on >>> Azure. >>> >>> Thanks, >>> Sourabh >>> >>> On Fri, Oct 2, 2015 at 6:52 AM, Cody Koeninger <c...@koeninger.org >>> <javascript:_e(%7B%7D,'cvml','c...@koeninger.org');>> wrote: >>> >>>> Why are you sure it's checkpointing speed? >>>> >>>> Have you compared it against checkpointing to hdfs, s3, or local disk? >>>> >>>> On Fri, Oct 2, 2015 at 1:17 AM, Sourabh Chandak <sourabh3...@gmail.com >>>> <javascript:_e(%7B%7D,'cvml','sourabh3...@gmail.com');>> wrote: >>>> >>>>> Hi, >>>>> >>>>> I have a receiverless kafka streaming job which was started yesterday >>>>> evening and was running fine till 4 PM today. Suddenly post that writing >>>>> of >>>>> checkpoint has slowed down and it is now not able to catch up with the >>>>> incoming data. We are using the DSE stack with Spark 1.2 and Cassandra for >>>>> checkpointing. Spark streaming is done using a backported code. >>>>> >>>>> Running nodetool shows that the Read latency of the cfs keyspace is >>>>> ~8.5 ms. >>>>> >>>>> Can someone please help me resolve this? >>>>> >>>>> Thanks, >>>>> Sourabh >>>>> >>>>> >>>> >>> >> >