Re: (de)serialize DStream

2015-07-08 Thread Shixiong Zhu
A DStream must be Serializable because it is written out as part of
metadata checkpointing, which uses Java serialization. But you can use
KryoSerializer for data checkpointing. Data checkpointing goes through
RDD.checkpoint, and its serializer can be set with spark.serializer.
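
To illustrate, here is a minimal sketch of a job that sets spark.serializer
to Kryo and uses updateStateByKey. The app name, master, socket host/port,
and checkpoint directory are made up for the example; only the
spark.serializer setting and the checkpoint call relate to the point above.
The key and value types here (String and Int) are Serializable anyway.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object KryoCheckpointSketch {
  def main(args: Array[String]): Unit = {
    // Hypothetical app name and master; adjust for your cluster.
    val conf = new SparkConf()
      .setAppName("KryoCheckpointSketch")
      .setMaster("local[2]")
      // Serializer used for RDD data, including data checkpointing.
      .set("spark.serializer", "org.apache.spark.serializer.KryoSerializer")

    val ssc = new StreamingContext(conf, Seconds(10))
    // updateStateByKey requires a checkpoint directory (hypothetical path).
    ssc.checkpoint("/tmp/spark-checkpoint")

    // Hypothetical input source: lines of text from a local socket.
    val counts = ssc.socketTextStream("localhost", 9999)
      .flatMap(_.split(" "))
      .map(word => (word, 1))
      .updateStateByKey[Int] { (newValues: Seq[Int], state: Option[Int]) =>
        // Running count per word: previous state plus this batch's values.
        Some(newValues.sum + state.getOrElse(0))
      }

    counts.print()
    ssc.start()
    ssc.awaitTermination()
  }
}
```

The DStream graph itself (including the update function) is still
serialized with Java serialization for the metadata checkpoint, so anything
the closures capture has to be Serializable regardless of this setting.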

Best Regards,
Shixiong Zhu

2015-07-08 3:43 GMT+08:00 Chen Song chen.song...@gmail.com:

 In Spark Streaming, when using updateStateByKey, it requires the generated
 DStream to be checkpointed.

 It seems that it always uses JavaSerializer, no matter what I set for
 spark.serializer. Can I use KryoSerializer for checkpointing? If not, I
 assume the key and value types have to be Serializable?

 Chen



(de)serialize DStream

2015-07-07 Thread Chen Song
In Spark Streaming, when using updateStateByKey, it requires the generated
DStream to be checkpointed.

It seems that it always uses JavaSerializer, no matter what I set for
spark.serializer. Can I use KryoSerializer for checkpointing? If not, I
assume the key and value types have to be Serializable?

Chen