Spark Streaming Cluster queries

puneetloya Sat, 27 Jan 2018 09:07:44 -0800

Hi All,

I have set up a cluster of one Spark driver and five executors, with Redis
storing the data Spark has processed and S3 used for checkpointing. I have a
couple of questions about this setup.


1) How can I determine which parts of the code execute on the Spark driver
and which parts execute on the executors?
2) Since the Spark driver gets results from the executors, should the
executors have any access to the Redis storage? (My guess is yes, because
executors may need data from Redis for further calculations.)
3) Should the Spark executors have access to the S3 checkpoint storage?
4) Can anyone share their checkpoint recovery strategy for S3?

Thanks,
Puneet



