Hello,
Thanks for your answer.
On 05/06/2018 11:24, Saisai Shao wrote:
spark.streaming.concurrentJobs is a driver-side internal
configuration; it controls how many streaming jobs can be
submitted concurrently in one batch. Usually this should not be
configured by users, unless you're familiar with Spark Streaming
internals and know the implications of this configuration.
Where can I find documentation about those implications?
I've experimented with several values for this parameter and found
that my overall throughput increases along with it.
But I'm running into scalability issues: with more than 16 receivers
spread over 8 executors, the executors no longer receive work from the
driver and sit idle.
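
For context, here is a simplified sketch of how the job is wired up (the batch interval, the socket receivers and the host names below are placeholders, not my real sources):

import org.apache.spark.SparkConf
import org.apache.spark.storage.StorageLevel
import org.apache.spark.streaming.{Seconds, StreamingContext}

object StreamingSketch {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("StreamingSketch")
      // Undocumented driver-side setting: number of streaming jobs
      // submitted concurrently per batch (the property discussed above).
      .set("spark.streaming.concurrentJobs", "4")

    // Batch interval is a placeholder value.
    val ssc = new StreamingContext(conf, Seconds(10))

    // 16 receiver-based input streams, spread over 8 executors
    // (socket receivers and host names are placeholders).
    val streams = (1 to 16).map { i =>
      ssc.socketTextStream(s"source-host-$i", 9999,
        StorageLevel.MEMORY_AND_DISK_SER)
    }

    // Union the input streams and process them together.
    val unioned = ssc.union(streams)
    unioned.count().print()

    ssc.start()
    ssc.awaitTermination()
  }
}
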
Is there an explanation for this behaviour?
Thanks,
Thomas