So I have a large data structure that I want to broadcast to my executors. It 
is so large that it makes sense to share access to the object between multiple 
tasks, so I create my executors with multiple cores. Unfortunately, it looks 
like the object is not shared between threads, but is copied once for each 
thread.

Is my assumption right (1 copy per thread) and is there any way I can force a 
broadcast to be shared between threads?

Using 1.4.1

Thanks!


---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

Reply via email to