So I have a large data structure that I want to broadcast to my executors. It is so large that it makes sense to share access to the object between multiple tasks, so I create my executors with multiple cores. Unfortunately, it looks like the object is not shared between threads, but is copied once for each thread.
Is my assumption right (1 copy per thread) and is there any way I can force a broadcast to be shared between threads? Using 1.4.1 Thanks! --------------------------------------------------------------------- To unsubscribe, e-mail: user-unsubscr...@spark.apache.org For additional commands, e-mail: user-h...@spark.apache.org