Do you want to calculate it once and share it with all executors? Then a 
broadcast variable may be of interest to you.
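
A minimal sketch of that approach, assuming the "vector of all words" is a 
sorted tuple of the distinct words and each document becomes a tuple of term 
counts over it. The names to_vector and run_with_spark and the sample 
documents are hypothetical, only for illustration. The vocabulary is collected 
to the driver first, so it has to fit in driver memory.

```python
from collections import Counter


def to_vector(doc, vocab):
    """Count how often each vocabulary word occurs in doc (hypothetical helper)."""
    counts = Counter(doc.split())
    return tuple(counts[word] for word in vocab)


def run_with_spark():
    """Build the vocabulary once, broadcast it, and reuse it on the executors."""
    # Imported here so that to_vector can be tried without a Spark installation.
    from pyspark import SparkContext

    sc = SparkContext.getOrCreate()
    # Hypothetical sample data; in practice this would come from sc.textFile(...).
    docs = sc.parallelize(["spark is fast", "spark shares data"])

    # Distinct words are computed on the executors and collected to the driver.
    vocab = tuple(sorted(docs.flatMap(lambda d: d.split()).distinct().collect()))

    # Ship the tuple to every executor once instead of once per task.
    vocab_bc = sc.broadcast(vocab)

    # Executors read the shared tuple through .value.
    vectors = docs.map(lambda d: to_vector(d, vocab_bc.value)).collect()

    sc.stop()
    return vocab, vectors
```

Calling run_with_spark() needs a working PySpark installation. A broadcast 
variable is read-only on the executors, which suits a vocabulary that does not 
change after it has been computed.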

> On 22. Sep 2018, at 16:33, Soheil Pourbafrani <soheil.i...@gmail.com> wrote:
> 
> Hi, I want to do some processing with PySpark and save the results in a 
> variable of type tuple that should be shared among the executors for further 
> processing.
> Actually, it's a text mining task and I want to use the Vector Space 
> Model. So I want to calculate the vector of all words (which should be 
> reachable by all executors) and save it in a tuple. Is that possible in Spark, 
> or should I use external storage like a database or files?
> 
> 
