Re: Execute function once on each node

2016-07-19 Thread Josh Asplund
tresata.com> wrote: > >> If you run hdfs on those ssds (with low replication factor) wouldn't it >> also effectively write to local disk with low latency? >> >> On Jul 18, 2016 21:54, "Josh Asplund" <joshaspl...@gmail.com> wrote: >> >> The s

Re: Execute function once on each node

2016-07-19 Thread Josh Asplund
are in right ip/hostname (or fail) and read the content of > the file. > > Not a 100% sure it will work though. > > On Tue, Jul 19, 2016, 2:54 AM Josh Asplund <joshaspl...@gmail.com> wrote: > >> The spark workers are running side-by-side with scientific simulation >> co

Re: Execute function once on each node

2016-07-18 Thread Josh Asplund
The spark workers are running side-by-side with scientific simulation code. The code writes output to local SSDs to keep latency low. Due to the volume of data being moved (10's of terabytes +), it isn't really feasible to copy the data to a global filesystem. Executing a function on each node