I am launching a rather large cluster on ec2. It seems like the launch is taking forever on .... Setting up spark RSYNC'ing /root/spark to slaves... ...
It seems that bittorrent might be a faster way to replicate the sizeable spark directory to the slaves particularly if there is a lot of not very powerful slaves. Just a thought ... cheers Daniel