Before torrent, http is the default way for broadcasting. The driver holds the data and the executors request the data via http, making the driver the bottleneck if the data is large. -Xiangrui
On Tue, Jul 29, 2014 at 10:32 AM, durin <m...@simon-schaefer.net> wrote: > Development is really rapid here, that's a great thing. > > Out of curiosity, how did communication work before torrent? Did everything > have to go back to the master / driver first? > > > > -- > View this message in context: > http://apache-spark-user-list.1001560.n3.nabble.com/KMeans-expensiveness-of-large-vectors-tp10614p10870.html > Sent from the Apache Spark User List mailing list archive at Nabble.com.