[GitHub] spark pull request: [SPARK-12392] Optimize a location order of bro...

2015-12-16 Thread maropu
Github user maropu commented on the pull request: https://github.com/apache/spark/pull/10346#issuecomment-165349453 I did quick benchmarks for large broadcasts; - aws m4.x4large x 4, 4 works in a host - elapsed time: -- w/opt.: 6.887943434s, w/o opt.: 11.738593435s - rec

[GitHub] spark pull request: [SPARK-12392] Optimize a location order of bro...

2015-12-16 Thread SparkQA
Github user SparkQA commented on the pull request: https://github.com/apache/spark/pull/10346#issuecomment-165349326 **[Test build #47902 has started](https://amplab.cs.berkeley.edu/jenkins/job/SparkPullRequestBuilder/47902/consoleFull)** for PR 10346 at commit [`c396640`](https://gi

[GitHub] spark pull request: [SPARK-12392] Optimize a location order of bro...

2015-12-16 Thread maropu
GitHub user maropu opened a pull request: https://github.com/apache/spark/pull/10346 [SPARK-12392] Optimize a location order of broadcast blocks by considering preferred local hosts When multiple workers exist in a host, we can bypass unnecessary remote access for broadcasts; block