RE: Spark SQL question: why build hashtable for both sides in HashOuterJoin?

2014-09-30 Thread Haopu Wang
Hi, Liquan, thanks for the response. In your example, I think the hash table should be built on the right side, so Spark can iterate through the left side and find matches in the right side from the hash table efficiently. Please comment and suggest, thanks again!

Cluster tests failing

2014-09-30 Thread Debasish Das
Hi, Inside mllib I am running tests using: mvn -Dhadoop.version=2.3.0-cdh5.1.0 -Phadoop-2.3 -Pyarn install The locat tests run fine but cluster tests are failing.. LBFGSClusterSuite: - task size should be small *** FAILED *** org.apache.spark.SparkException: Job aborted due to stage

Re: Spark SQL question: why build hashtable for both sides in HashOuterJoin?

2014-09-30 Thread Liquan Pei
Hi Haopu, How about full outer join? One hash table may not be efficient for this case. Liquan On Mon, Sep 29, 2014 at 11:47 PM, Haopu Wang hw...@qilinsoft.com wrote: Hi, Liquan, thanks for the response. In your example, I think the hash table should be built on the right side, so

Re: Cluster tests failing

2014-09-30 Thread Debasish Das
I have done mvn clean several times... Consistently all the mllib tests that are using LocalClusterSparkContext.scala, they fail !

Re: Cluster tests failing

2014-09-30 Thread Xiangrui Meng
Try to build the assembly jar first. ClusterSuite uses local-cluster mode, which requires the assembly jar. -Xiangrui On Tue, Sep 30, 2014 at 8:23 AM, Debasish Das debasish.da...@gmail.com wrote: I have done mvn clean several times... Consistently all the mllib tests that are using

Re: jenkins downtime/system upgrade wednesday morning, 730am PDT

2014-09-30 Thread shane knapp
https://issues.apache.org/jira/browse/SPARK-3745 On Tue, Sep 30, 2014 at 10:22 AM, shane knapp skn...@berkeley.edu wrote: (this time, reply to all) nice catch. there's a bug in spark/dev/check-license, which i've confirmed from the CLI. i'll open a bug and PR to fix it. On Mon, Sep 29,

Re: jenkins downtime/system upgrade wednesday morning, 730am PDT

2014-09-30 Thread shane knapp
reminder: this is happening tomorrow morning. i will be putting jenkins in to quiet mode at ~7am, and then doing the upgrade once any stray builds finish. On Mon, Sep 29, 2014 at 1:43 PM, shane knapp skn...@berkeley.edu wrote: happy monday, everyone! remember a few weeks back when i