Reply: RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-13 Thread luohui20001
. Thanks & Best regards! San.Luo - Original message - From: Cheng, Hao hao.ch...@intel.com To: Cheng, Hao hao.ch...@intel.com, luohui20...@sina.com luohui20...@sina.com, Olivier Girardot ssab...@gmail.com, user user@spark.apache.org Subject: RE: Reply: Re: sparksql running slow

Reply: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-06 Thread luohui20001
...@intel.com To: luohui20...@sina.com luohui20...@sina.com, Wang, Daoyuan daoyuan.w...@intel.com, Olivier Girardot ssab...@gmail.com, user user@spark.apache.org Subject: RE: Reply: Re: sparksql running slow while joining_2_tables. Date: 2015-05-05 20:50 56 MB / 26 MB is a very small size; do you observe data skew? More

RE: Reply: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-06 Thread java8964
...@intel.com; ssab...@gmail.com; user@spark.apache.org Subject: Reply: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables. Status update after I ran some tests. I modified some other parameters and found 2 that may be relevant: spark_worker_instance and spark.sql.shuffle.partitions. before Today I

RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-05 Thread Cheng, Hao
, Hao; Wang, Daoyuan; Olivier Girardot; user Subject: Reply: Re: sparksql running slow while joining_2_tables. Hi guys, attached are the picture of the physical plan and the logs. Thanks. Thanks & Best regards! 罗辉 San.Luo - Original message - From: Cheng, Hao hao.ch

RE: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-04 Thread Wang, Daoyuan
You can use EXPLAIN EXTENDED SELECT …. From: luohui20...@sina.com [mailto:luohui20...@sina.com] Sent: Tuesday, May 05, 2015 9:52 AM To: Cheng, Hao; Olivier Girardot; user Subject: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables. As far as I know, broadcast join is automatically enabled

Re: sparksql running slow while joining_2_tables.

2015-05-04 Thread Cheng, Hao
I assume you're using the DataFrame API within your application: sql("SELECT…").explain(true) From: Wang, Daoyuan Sent: Tuesday, May 5, 2015 10:16 AM To: luohui20...@sina.com; Cheng, Hao; Olivier Girardot; user Subject: RE: Reply: RE: Reply: Re: sparksql running slow while joining_2_tables. You can use

Reply: RE: Reply: Re: sparksql running slow while joining_2_tables.

2015-05-04 Thread luohui20001
slow while joining_2_tables. Date: 2015-05-05 08:38 Or, have you ever tried a broadcast join? From: Cheng, Hao [mailto:hao.ch...@intel.com] Sent: Tuesday, May 5, 2015 8:33 AM To: luohui20...@sina.com; Olivier Girardot; user Subject: RE: Reply: Re: sparksql running slow while joining 2 tables
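
Editorial illustration: several replies in this thread suggest a broadcast join for the two small tables. The sketch below shows the general idea in plain Python, assuming both tables fit in memory as lists of dicts. It is not Spark's implementation and uses no Spark API; the function name `broadcast_hash_join` and the sample rows are hypothetical.

```python
from collections import defaultdict


def broadcast_hash_join(large_rows, small_rows, key):
    """Inner-join two lists of dicts on `key` by hashing the small side.

    Conceptual sketch only: the small table plays the role of the
    "broadcast" side, so the large table is scanned once and never shuffled.
    """
    # Build phase: index the small table by its join key.
    index = defaultdict(list)
    for row in small_rows:
        index[row[key]].append(row)

    # Probe phase: stream the large table and look up matches.
    joined = []
    for row in large_rows:
        for match in index.get(row[key], []):
            joined.append({**match, **row})
    return joined


# Hypothetical sample data, not taken from the thread.
large = [
    {"id": 1, "amount": 10},
    {"id": 2, "amount": 20},
    {"id": 1, "amount": 30},
    {"id": 3, "amount": 40},  # no partner in `small`, dropped by the inner join
]
small = [{"id": 1, "name": "a"}, {"id": 2, "name": "b"}]

result = broadcast_hash_join(large, small, "id")
```

Because only the small side is hashed, the cost is roughly one pass over each table, which is why this strategy tends to help when one input is small.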