.
Thanksamp;Best regards!
San.Luo
- 原始邮件 -
发件人:Cheng, Hao hao.ch...@intel.com
收件人:Cheng, Hao hao.ch...@intel.com, luohui20...@sina.com
luohui20...@sina.com, Olivier Girardot ssab...@gmail.com, user
user@spark.apache.org
主题:RE: 回复:Re: sparksql running slow
...@intel.com
收件人:luohui20...@sina.com luohui20...@sina.com, Wang, Daoyuan
daoyuan.w...@intel.com, Olivier Girardot ssab...@gmail.com, user
user@spark.apache.org
主题:RE: 回复:Re: sparksql running slow while joining_2_tables.
日期:2015年05月05日 20点50分
56mb / 26mb is very small size, do you observe data skew? More
...@intel.com;
ssab...@gmail.com; user@spark.apache.org
Subject: 回复:回复:RE: 回复:Re: sparksql running slow while joining_2_tables.
update status after i did some tests. I modified some other parameters, found 2
parameters maybe relative.
spark_worker_instance and spark.sql.shuffle.partitions
before Today I
, Hao; Wang, Daoyuan; Olivier Girardot; user
Subject: 回复:Re: sparksql running slow while joining_2_tables.
Hi guys,
attache the pic of physical plan and logs.Thanks.
Thanksamp;Best regards!
罗辉 San.Luo
- 原始邮件 -
发件人:Cheng, Hao hao.ch
You can use
Explain extended select ….
From: luohui20...@sina.com [mailto:luohui20...@sina.com]
Sent: Tuesday, May 05, 2015 9:52 AM
To: Cheng, Hao; Olivier Girardot; user
Subject: 回复:RE: 回复:Re: sparksql running slow while joining_2_tables.
As I know broadcastjoin is automatically enabled
I assume you’re using the DataFrame API within your application.
sql(“SELECT…”).explain(true)
From: Wang, Daoyuan
Sent: Tuesday, May 5, 2015 10:16 AM
To: luohui20...@sina.com; Cheng, Hao; Olivier Girardot; user
Subject: RE: 回复:RE: 回复:Re: sparksql running slow while joining_2_tables.
You can use
slow while joining_2_tables.
日期:2015年05月05日 08点38分
Or, have you ever try broadcast join?
From: Cheng, Hao [mailto:hao.ch...@intel.com]
Sent: Tuesday, May 5, 2015 8:33 AM
To: luohui20...@sina.com; Olivier Girardot; user
Subject: RE: 回复:Re: sparksql running slow while joining 2 tables