Re: about broadcast join of base table in spark sql

2017-07-02 Thread paleyl
spark/sql/functions.html#broadcast(org.apache.spark.sql.DataFrame)> >> spark.apache.org >> Computes the numeric value of the first character of the string column, >> and returns the result as a int column. >> >> >> >> >> -- &g

Fwd: about broadcast join of base table in spark sql

2017-06-30 Thread paleyl
Hi All, Recently I meet a problem in broadcast join: I want to left join table A and B, A is the smaller one and the left table, so I wrote A = A.join(B,A("key1") === B("key2"),"left") but I found that A is not broadcast out, as the shuffle size is still very large. I guess this is a designed

about broadcast join of base table in spark sql

2017-06-28 Thread paleyl
Hi All, Recently I meet a problem in broadcast join: I want to left join table A and B, A is the smaller one and the left table, so I wrote A = A.join(B,A("key1") === B("key2"),"left") but I found that A is not broadcast out, as the shuffle size is still very large. I guess this is a designed