Re: Fighting against performance: JDBC RDD badly distributed

2015-07-28 Thread shenyan zhen
Hi Saif, Are you using JdbcRDD directly from Spark? If yes, then the poor distribution could be due to the bound key you used. See the JdbcRDD Scala doc at https://spark.apache.org/docs/latest/api/scala/index.html#org.apache.spark.rdd.JdbcRDD : sql the text of the query. The query must contain

RE: Fighting against performance: JDBC RDD badly distributed

2015-07-28 Thread Saif.A.Ellafi
. Saif From: shenyan zhen [mailto:shenya...@gmail.com] Sent: Tuesday, July 28, 2015 4:16 PM To: Ellafi, Saif A. Cc: user@spark.apache.org Subject: Re: Fighting against performance: JDBC RDD badly distributed Hi Saif, Are you using JdbcRDD directly from Spark? If yes, then the poor distribution could

Re: Fighting against performance: JDBC RDD badly distributed

2015-07-28 Thread shenyan zhen
:* Re: Fighting against performance: JDBC RDD badly distributed Hi Saif, Are you using JdbcRDD directly from Spark? If yes, then the poor distribution could be due to the bound key you used. See the JdbcRDD Scala doc at https://spark.apache.org/docs/latest/api/scala/index.html