Re: Spark 1.6.1 DataFrame write to JDBC

2016-04-21 Thread Jonathan Gray
in parallel. So you > could be swamping your database. > Which database are you using? > > Also, how many hops? > Network latency could also impact performance too… > > On Apr 19, 2016, at 3:14 PM, Jonathan Gray <jonny.g...@gmail.com> wrote: > > Hi, > &

Re: Spark 1.6.1 DataFrame write to JDBC

2016-04-21 Thread Jonathan Gray
gt; > On Thu, Apr 21, 2016 at 2:15 PM, Takeshi Yamamuro <linguin@gmail.com> > wrote: > >> Hi, >> >> How about trying to increate 'batchsize >> >> On Wed, Apr 20, 2016 at 7:14 AM, Jonathan Gray <jonny.g...@gmail.com> >> wrote: >> &

Spark 1.6.1 DataFrame write to JDBC

2016-04-19 Thread Jonathan Gray
Hi, I'm trying to write ~60 million rows from a DataFrame to a database using JDBC using Spark 1.6.1, something similar to df.write().jdbc(...) The write seems to not be performing well. Profiling the application with a master of local[*] it appears there is not much socket write activity and