Spark sql data frames do they run in parallel by default?

2015-12-06 Thread kali.tumm...@gmail.com
Hi all, I wrote the Spark code below to extract data from SQL Server using Spark's SQLContext.read.format with several different options. My question: does the sqlContext.read load function run in parallel by default, and does it use all the available cores? When I am saving the output to a file it is

Re: Spark sql data frames do they run in parallel by default?

2015-12-06 Thread kali.tumm...@gmail.com
Hi all, I rewrote my code to use sqlContext.read.jdbc, which lets me specify upperBound, lowerBound, numPartitions, etc., and which might run in parallel. I need to try it on a cluster, which I will do when I have time. But please confirm: does read.jdbc do parallel reads? Spark code:- package
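
For readers following the thread, here is a rough sketch of how the lowerBound, upperBound and numPartitions options are generally understood to split a JDBC read into one range query per partition. This is a simplified plain-Python illustration, not Spark's actual code: the function name jdbc_partition_predicates and the column name "id" are hypothetical, and it assumes non-negative integer bounds. Without a partition column and bounds, a JDBC read is expected to run as a single query in a single partition.

```python
def jdbc_partition_predicates(column, lower_bound, upper_bound, num_partitions):
    """Return one WHERE-clause string per partition (None means no filter).

    Simplified sketch of stride-based range partitioning; hypothetical helper,
    not part of the Spark API. Assumes non-negative integer bounds.
    """
    if lower_bound > upper_bound:
        raise ValueError("lower_bound must not exceed upper_bound")
    if num_partitions <= 1 or lower_bound == upper_bound:
        return [None]  # one unfiltered query, so no parallel read

    # The bounds only determine the stride; they do not filter rows out.
    stride = upper_bound // num_partitions - lower_bound // num_partitions
    predicates = []
    current = lower_bound
    for i in range(num_partitions):
        lower = f"{column} >= {current}" if i != 0 else None
        current += stride
        upper = f"{column} < {current}" if i != num_partitions - 1 else None
        if upper is None:
            # Last partition is open-ended above the final boundary.
            predicates.append(lower)
        elif lower is None:
            # First partition is open-ended below and also picks up NULL keys.
            predicates.append(f"{upper} OR {column} IS NULL")
        else:
            predicates.append(f"{lower} AND {upper}")
    return predicates


# Example: four partitions over ids 0..100.
for clause in jdbc_partition_predicates("id", 0, 100, 4):
    print(clause)
```

Each predicate would be issued as a separate query, so the reads can proceed in parallel up to the number of partitions and available cores. Rows outside the given bounds still land in the first or last partition, which is why badly chosen bounds tend to produce skew rather than missing data.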