Re: df.count() returns one more count than SELECT COUNT()

2017-04-06 Thread Mohamed Nadjib MAMI
count
> res1: Long = 1
>
> Regards,
> Hemanth
>
> From: Mohamed Nadjib Mami <mohamed.nadjib.m...@gmail.com>
> Date: Thursday, 6 April 2017 at 20.29
> To: "user@spark.apache.org" <user@spark.apache.org>
> Subject: df.count() returns one more count than SELECT COUNT()
>
> spark.sql("SELECT count(distinct col) FROM Table").show()

Re: df.count() returns one more count than SELECT COUNT()

2017-04-06 Thread Hemanth Gudela
"select distinct null").count
res1: Long = 1

Regards,
Hemanth

From: Mohamed Nadjib Mami <mohamed.nadjib.m...@gmail.com>
Date: Thursday, 6 April 2017 at 20.29
To: "user@spark.apache.org" <user@spark.apache.org>
Subject: df.count() returns one more count than SELECT COUNT()

spark.sql("SELECT count(distinct col) FROM Table").show()
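
The "select distinct null" result of 1 suggests one possible cause of the off-by-one: a DISTINCT over rows keeps a null as its own row, while count(DISTINCT col) skips nulls. The sketch below is only an illustration under that assumption, not the original poster's data or exact commands. The object name DistinctNullDemo, the helper counts, the view name t, and the sample values are all hypothetical, and it assumes spark-sql is on the classpath.

```scala
import org.apache.spark.sql.SparkSession

object DistinctNullDemo {
  // Returns (SQL count(DISTINCT col), number of rows left after distinct()).
  // The sample data and the view name "t" are hypothetical.
  def counts(spark: SparkSession): (Long, Long) = {
    import spark.implicits._

    // Three non-null values (two of them equal) plus one null.
    val df = Seq[String]("a", "b", "a", null).toDF("col")
    df.createOrReplaceTempView("t")

    // The aggregate ignores nulls, so only "a" and "b" are counted.
    val sqlCount =
      spark.sql("SELECT count(DISTINCT col) FROM t").first().getLong(0)

    // distinct() keeps the null as a row of its own: "a", "b", null.
    val dfCount = df.distinct().count()

    (sqlCount, dfCount)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("distinct-null-demo")
      .getOrCreate()
    try {
      val (sqlCount, dfCount) = counts(spark)
      println(s"count(DISTINCT col) = $sqlCount, distinct().count() = $dfCount")
    } finally {
      spark.stop()
    }
  }
}
```

With this sample data the two numbers differ by exactly one (2 versus 3). If a null in the column is indeed the cause, filtering it out first (for example with df.na.drop()) should make the two counts agree.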

df.count() returns one more count than SELECT COUNT()

2017-04-06 Thread Mohamed Nadjib Mami
I paste this right from Spark shell (Spark 2.1.0):

scala> spark.sql("SELECT count(distinct col) FROM Table").show()
+-------------------+
|count(DISTINCT col)|
+-------------------+
|               4697|
+-------------------+

scala>