Satish Kolli created SPARK-10727: ------------------------------------ Summary: Dataframe count is zero after 'except' operation Key: SPARK-10727 URL: https://issues.apache.org/jira/browse/SPARK-10727 Project: Spark Issue Type: Bug Reporter: Satish Kolli
Data frame count after the except operation is always returning zero even when there is data in the resulting data frame. {code} scala> val df1 = sc.parallelize(1 to 5).toDF("V1") df1: org.apache.spark.sql.DataFrame = [V1: int] scala> val df2 = sc.parallelize(2 to 5).toDF("V2") df2: org.apache.spark.sql.DataFrame = [V2: int] scala> df1.except(df2).show() +---+ | V1| +---+ | 1| +---+ scala> df1.except(df2).count() res4: Long = 0 scala> df1.except(df2).rdd.count() res5: Long = 1 scala> {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org