[ https://issues.apache.org/jira/browse/SPARK-21160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16057614#comment-16057614 ]
Edoardo Vivo commented on SPARK-21160: -------------------------------------- Sorry for the stupid question, but may I ask WHY this is the expected behavior? 1 is different from null... BTW, this is not pandas behavior, for instance. I really don't understand. Thank you. > Filtering rows with "not equal" operator yields unexpected result with null > rows > -------------------------------------------------------------------------------- > > Key: SPARK-21160 > URL: https://issues.apache.org/jira/browse/SPARK-21160 > Project: Spark > Issue Type: Bug > Components: PySpark, Spark Core, SQL > Affects Versions: 2.0.2 > Reporter: Edoardo Vivo > Priority: Minor > > ``` > schema = StructType([StructField("Test", DoubleType())]) > test2 = spark.createDataFrame([[1.0],[1.0],[2.0],[2.0],[None]], schema=schema) > test2.where("Test != 1").show() > ``` > This returns only the rows with the value 2, it does not return the null row. > This should not be the expected behavior, IMO. > Thank you. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org