[GitHub] spark pull request #19707: [SPARK-22472][SQL] add null check for top-level p...

cloud-fan Thu, 09 Nov 2017 09:03:04 -0800

Github user cloud-fan commented on a diff in the pull request:

    https://github.com/apache/spark/pull/19707#discussion_r150024246
  
    --- Diff: sql/core/src/test/scala/org/apache/spark/sql/DatasetSuite.scala 
---
    @@ -1408,6 +1409,23 @@ class DatasetSuite extends QueryTest with 
SharedSQLContext {
           checkDataset(ds, SpecialCharClass("1", "2"))
         }
       }
    +
    +  test("SPARK-22472: add null check for top-level primitive values") {
    +    // If the primitive values are from Option, we need to do runtime null 
check.
    +    val ds = Seq(Some(1), None).toDS().as[Int]
    +    intercept[NullPointerException](ds.collect())
    +    val e = intercept[SparkException](ds.map(_ * 2).collect())
    +    assert(e.getCause.isInstanceOf[NullPointerException])
    +
    +    withTempPath { path =>
    +      Seq(new Integer(1), 
null).toDF("i").write.parquet(path.getCanonicalPath)
    --- End diff --
    
    not a big deal, but `toDF("i")` is more explicit about column name.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #19707: [SPARK-22472][SQL] add null check for top-level p...

Reply via email to