[GitHub] spark pull request #23124: [SPARK-25829][SQL] remove duplicated map keys wit...

dongjoon-hyun Thu, 22 Nov 2018 22:08:06 -0800

Github user dongjoon-hyun commented on a diff in the pull request:

    https://github.com/apache/spark/pull/23124#discussion_r235851923
  
    --- Diff: 
sql/core/src/main/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetRowConverter.scala
 ---
    @@ -558,8 +558,11 @@ private[parquet] class ParquetRowConverter(
     
         override def getConverter(fieldIndex: Int): Converter = 
keyValueConverter
     
    -    override def end(): Unit =
    +    override def end(): Unit = {
    +      // The parquet map may contains null or duplicated map keys. When it 
happens, the behavior is
    +      // undefined.
    --- End diff --
    
    What about creating a Spark JIRA issue for this and embedded that ID here?



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #23124: [SPARK-25829][SQL] remove duplicated map keys wit...

Reply via email to