[GitHub] spark pull request #21737: [SPARK-24208][SQL] Fix attribute deduplication fo...

gatorsmile Mon, 09 Jul 2018 15:17:30 -0700

Github user gatorsmile commented on a diff in the pull request:

    https://github.com/apache/spark/pull/21737#discussion_r201164293
  
    --- Diff: 
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/analysis/Analyzer.scala
 ---
    @@ -738,6 +738,10 @@ class Analyzer(
                 if 
findAliases(aggregateExpressions).intersect(conflictingAttributes).nonEmpty =>
               (oldVersion, oldVersion.copy(aggregateExpressions = 
newAliases(aggregateExpressions)))
     
    +        case oldVersion @ FlatMapGroupsInPandas(_, _, output, _)
    +            if 
AttributeSet(output).intersect(conflictingAttributes).nonEmpty =>
    --- End diff --
    
    cc @maryannxue Deduplicating on conflicting attributes in this function is 
easily broken. In the long term, this is not the perfect way to handle it. We 
should consider to fundamentally fix it.



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark pull request #21737: [SPARK-24208][SQL] Fix attribute deduplication fo...

Reply via email to