J.P Feng created SPARK-19037:
--------------------------------

             Summary: Run count(distinct name) from sub query found some errors
                 Key: SPARK-19037
                 URL: https://issues.apache.org/jira/browse/SPARK-19037
             Project: Spark
          Issue Type: Bug
          Components: Spark Shell, SQL
    Affects Versions: 2.1.0
         Environment: spark 2.1.0, scala 2.11 
            Reporter: J.P Feng


when i use spark-shell or spark-sql to execute count(distinct name) from 
subquery, some errors occur:

select count(distinct name) from (select * from mytest limit 10) as a

if i do this in hive-server2, i can get the correct result.

if i just execute select count(name) from (select * from mytest limit 10) as a, 
i can also get the right result.

besides, i found the same errors when i use max(), distinct(),groupby() with 
subquery.

I think there maybe some bugs when doing key-reduce jobs with subquery.

I will add the errors in new comment.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to