Andrey Gubichev created SPARK-45381:
---------------------------------------

             Summary: Incorrect COUNT bug handling in scalar subqueries
                 Key: SPARK-45381
                 URL: https://issues.apache.org/jira/browse/SPARK-45381
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.5.0
            Reporter: Andrey Gubichev


The following query returns incorrect results:
create temp view l (a, b)
as values
(1, 2.0),
(1, 2.0),
(2, 1.0),
(2, 1.0),
(3, 3.0),
(null, null),
(null, 5.0),
(6, null);

create temp view r (c, d)
as values
(2, 3.0),
(2, 3.0),
(3, 2.0),
(4, 1.0),
(null, null),
(null, 5.0),
(6, null);


select (
select sum(cnt)
from (select count(*) cnt from r where l.a = r.c)
) from l;
 
 
It returns:

-- !query output
1
1
2
2
NULL
NULL
NULL
NULL

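The NULLs in the output should be zeros: count(*) over an empty set of matching rows returns 0, so the inner subquery always yields exactly one row, and sum(cnt) over that row is 0, not NULL.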
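
For reference, the expected per-row values can be worked out by hand. The following is a minimal Python sketch that models the query's semantics outside of Spark, assuming standard SQL rules (an equality comparison involving NULL never matches, and count(*) with no matching rows is 0). The helper name expected_counts is hypothetical and is not part of Spark.

```python
# Rows of the temp views l(a, b) and r(c, d) from the report; None stands for NULL.
l_rows = [(1, 2.0), (1, 2.0), (2, 1.0), (2, 1.0),
          (3, 3.0), (None, None), (None, 5.0), (6, None)]
r_rows = [(2, 3.0), (2, 3.0), (3, 2.0), (4, 1.0),
          (None, None), (None, 5.0), (6, None)]


def expected_counts(l_rows, r_rows):
    """Model of: select (select sum(cnt) from
    (select count(*) cnt from r where l.a = r.c)) from l."""
    results = []
    for a, _b in l_rows:
        # l.a = r.c is never true when either side is NULL.
        cnt = sum(1 for c, _d in r_rows
                  if a is not None and c is not None and a == c)
        # The inner aggregate always produces exactly one row (cnt),
        # so sum(cnt) over that single row equals cnt, including 0.
        results.append(cnt)
    return results


# Sorted for comparison with the sorted query output shown in the report.
print(sorted(expected_counts(l_rows, r_rows)))
```

Under these assumptions, every row of l with no match in r (a = 1 twice, and the two rows where a is NULL) contributes 0 rather than NULL.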
 


