[ https://issues.apache.org/jira/browse/SPARK-25230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Yuming Wang updated SPARK-25230:
--------------------------------
    Description:
How to reproduce:
{code:sql}
spark-sql> SELECT upper('Haßler');
HASSLER
{code}
Mainstream databases return {{HAßLER}}.
!MySQL.png!

This behavior may lead to data inconsistency:
{code:sql}
create temporary view SPARK_25230 as select * from values
  ("Hassler"),
  ("Haßler")
as EMPLOYEE(name);
select UPPER(name) from SPARK_25230 group by 1;
{code}

  was:
How to reproduce:
{code:sql}
spark-sql> SELECT upper('Haßler');
HASSLER
{code}
Mainstream databases returns {{HAßLER}}.
!MySQL.png!

This behave
{code:sql}
create temporary view SPARK_25230 as select * from values
  ("Hassler"),
  ("Haßler")
as EMPLOYEE(name);
select UPPER(name) from SPARK_25230 group by 1;
{code}

> Upper behaves incorrect for string contains "ß"
> -----------------------------------------------
>
>                 Key: SPARK-25230
>                 URL: https://issues.apache.org/jira/browse/SPARK-25230
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 2.3.1
>            Reporter: Yuming Wang
>            Priority: Major
>         Attachments: MySQL.png, Oracle.png, Teradata.jpeg
>
>
> How to reproduce:
> {code:sql}
> spark-sql> SELECT upper('Haßler');
> HASSLER
> {code}
> Mainstream databases return {{HAßLER}}.
> !MySQL.png!
>
> This behavior may lead to data inconsistency:
> {code:sql}
> create temporary view SPARK_25230 as select * from values
>   ("Hassler"),
>   ("Haßler")
> as EMPLOYEE(name);
> select UPPER(name) from SPARK_25230 group by 1;
> {code}
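
For readers without a Spark shell at hand, the grouping problem described in this issue can be sketched outside Spark. The snippet below is only an illustration and is not Spark code. It relies on Python's {{str.upper()}}, which also maps "ß" to "SS", as a stand-in for the reported {{upper()}} behavior. The helper name {{group_by_upper}} is hypothetical.

```python
from collections import defaultdict

# The two distinct names from the SPARK_25230 temporary view in the report.
names = ["Hassler", "Haßler"]


def group_by_upper(values):
    """Group values by their upper-cased form, mimicking GROUP BY UPPER(name)."""
    groups = defaultdict(list)
    for value in values:
        # str.upper() expands "ß" to "SS", so "Haßler" becomes "HASSLER".
        groups[value.upper()].append(value)
    return dict(groups)


groups = group_by_upper(names)
print(groups)
```

Under this mapping both names fall into a single group, whereas a database that returns {{HAßLER}} would keep them in two groups. That difference is the inconsistency the description refers to.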