Lauri Koobas created SPARK-34993:
------------------------------------

             Summary: from_json() acts differently on created and literal 
strings with backslashes
                 Key: SPARK-34993
                 URL: https://issues.apache.org/jira/browse/SPARK-34993
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.1.1
         Environment: Databricks DBR 8.1
            Reporter: Lauri Koobas


JSON string with the value that contains backslashes fails to be recovered by 
`from_json()`.

I found that if the same string is created with `to_json(named_struct())` then 
it actually does work.

 

The following code to reproduce. I would expect all of these methods to return 
the same (correct) result:

```select to_json(named_struct('msg', '\\'))
, schema_of_json(to_json(named_struct('msg', '\\')))
, from_json(to_json(named_struct('msg', '\\')), 
schema_of_json(to_json(named_struct('msg', '\\'))))
, from_json('\{"msg":"\\"}', schema_of_json(to_json(named_struct('msg', '\\'))))
, from_json('\{"msg":"\\"}', 'msg string')

```



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to