Yeah, I heard about that. This IMHO is a bit more worrying, and we do not have the "excuse" that it is transparent. Also, which of these would be STRING and which IDENTIFIER?
On Mar 25, 2024 at 1:06 PM -0700, Alex Cruise <a...@cluonflux.com>, wrote:

While we're at it, maybe consider allowing "smart quotes" too :)

-0xe1a

On Sat, Mar 23, 2024 at 5:29 PM serge rielau.com <se...@rielau.com> wrote:

Hello,

I have a PR https://github.com/apache/spark/pull/45620 ready to go that will extend the definition of whitespace (what separates tokens) from the small set of ASCII characters (space, tab, linefeed) to those defined in Unicode.

While this is a small and safe change, it is one where we would have a hard time changing our minds about later. It is also a change that, AFAIK, cannot be controlled under a config.

What does the community think?

Cheers
Serge
SQL Architect at Databricks
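
To make the whitespace proposal concrete, here is a minimal Python sketch of how the token boundaries could shift. It is not the code from the PR and not Spark's lexer. `split_tokens` is a hypothetical helper, `ASCII_WS` is assumed from the "space, tab, linefeed" wording above, and `UNICODE_WS` assumes that "those defined in Unicode" means the code points carrying the Unicode White_Space property.

```python
# Assumption: the "small set" named in the message (space, tab, linefeed).
ASCII_WS = " \t\n"

# Assumption: the code points with the Unicode White_Space property.
UNICODE_WS = (
    "\u0009\u000A\u000B\u000C\u000D\u0020\u0085\u00A0\u1680"
    "\u2000\u2001\u2002\u2003\u2004\u2005\u2006\u2007\u2008\u2009\u200A"
    "\u2028\u2029\u202F\u205F\u3000"
)


def split_tokens(text, whitespace):
    """Hypothetical helper: split text on runs of the given separator characters."""
    tokens = []
    current = []
    for ch in text:
        if ch in whitespace:
            # A separator ends the token being built, if there is one.
            if current:
                tokens.append("".join(current))
                current = []
        else:
            current.append(ch)
    if current:
        tokens.append("".join(current))
    return tokens


# A no-break space (U+00A0) sits between SELECT and 1.
query = "SELECT\u00A01"
print(split_tokens(query, ASCII_WS))    # one token: the no-break space is not a separator
print(split_tokens(query, UNICODE_WS))  # two tokens: SELECT and 1
```

Under these assumptions, text pasted from a word processor with a no-break space tokenizes differently before and after the change, which is why it would be hard to undo once queries start relying on it.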