While we're at it, maybe consider allowing "smart quotes" too :)

-0xe1a

On Sat, Mar 23, 2024 at 5:29 PM serge rielau.com <se...@rielau.com> wrote:

> Hello,
>
> I have a PR https://github.com/apache/spark/pull/45620 ready to go that
> will extend the definition of whitespace (what separates tokens) from the
> small set of ASCII characters space, tab, linefeed to those defined in
> Unicode.
> While this is a small and safe change, it is one where we would have a
> hard time changing our minds about later.
> It is also a change that, AFAIK, cannot be controlled under a config.
>
> What does the community think?
>
> Cheers
> Serge
> SQL Architect at Databricks