+1, this is a reasonable change. Gengliang
On Wed, Mar 27, 2024 at 9:54 AM serge rielau.com <se...@rielau.com> wrote: > Going once, going twice, …. last call for objections > On Mar 23, 2024 at 5:29 PM -0700, serge rielau.com <se...@rielau.com>, > wrote: > > Hello, > > I have a PR https://github.com/apache/spark/pull/45620 ready to go that > will extend the definition of whitespace (what separates token) from the > small set of ASCII characters space, tab, linefeed to those defined in > Unicode. > While this is a small and safe change, it is one where we would have a > hard time changing our minds about later. > It is also a change that, AFAIK, cannot be controlled under a config. > > What does the community think? > > Cheers > Serge > SQL Architect at Databricks > >