date:20140606

Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-06 Thread Doug Ewell

Philippe Verdy wrote: >> If you have an arbitrary fragment of data, don't fiddle with it. > > Thisis your scenario. The simple concept of a unique "start" of text > does not exist in live streams that can start anywhere. So you cannot > always expect that U+FEFF or U+FFFE will only exist once in

Re: Corner cases (was: Re: UTF-16 Encoding Scheme and U+FFFE)

2014-06-06 Thread Steffen Nurpmeso

"Doug Ewell" wrote: |Philippe Verdy wrote: |> Not necessarily true. |> |> [602 words] | |This has nothing to do with the scenario I described, which involved |removing a "BOM" from the start of an arbitrary fragment of data, |thereby corrupting the data because the "BOM" was actually a ZW

Re: Swift

2014-06-06 Thread Richard Wordingham

On Fri, 6 Jun 2014 01:55:57 +0200 Philippe Verdy wrote: > IMHO, a programming language that accepts non-ASCII identifiers should > always nrmalize the identifiers it accepts, before heeding it in its > hashed symbol table. Unfortunately, C and C++ don't normalise. Consequently, all a compiler c