When it comes to methods operating on
buffers there's always the tension between viewing the buffer as
text elements vs. as data elements. For some purposes, from error
detection to data cleanup you need to be able to treat the buffer
as data elements. For many other operations, a focus on text
elements is enough.
If you desire to have a regex that you can use to validate a raw buffer, then that regex must do something sensible with partial code points. If you don't have multiple regex engines, then limiting your single one to valid input prevents you from using it everywhere. A./ On 10/20/2015 3:06 AM, Philippe Verdy wrote:
|
- Unpaired surrogates (was: Re: Why Work at Encoding Leve... Doug Ewell
- Re: Unpaired surrogates (was: Re: Why Work at Enco... Philippe Verdy
- Re: Unpaired surrogates (was: Re: Why Work at Enco... Markus Scherer
- Re: Unpaired surrogates Richard Wordingham
- Re: Unpaired surrogates Philippe Verdy
- Re: Unpaired surrogates Asmus Freytag (t)