Hi Derick, > On 9 Dec 2014, at 16:54, Derick Rethans <der...@php.net> wrote: > >> On Tue, 9 Dec 2014, Andrea Faulds wrote: >> >> I think \x{xxxx} is misleading anyway - \xXX is always >> single-byte/character, yet Unicode code points can’t be represented in >> PHP strings as single bytes when encoded in UTF-8 (unless they’re >> below U+0100, of course). > > You mean below U+0080 surely? Only the "first 7 bits" can be represented > as a single byte with UTF-8. U+0080 is for example 0xC2 0x80 in UTF-8.
Ah, yes, my bad. I was probably getting confused with how all Unicode codepoints up to U+0100 match Latin-1. Thanks! -- Andrea Faulds http://ajf.me/ -- PHP Internals - PHP Runtime Development Mailing List To unsubscribe, visit: http://www.php.net/unsub.php