If you use Unicode 16-bit strings, it's easy to "pass through" unpaired surrogates and treat them like code points; it's often not productive or necessary to check for them all the time, that is, to be strict about UTF-16.
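To illustrate what I mean by "passing through", here is a minimal sketch in Python, assuming the string is held as a plain list of 16-bit code unit integers. The function name `code_points` is hypothetical, not taken from any library. Well-formed surrogate pairs are combined into supplementary code points; an unpaired surrogate is simply yielded as a code point with its own value, with no error and no replacement character.

```python
def code_points(units):
    """Yield code points from a sequence of 16-bit code units.

    Lenient by design: a lead surrogate followed by a trail surrogate is
    combined into one supplementary code point; any unpaired surrogate is
    passed through unchanged as a (surrogate) code point.
    """
    i = 0
    n = len(units)
    while i < n:
        u = units[i]
        i += 1
        # Combine only when a lead surrogate is immediately followed by a trail.
        if 0xD800 <= u <= 0xDBFF and i < n and 0xDC00 <= units[i] <= 0xDFFF:
            u = 0x10000 + ((u - 0xD800) << 10) + (units[i] - 0xDC00)
            i += 1
        yield u


# 'A', a well-formed pair for U+1F600, a lone lead surrogate, 'B'
sample = [0x0041, 0xD83D, 0xDE00, 0xD800, 0x0042]
print([hex(cp) for cp in code_points(sample)])
```

A strict UTF-16 processor would instead have to report an error or substitute U+FFFD at the lone 0xD800; the lenient loop needs no extra branch for that case.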
On the other hand, I don't think anyone expects you to support invalid UTF-8, and especially not to support any and all Unicode 8-bit strings (see section 3.9, Unicode Encoding Forms, of the Unicode Standard for what I mean here). If you find UTS #18 unclear or misleading, I suggest you submit feedback pointing out the specific text issues.

markus