On Wed, Mar 21, 2018 at 11:40 AM, Anne van Kesteren <ann...@annevk.nl> wrote: > On Wed, Mar 21, 2018 at 10:27 AM, Henri Sivonen <hsivo...@hsivonen.fi> wrote: >> * A bunch of things atomicize URL components. I'd hope that the URLs >> were converted from UTF-16 to UTF-8 at some prior point ensuring UTF-8 >> validity, but it's hard to be sure. > > At least per the specification all URL components end up with only > ASCII code points after parsing the URL.
Good point. Thanks. I'll go ahead with doing WHATWG-compliant "with replacement" when trying to atomize invalid UTF-8 (which shouldn't happen anyway). -- Henri Sivonen hsivo...@hsivonen.fi https://hsivonen.fi/ _______________________________________________ dev-platform mailing list dev-platform@lists.mozilla.org https://lists.mozilla.org/listinfo/dev-platform