Re: [fossil-users] File contains invalid UTF-8, but is not UTF-8.

Shal Farley Tue, 08 Jul 2014 15:01:25 -0700

Andy,

> If no characters between 128 and 255 are valid UTF-8, and they can
> never be valid UTF-8  characters, and are used by many  encodings,
> why doesn't Fossil simply ignore them when they  are committed?

I think Stephan said it poorly. A solitary byte in that range is nevervalid UTF-8, but UTF-8 represents all code points higher than 127 as asequence of bytes in the 128 to 255 range. Those byte sequences have astructure, so it is possible to tell if a string of bytes in that rangerepresents a valid UTF-8 sequence.


-- Shal

--
Shal Farley                                 s...@cheshireeng.com
Cheshire Engineering Corporation      http://www.CheshireEng.com
120 West Olive Avenue                            +1 626 303 1602
Monrovia, CA 91016                           FAX +1 626 303 1590
_______________________________________________
fossil-users mailing list
fossil-users@lists.fossil-scm.org
http://lists.fossil-scm.org:8080/cgi-bin/mailman/listinfo/fossil-users

Re: [fossil-users] File contains invalid UTF-8, but is not UTF-8.

Reply via email to