Good evening to all,
i have been doing a review of my utf8 program that some of you have
received. I have found a slight error in the code that will effect
determining if a particular sequence is valid UFT-8.
It is found in the initialisation code in *procedure utf8initialise()*
The following two lines need to be updated
every utf8counttable[charlist[241 to 245]] := 1 # \xF0 -> \xF*5*
change to
every utf8counttable[charlist[241 to 245]] := 1 # \xF0 -> \xF*4*
and
utf8limittable[4] := [["\xF0", "\xF*5*"], ["\x80", "\xBF"],
["\x80", "\xBF"], ["\x80", "\xBF"]]
change to
utf8limittable[4] := [["\xF0", "\xF4"], ["\x80", "\xBF"],
["\x80", "\xBF"], ["\x80", "\xBF"]]
In addition, some of the code points starting with "\xF4" in which the
value is greater than 0x10FFFF should be classified as invalid but are
not currently tested for.
If you are using this code for any purpose, please make the required
updates.
regards
Bruce Rennie
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Unicon-group mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/unicon-group