On Sun, Jan 25, 2009 at 16:50, PauAmma <[email protected]> wrote:
> On Sun, 25 Jan 2009, Philip Newton wrote:
>
>> What's probably least problematic is allowing only Latin, including
>> accented Latin, but no other scripts.
>
> Which "accented Latin"? 8859-1? 8859-2? 8859-15? (And I'm probably missing
> some)

I was probably thinking of something along the lines of (the
alphabetic characters in) the following named Unicode blocks:

Basic Latin
Latin-1 (aka Latin-1 Supplement)
Latin Extended A
Latin Extended B
Latin Extended C
Latin Extended D
Latin Extended Additional

(See http://www.unicode.org/charts/ , bottom left.)

ISO 8859-1 is "obviously" essentially Basic Latin + Latin-1; the
alphabetic characters in ISO 8859-2 (Latin-2) and ISO 8859-15
(Latin-9) should also all be contained in the above blocks.

Probably even in the first three, i.e. Basic + L-1 + Ext-A -- and
maybe those are sufficient, since Ext-B to -D include
increasingly-obscure (IMAO) letters.

Cheers,
-- 
Philip Newton <[email protected]>
_______________________________________________
dw-discuss mailing list
[email protected]
http://lists.dwscoalition.org/cgi-bin/mailman/listinfo/dw-discuss
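
For illustration only, here is a rough Python sketch of what "alphabetic characters in the above blocks" could look like as a check. The block ranges are the ones given in the Unicode code charts; the names LATIN_BLOCKS, CORE_BLOCKS and is_allowed_letter are made up for this example, and treating str.isalpha() as the definition of "alphabetic" is an assumption, not something decided in this thread.

```python
# Inclusive code point ranges of the Latin blocks, per the Unicode code charts.
LATIN_BLOCKS = {
    "Basic Latin": (0x0000, 0x007F),
    "Latin-1 Supplement": (0x0080, 0x00FF),
    "Latin Extended-A": (0x0100, 0x017F),
    "Latin Extended-B": (0x0180, 0x024F),
    "Latin Extended Additional": (0x1E00, 0x1EFF),
    "Latin Extended-C": (0x2C60, 0x2C7F),
    "Latin Extended-D": (0xA720, 0xA7FF),
}

# The "first three" blocks, which may already be sufficient.
CORE_BLOCKS = ("Basic Latin", "Latin-1 Supplement", "Latin Extended-A")


def is_allowed_letter(ch, block_names=None):
    """Return True if ch is alphabetic and lies in one of the named blocks.

    Assumption: "alphabetic" means str.isalpha(); digits, punctuation
    and symbols are rejected even when they sit inside a Latin block.
    """
    names = LATIN_BLOCKS if block_names is None else block_names
    if not ch.isalpha():
        return False
    cp = ord(ch)
    return any(LATIN_BLOCKS[n][0] <= cp <= LATIN_BLOCKS[n][1] for n in names)


def is_allowed_name(name, block_names=None):
    """Return True if name is non-empty and every character is allowed."""
    return bool(name) and all(is_allowed_letter(c, block_names) for c in name)
```

With this sketch, is_allowed_name("Łódź") is accepted even when restricted to CORE_BLOCKS, while a Cyrillic or Greek name is rejected with any block set. A letter such as "Ə" (Latin Extended-B) passes only when the wider list is used.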
