On 02/12/2019 05:56, Doug McKenna wrote:
- \lccode and/or \uccode for non-letter code points
   for which an upper or lower case mapping is given

The problem with this is that earlier, it is stated that all combining mark 
code points (class code starting with 'M' in the UnicodeData.txt file) are to 
be considered letters (\catcode set to 11).  So there's an ambiguity here that 
needs clearing up.  Does the above apply to combining mark code points or not?

You've read something in that is not in the README ;)

The file says

  - `\catcode` 11 for all combining marks (Unicode class "M")

where I've very deliberately kept the TeX 'side' as what *actually happens* (catcode-11), not said they are 'treated as letters', or similar.

I will clarify that 'letter' here means a codepoint with Unicode character class "L", and is not linked to the TeX catcode.

It may be that none of the combining marks in the data file have any case 
mappings, but there's no guarantee that is true.  So the question is, if a 
combining mark has an uppercase or lowercase mapping, does that get installed 
in \lccode and/or \uccode?

Yes, or at least would be the case in principle: all code points with upper/lower/title properties are set up.

Also, there's a confusing typo ("can"?) in

- \lccode and \uccode for all of class "Lt" (title
   case letters) to the lower can upper case mappings
   (or if not given to the code point itself)

Should "can' be "and/or"?

It is 'and': you need to set lccode and uccode for these code points.

Joseph

Reply via email to