Tom Lord wrote: > Two code points represent non-characters. These are U+FFFE and > U+FFFF. Programs are free to give these values special meaning > internally. Unicode (2.0 and up?) has 34 non-characters at U+xxFFFE and U+xxFFFF where xx is 00, 01, .., 0F, 10. Unicode 3.1 is adding another 32 non-characters on the BMP. See UTR 27 for details. markus
- An Aburdly Brief Introduction to Unicode (was Re: Percept... Tom Lord
- Re: An Aburdly Brief Introduction to Unicode (was Re... Peter_Constable
- Re: An Aburdly Brief Introduction to Unicode (was Re... Markus Scherer
- Re: An Aburdly Brief Introduction to Unicode (was Re... Paul Keinanen
- Re: An Aburdly Brief Introduction to Unicode (was Re... Mark Davis
- Re: An Aburdly Brief Introduction to Unicode (was Re... John Cowan
- Re: An Aburdly Brief Introduction to Unicode (was Re... Mark Davis
- Re: An Aburdly Brief Introduction to Unicode (was Re... Kenneth Whistler
- Re: An Aburdly Brief Introduction to Unicode (was Re... Peter_Constable
- Re: An Aburdly Brief Introduction to Unicode (was Re... Peter_Constable
- Re: An Aburdly Brief Introduction to Unicode (was Re... Peter_Constable
- Re: An Aburdly Brief Introduction to Unicode (was Re... Kenneth Whistler
- Re: An Aburdly Brief Introduction to Unicode (was Re... Peter_Constable