Consonant shifters and ZWNJ in Khmer

2018-07-20 Thread Norbert Lindenberg via Unicode
The section on consonant shifters in the Khmer section of the Unicode standard (page 647 of Unicode 11 [1]) isn’t entirely clear on where the zero width non-joiner should be placed to prevent a consonant shifter that’s followed by an above-base vowel from being changed to a below-base glyph. Fi

Re: metric for block coverage

2018-02-23 Thread Norbert Lindenberg via Unicode
> On Feb 18, 2018, at 3:26 , Khaled Hosny via Unicode > wrote: > > On Sun, Feb 18, 2018 at 02:14:46AM -0800, James Kass via Unicode wrote: >> Adam Borowski wrote, >> >>> I'm looking for a way to determine a font's coverage of available scripts. >>> It's probably reasonable to do this per Unico

Re: Unicode education in Schools

2017-08-26 Thread Norbert Lindenberg via Unicode
ECMAScript 6 fixed that, largely along the lines of my proposal: http://norbertlindenberg.com/2012/05/ecmascript-supplementary-characters/index.html Norbert > On Aug 24, 2017, at 22:14 , Peter Constable via Unicode > wrote: > > I thought Javascript had a UCS-2 understanding of Unicode strings

Re: Northern Khmer on iPhone

2017-03-02 Thread Norbert Lindenberg
On iOS, applications can and do install custom fonts for system-wide use, although the installation user experience is pretty bad: http://norbertlindenberg.com/2015/06/installing-fonts-on-ios/index.html Norbert > On Mar 1, 2017, at 18:43 , Alastair Houghton > wrote: […] > (Also, FYI, iOS ap

Re: Suppressing Ligation of Spacing Marks

2016-11-09 Thread Norbert Lindenberg
The part of the specification of the Universal Shaping Engine [1] that deals with ZWNJ is a bit unclear, but I read it to mean that ZWNJ should not cause the insertion of a dotted circle if the character following it has general category Mn or Mc. The USE specification says: "The zero-width non

Re: Unicode in passwords

2015-10-06 Thread Norbert Lindenberg
> On Oct 6, 2015, at 6:04 , Philippe Verdy wrote: > > In those conditions, normalizing the Java string will leave those lone > surrogates (and non-characters) as is, or will throw an exception, depending > on the API used. Java strings do not have any implied encoding (their "char" > members

Re: Ways to detect that XXXX in JSON \uXXXX does not correspond to a Unicode character?

2015-05-09 Thread Norbert Lindenberg
RFC 7158 section 7 [1] provides not only the \u notation for Unicode code points in the Basic Multilingual Plane, but also a 12-character sequence encoding the UTF-16 surrogate pair (i.e. \u\u with 0xD800 ≤ < 0xDC00 ≤ ≤ 0xDFFF) for supplementary Unicode code points. A tool

Re: Swift

2014-06-08 Thread Norbert Lindenberg
It does allow some usage that may surprise code reviewers – for example, this is a valid Swift program: let s = "😄" let s︀ = "😞" let ︀ = "😉" let all = s + s︀ + ︀ The value of the constant “all” is "😄😞😉". Or at least it is as long as mail software doesn’t harm the variation selectors… Norbert

Re: Some much-needed improvements in JavaScript i18n

2012-12-20 Thread Norbert Lindenberg
I recommend that people interested in the ECMAScript Internationalization API read the actual standard or my introduction to it, and don't rely on Philippe's interpretation. http://www.ecma-international.org/ecma-402/1.0/ http://norbertlindenberg.com/2012/12/ecmascript-internationalization-api/

Re: data for cp1252

2012-12-08 Thread Norbert Lindenberg
On Dec 7, 2012, at 17:48 , Buck Golemon wrote: > It's also correct. *All* browsers have this behavior. The W3C has found this > behavior to be correct. Opera at one point in time implemented the current > unicode.org cp1252 spec, but was forced to change to the W3C spec by > real-world requirem

Re: Opinions on this Java URL?

2004-11-15 Thread Norbert Lindenberg
Theodore, Thank you for your feedback. Adding a warning to the description in DataInput sounds like a good idea. In the meantime, if somebody wants to use modified UTF-8 outside the Java context, please point them to http://java.sun.com/developer/technicalArticles/Intl/Supplementary/ index.htm

Supplementary characters in the Java(TM) platform

2004-02-27 Thread Norbert Lindenberg
I know a number of you are curious about how the Java platform will support the supplementary characters of the Unicode standard. The JSR 204 expert group, consisting of experts of ten companies, has today published the Public Review Draft of its specification at: http://jcp.org/aboutJava/co

Re: Does Java 1.5 support Unicode math alphanumerics as variable names?

2004-02-26 Thread Norbert Lindenberg
Murray, Yes, starting from J2SE 1.5 the Java programming language allows supplementary characters in identifiers if they meet the specifications of the new methods java.lang.Character.isJavaIdentifierStart(int) and java.lang.Character.isJavaIdentifierPart(int). Sorry for the late reply - until

Re: Traditional dollar sign

2003-10-27 Thread Norbert Lindenberg
The holographic strip on the Euro notes shows the Euro symbol when viewed at certain angles. Norbert Peter Kirk wrote: > > The latest issue of UK banknotes do carry the pound sterling sign (with > one crossbar), but this is quite new. At least the more recent former > issues did not, if I rememb