乶 appears to be the correct character for 20086. I checked it with http://www.visibone.com/htmlref/char/cer.htm which is a handy page containing all the 16-bit chars. http://www.visibone.com/htmlref/char/cer20000.htm is the detailed page for that that neighborhood.
The URL you cited appeared to be for 20000 to 20FFF in hex? ----- Original Message ----- From: Raul Miller <[email protected]> Date: Friday, April 22, 2011 7:40 Subject: Re: [Jprogramming] 32 bit wide unicode characters? To: Programming forum <[email protected]> > On Fri, Apr 22, 2011 at 10:26 AM, bill lam > <[email protected]> wrote: > > what is combining character? how is that related to 32-bit unicode? > > may be just ,|._2]\'asdf' to reverse in pairs. > > But, ,|._2]\'asdf' would give us dfas > instead of fdsa, so I do not > think that that is a viable option. > > Ok, you are right though, it's easy to show that J can handle unicode > characters larger than 16 bit: > > 3 &u:inv 20086 > 乶 > # 3 &u:inv 20086 > 1 > > The displayed character looks somewhat different than what I expected > from > http://en.wikibooks.org/wiki/Unicode/Character_reference/20000-20FFF > but I do not know enough about the character to say that this is an > error. > > So, ok, they are asking for special treatment of combining characters. > I think that that needs better treatment than was given, > and will > take that up on the rosettacode site. ---------------------------------------------------------------------- For information about J forums see http://www.jsoftware.com/forums.htm
