Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Doug Ewell
David Starner <[EMAIL PROTECTED]> wrote: >> IMHO, Unicode Consotium should make these people happy before Tolkien >> fans and Trekkies. > > Why? How does the happiness of Tolkein fans and Trekkies have anything > do with the happiness of those Japanese? Encoding characters is largely > a parell

Re: Missing values in mapping-tables?

2002-03-15 Thread Doug Ewell
Lars Kristan <[EMAIL PROTECTED]> wrote: > Suppose ISO 8859-8 is ever upgraded (even if not likely, but - for the sake > of argument). One might say that it would be bad to change an existing > definition in the table e.g. for 0xBF from 0x2DBF to 0x20AC. But how is that > worse from changing it fr

Re: Technical Merit vs. Artistic Impression

2002-03-15 Thread David Starner
On Sat, Mar 16, 2002 at 07:32:19AM +0900, Dan Kogai wrote: > On Saturday, March 16, 2002, at 06:32 , David Starner wrote: > >So people can't get work done, and poets are silenced because of > >politics. The very concept is something to which I object most strongly. > > To me such poets who give

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Ben Monroe
Kenneth Whistler wrote: > Ben Monroe wrote: > > > As it is a personal spelling, I never expected > > Unicode to map a code point to this character to me. > > For those not following the Japanese in the UTF-8, Ben's name > is Monryuu Ben in kanji. This is a sound-based name coinage > for an English

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Thomas Chan
On Fri, 15 Mar 2002, Kenneth Whistler wrote: > Ben Monroe wrote: > > As it is a personal spelling, I never expected > > Unicode to map a code point to this character to me. > > For those not following the Japanese in the UTF-8, Ben's name > is Monryuu Ben in kanji. This is a sound-based name co

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Thomas Chan
On Fri, 15 Mar 2002, Jungshik Shin wrote: > On Sat, 16 Mar 2002, Dan Kogai wrote: > > (*) My parents wanted me to name me ÷¥ (U+5F48), a classical form, but > > it was not listed on "the table of Kanjis allowed for names" so I was > > named U+5F3E. > > Frankly speaking, I find it rather har

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Thomas Chan
On Fri, 15 Mar 2002, John H. Jenkins wrote: > On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote: > > > There are so many Watanabe-sans, Saito-san, and others whose name cannot > > be spelled in Unicode. > > Can you document this? You know, there's a prize offered for the first > person

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Rick McGowan
Ben Monroe wrote: > While I lived in Japan, I always wrote my last name with $BN6(B stuck > inside of $BLg(B (in the space in the lower middle). Of course, anyone can do that -- Han is known to be a productive system and people have alyways played games with characters for fun and scholar

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 07:27 , Kenneth Whistler wrote: > *What* still holds true? These are just well-worn issues of itaiji > (variant forms). The characters from the little anime exhibit of > variants are, in Unicode: > > U+9AD8 / U+9AD9 > U+5516 / U+555E > U+9593 / U+9592 > > all varian

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Kenneth Whistler
Ben Monroe wrote: > As it is a personal spelling, I never expected > Unicode to map a code point to this character to me. For those not following the Japanese in the UTF-8, Ben's name is Monryuu Ben in kanji. This is a sound-based name coinage for an English name. Mon 'gate' ryuu ~ ryoo 'dragon

Re: Regd- ISCII to Unicode Converter!

2002-03-15 Thread Markus Scherer
ICU supports ISCII, except for the font-style attributes (like "bold") which are not expressible in plain text. http://oss.software.ibm.com/cvs/icu/~checkout~/icu/source/data/mappings/convrtrs.txt http://oss.software.ibm.com/icu/ ISCII is algorithmic. The mapping part to/from Unicode is fairly

Technical Merit vs. Artistic Impression

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 06:32 , David Starner wrote: > On Sat, Mar 16, 2002 at 05:53:04AM +0900, Dan Kogai wrote: >> As an engineer I fully second your opinion. As a person whose name >> once compromised by the government, I have to note it may be >> politically incorrect to do so just

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Kenneth Whistler
Dan Kogai continued: >For instance, > > http://www.horagai.com/www/moji/int/kasiwa.htm > >reports that in Kashiwa, Chiba, a typical suburban city with > population about 210,000, some 21,587 people needed character that was > not listed in JIS. This long interview seems to be about,

Re: Collation - last character?

2002-03-15 Thread Markus Scherer
How about U+10? It is a non-character, which gives it a high (unassigned character) weight in the UCA. It is the highest code point = "the last character". It cannot be a Private-Use character, so few people will be tempted to tailor it to something other than its default UCA weight. It als

Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 06:08 , Kenneth Whistler wrote: > http://www.horagai.com/www/moji/code4.htm > > is a rather out-of-date diatribe against Unicode, dated 1997 (but > possibly > touched a little since then), by Kato Koiti, a known Unicode detractor. > It is flogging the truly dead ho

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 05:34 , John H. Jenkins wrote: > Can you document this? You know, there's a prize offered for the first > person to document the existence of someone whose Japanese name cannot > be represented by Unicode 3.2. Hey, who's offering the price? Would you tell me

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 06:00 , Jungshik Shin wrote: >> situation. There are so many Watanabe-sans, Saito-san, and others >> whose >> name cannot be spelled in Unicode. > > and any of Japanese character sets, right? Sadly yes. Japanese government should definitely spend more resour

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread John H. Jenkins
On Friday, March 15, 2002, at 01:25 PM, David Starner wrote: > On Sat, Mar 16, 2002 at 05:33:30AM +0900, Dan Kogai wrote: >> And the whole discussion of Tengwar sounds to >> me like Webster, Oxford, or American Heritage should add "kwijibo" into >> their vocabulary because Bart Simpson has used

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Ben Monroe
John H. Jenkins wrote: > On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote: > > > There are so many Watanabe-sans, Saito-san, and others whose name cannot > > be spelled in Unicode. > > > > Can you document this? You know, there's a prize offered for the first > person to document the exis

Re: Private Use Agreements and Unapproved Characters

2002-03-15 Thread Peter_Constable
On 03/13/2002 01:33:36 PM "John H. Jenkins" wrote: >> There is ICU, which handles most of the backend stuff. If you want to >> get Apple or Microsoft to donate a renderer or fonts such that everyone >> can use them, that would be greatly appreciated >OTOH FreeType is, I hear, working on OpenType

Re: Collation - last character?

2002-03-15 Thread Asmus Freytag
At 11:13 AM 3/15/02 -0800, you wrote: >Once again, if you want a *character* to >correspond to that highest weight, then you have to tailor the >table to do so. But then, of course, you could assign any character >you want to have that highest weight value, including a private >use character or ev

Re: XSLT to convert data

2002-03-15 Thread Peter_Constable
On 03/15/2002 05:15:12 AM "Florian Boehme" wrote: >Hi there, > >is anyone able to tell me, if it is possible to use XSLT to convert data >from one XML schema, to another one, e.g. ebXML to cXML? OT for this list, but that's not the first time... This is a two-part question: 1) XSLT is certain

Re: Manyoushuu and Unicode, (was Re: Synthetic scripts, etc)

2002-03-15 Thread Kenneth Whistler
> > Can anyone with a bit of Kanji knowledge check this page: > > http://etext.lib.virginia.edu/japanese/manyoshu/AnoMany.unavailable.html > > and tell whether all of the "missing" Kanji are in Unicode 3.2? A bunch > of them are shown as not available in the Dai Kanwa, but I'm under th

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread David Starner
On Sat, Mar 16, 2002 at 05:33:30AM +0900, Dan Kogai wrote: > And the whole discussion of Tengwar sounds to > me like Webster, Oxford, or American Heritage should add "kwijibo" into > their vocabulary because Bart Simpson has used (I am relieved to find > there is no "Unidict" !) Rather more li

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Kenneth Whistler
Dan Kogai said: >Sorry. I am just a network consultant by trade (and Just Another Perl > Encode Hacker by accident :) and I know of these classical writing no > more than you do. I am just repeating what those who KNOW have told me. Or those who *CLAIM* to know. >If you can grok Jap

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Jungshik Shin
On Sat, 16 Mar 2002, Dan Kogai wrote: >For instance, there are at least 31 (official) way to spell 'Wata' of > Watanabe, a very popular Japanse family name. Only a couple of which is > in JIS0208-1990, one of many charsets Unicode based upon. Well, in this This does not necessarily m

Manyoushuu and Unicode, (was Re: Synthetic scripts, etc)

2002-03-15 Thread Rick McGowan
As long as we're on the subject of Han... [EMAIL PROTECTED] wrote: > Well, I can even give up on classical writings (after all my knowledge > on classical writings, East or West, is too limited to discuss in > depth). But it strikes me to face the fact that some of you can't even > spell yo

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread John H. Jenkins
On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote: > There are so many Watanabe-sans, Saito-san, and others whose name cannot > be spelled in Unicode. > Can you document this? You know, there's a prize offered for the first person to document the existence of someone whose Japanese nam

Re: processing numeric strings

2002-03-15 Thread Markus Scherer
[EMAIL PROTECTED] wrote: > I've got a question I asked about on a couple of other lists, but didn't > get much response, so I thought I'd try here. > > One of our developers has asked me for input on a certain problem: "Do I > need to be able to work with numbers represented using digits/numbe

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 05:12 , <[EMAIL PROTECTED]> wrote: > I have to hasten to add that I have no knowledge of Eastern classics, > unfortunately. I just jumped to the conclusion that the > above-mentioned would have been one of those cases where new Chinese > characters were created

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread David Starner
On Sat, Mar 16, 2002 at 04:38:16AM +0900, Dan Kogai wrote: > Only a couple of which is > in JIS0208-1990, one of many charsets Unicode based upon. And what about Unicode? JIS X0208 is merely a subset of Unicode. Have you actually looked through Unified Han and Han Extension A and B? > IMHO,

RE: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread jarkko . hietaniemi
: : > Tengwar be added BEFORE Ciao-Ciao's poetries and Man-Yo-Shu become : : > encodable in Unicode. : : : : Why should a great Western author have to wait on a great Eastern author? : : My reaction exactly: why author A's special characters (*) would be more special : than author B's special c

RE: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread jarkko . hietaniemi
> IMHO, Unicode Consortium should make these people happy before Tolkien > fans and Trekkies. This is true. But unless someone formally proposes the missing characters, they won't appear in the Unicode. Us Tolkien fans cannot be blamed for Japanese standards organisations, I think :-)

RE: Collation - last character?

2002-03-15 Thread Lars Kristan
Kenneth Whistler wrote: > In the Unicode Collation Algorithm (UTS #10), there is no explicit > weight assigned corresponding to , but a primary weight > assignment of 0x is guaranteed to be higher than that of > any Han character. Well, then I am proposing to introduce such a character. U+FFF

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 02:58 , Rick McGowan wrote: > This line that Everson quoted from Kogai made my ears prick up: > >> ... if Tengwar be added BEFORE Ciao-Ciao's poetries and >> Man-Yo-Shu become encodable in Unicode. > > Huh? Can you send us a list of the precise characters lacking in

RE: Missing values in mapping-tables?

2002-03-15 Thread Lars Kristan
> > In fact, there are no characters defined in ISO 8859-8 for those code > points. If you encounter 0xBF in text that purports to be ISO 8859-8, > it is an error. > Another example showing that it would be very useful to have 128 (possibly 256) codepoints that would be reserved for such purpos

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Dan Kogai
On Saturday, March 16, 2002, at 03:45 , <[EMAIL PROTECTED]> wrote: > : > I confess I enjoyed this thread of whether Tengwar should be > include > : > in Unicode. It's fun. It's cute. But isn't this too much for > those > : > who accepted the compromise for UNIcode? Tengwar should wait til

Browser support

2002-03-15 Thread Magda Danish (Unicode)
-Original Message- Date/Time:Fri Mar 15 12:22:10 EST 2002 Contact: [EMAIL PROTECTED] Report Type: General question Text of the report is appended below: -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- I am trying to establish text style guides for new web site and wish to

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread John H. Jenkins
On Friday, March 15, 2002, at 10:29 AM, David Starner wrote: > In any case, we aren't talking live languages here. The set of > Chinese and Japanese ideographs in modern use have been encoded. The > characters left are obscure historical ones. > > Would it were so. There are still characters mi

Re: Collation - last character?

2002-03-15 Thread Kenneth Whistler
Lars Kristan asked: > Is there a character (codepoint), that is guaranteed to be sorted (collated) > after all other codepoints? > > Like: > > _WantThisOneOnTop > Able > Baker > NoMatterWhat > ^WantThisOneOnBottom > ^^and_so_on > > Where _ is the underscore, which is usually collated 'quite hi

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread John H. Jenkins
On Friday, March 15, 2002, at 09:15 AM, Dan Kogai wrote: > You may say this can be resolved by regarding each Kanji not as a > character but a word (lexically speaking this does make sense) then use > some sort of ligature to represent one. That way you can reduce the > number of code poin

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Michael Everson
At 02:15 +0900 2002-03-16, Dan Kogai wrote: >I confess I enjoyed this thread of whether Tengwar should be include >in Unicode. It's fun. It's cute. But isn't this too much for >those who accepted the compromise for UNIcode? Tengwar should wait >till more critical issues are resolved. Man

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread David Starner
On Sat, Mar 16, 2002 at 02:15:22AM +0900, Dan Kogai wrote: > How will Unicode cope with this kind of dynamically > changing character set? Everything I've read says that the vast majority of Kanji are from a limited set. Just like math characters, characters will have to be added every so often.

Collation - last character?

2002-03-15 Thread Lars Kristan
Is there a character (codepoint), that is guaranteed to be sorted (collated) after all other codepoints? Like: _WantThisOneOnTop Able Baker NoMatterWhat ^WantThisOneOnBottom ^^and_so_on Where _ is the underscore, which is usually collated 'quite high'. And ^ is the hipothetical character I am

Re: Collation - last character?

2002-03-15 Thread Michael \(michka\) Kaplan
Since collation depends on the language and not the code point or encoding or anything else, there is no absolute last character that would be the last character in every possible collation? MichKa Michael Kaplan Trigeminal Software, Inc. -- http://www.trigeminal.com/ - Original Message -

RE: processing numeric strings

2002-03-15 Thread Rick Cameron
There are two quite separate issues here: 1) How can one format a numeric value as a string in the localised representation? 2) How can one parse a string in the localised representation into a numeric value? I believe that the standard C/C++ library provides no support whatever for either oper

Re: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Rick McGowan
This line that Everson quoted from Kogai made my ears prick up: > ... if Tengwar be added BEFORE Ciao-Ciao's poetries and > Man-Yo-Shu become encodable in Unicode. Huh? Can you send us a list of the precise characters lacking in Unicode 3.2 to encode the Man-Yo-Shu? That's an important work, a

Re: Synthetic scripts (was: Re: Private Use Agreements and Unappr oved Characters)

2002-03-15 Thread Dan Kogai
On Friday, March 15, 2002, at 08:48 , Marco Cimarosti wrote: > O, no! At least one of them has a (super)natural origin: CJK ideographs > came > carved on the shell of a gigantic turtle which appeared in dream to Cang > Jie. :-) That reminds me of a fact that Hanzi (or Kanji in Japanese) is eq

processing numeric strings

2002-03-15 Thread Peter_Constable
I've got a question I asked about on a couple of other lists, but didn't get much response, so I thought I'd try here. One of our developers has asked me for input on a certain problem: "Do I need to be able to work with numbers represented using digits/numbering systems other than the Europea

Re: Regd- ISCII to Unicode Converter!

2002-03-15 Thread Michael \(michka\) Kaplan
Actually, the code pages in the 5 are not simply table based, they are algorithm-based. MichKa Michael Kaplan Trigeminal Software, Inc. -- http://www.trigeminal.com/ - Original Message - From: "Doug Ewell" <[EMAIL PROTECTED]> To: <[EMAIL PROTECTED]> Cc: "Michael (michka) Kaplan" <

Re: Regd- ISCII to Unicode Converter!

2002-03-15 Thread Doug Ewell
Michael (michka) Kaplan <[EMAIL PROTECTED]> wrote: > Not sure which converter you are referring to? Windows 2000 and Windows XP > support code pages to do the conversion via MultiByteToWideChar: > > ISCII Assamese 57006 > ISCII Bengali 57003 > ISCII Devanagari 57002 > ISCII Gujarathi 57010 > ISCI

Re: Regd- ISCII to Unicode Converter!

2002-03-15 Thread Michael \(michka\) Kaplan
Not sure which converter you are referring to? Windows 2000 and Windows XP support code pages to do the conversion via MultiByteToWideChar: ISCII Assamese 57006 ISCII Bengali 57003 ISCII Devanagari 57002 ISCII Gujarathi 57010 ISCII Kannada 57008 ISCII Malayalam 57009 ISCII Oriya 57007 ISCII Panja

Regd- ISCII to Unicode Converter!

2002-03-15 Thread Rajesh Chandrakar
Dear All, Sorry to say that I lost the web address some one posted on forum concerning to ISCII to Unicode Conversion. It would be highly appreciated, if some one provides me. I wanted to check the conversion, how far it works? regards, rajesh

Code Page 850 (Multilingual) Fix for MS Windows NT 4.0

2002-03-15 Thread Toor, Indie
I encountered a problem with the Code Page settings for MS Windows NT 4.0 within MS-DOS. The Code Page was set to 437 and this unabled me to use certain ALT codes, e.g. ALT-0248 which should show 'ø', but instead it showed 'o'. This is due to the default Code Page set for MS Windows NT 4. The re

Arabic typography

2002-03-15 Thread Michael Everson
Reposted from the TYPO-L list - Chairman of the Board of Directors- Society of Typographic Aficionados http://www.typesociety.org --- TypeCon2002 July 12-14 Toronto http://www.typecon2002.com -- Dear Sirs, Allow me to introduce myself: My name i

Re: Encoding Elvish scripts may be the road to Mordor?

2002-03-15 Thread Michael Everson
At 13:08 -0800 2002-03-14, Michael \(michka\) Kaplan wrote: >My point was that the connotation of mystical, imaginary things, things not >of the real world, is inescapeable. Even *in* The Lord of th Rings, Sam and >Boromir are both shown as examples of Hobbits and Men who know little of >elves an

RE: Synthetic scripts (was: Re: Private Use Agreements and Unapproved Characters)

2002-03-15 Thread Marco Cimarosti
Doug Ewell wrote: > Sampo has just articulated my favorite argument about so-called > "artificial" scripts. All writing systems are created by man; they do > not occur in nature, like mountains and trees and cats, [...] O, no! At least one of them has a (super)natural origin: CJK ideographs came

XSLT to convert data

2002-03-15 Thread Florian Boehme
Hi there, is anyone able to tell me, if it is possible to use XSLT to convert data from one XML schema, to another one, e.g. ebXML to cXML? A reply is greatly appreciated. Thank you! Kind regards, Florian Mit freundlichen Grüßen, Florian Boehme Consultant / Berater SourcingContent GmbH K