David Starner <[EMAIL PROTECTED]> wrote:
>> IMHO, Unicode Consotium should make these people happy before
Tolkien
>> fans and Trekkies.
>
> Why? How does the happiness of Tolkein fans and Trekkies have anything
> do with the happiness of those Japanese? Encoding characters is
largely
> a parell
Lars Kristan <[EMAIL PROTECTED]> wrote:
> Suppose ISO 8859-8 is ever upgraded (even if not likely, but - for the
sake
> of argument). One might say that it would be bad to change an existing
> definition in the table e.g. for 0xBF from 0x2DBF to 0x20AC. But how
is that
> worse from changing it fr
On Sat, Mar 16, 2002 at 07:32:19AM +0900, Dan Kogai wrote:
> On Saturday, March 16, 2002, at 06:32 , David Starner wrote:
> >So people can't get work done, and poets are silenced because of
> >politics. The very concept is something to which I object most strongly.
>
> To me such poets who give
Kenneth Whistler wrote:
> Ben Monroe wrote:
>
> > As it is a personal spelling, I never expected
> > Unicode to map a code point to this character to me.
>
> For those not following the Japanese in the UTF-8, Ben's name
> is Monryuu Ben in kanji. This is a sound-based name coinage
> for an English
On Fri, 15 Mar 2002, Kenneth Whistler wrote:
> Ben Monroe wrote:
> > As it is a personal spelling, I never expected
> > Unicode to map a code point to this character to me.
>
> For those not following the Japanese in the UTF-8, Ben's name
> is Monryuu Ben in kanji. This is a sound-based name co
On Fri, 15 Mar 2002, Jungshik Shin wrote:
> On Sat, 16 Mar 2002, Dan Kogai wrote:
> > (*) My parents wanted me to name me ÷¥ (U+5F48), a classical form, but
> > it was not listed on "the table of Kanjis allowed for names" so I was
> > named U+5F3E.
>
> Frankly speaking, I find it rather har
On Fri, 15 Mar 2002, John H. Jenkins wrote:
> On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote:
>
> > There are so many Watanabe-sans, Saito-san, and others whose name cannot
> > be spelled in Unicode.
>
> Can you document this? You know, there's a prize offered for the first
> person
Ben Monroe wrote:
> While I lived in Japan, I always wrote my last name with $BN6(B stuck
> inside of $BLg(B (in the space in the lower middle).
Of course, anyone can do that -- Han is known to be a productive system
and people have alyways played games with characters for fun and
scholar
On Saturday, March 16, 2002, at 07:27 , Kenneth Whistler wrote:
> *What* still holds true? These are just well-worn issues of itaiji
> (variant forms). The characters from the little anime exhibit of
> variants are, in Unicode:
>
> U+9AD8 / U+9AD9
> U+5516 / U+555E
> U+9593 / U+9592
>
> all varian
Ben Monroe wrote:
> As it is a personal spelling, I never expected
> Unicode to map a code point to this character to me.
For those not following the Japanese in the UTF-8, Ben's name
is Monryuu Ben in kanji. This is a sound-based name coinage
for an English name. Mon 'gate' ryuu ~ ryoo 'dragon
ICU supports ISCII, except for the font-style attributes (like "bold") which are not
expressible in plain text.
http://oss.software.ibm.com/cvs/icu/~checkout~/icu/source/data/mappings/convrtrs.txt
http://oss.software.ibm.com/icu/
ISCII is algorithmic. The mapping part to/from Unicode is fairly
On Saturday, March 16, 2002, at 06:32 , David Starner wrote:
> On Sat, Mar 16, 2002 at 05:53:04AM +0900, Dan Kogai wrote:
>> As an engineer I fully second your opinion. As a person whose name
>> once compromised by the government, I have to note it may be
>> politically incorrect to do so just
Dan Kogai continued:
>For instance,
>
> http://www.horagai.com/www/moji/int/kasiwa.htm
>
>reports that in Kashiwa, Chiba, a typical suburban city with
> population about 210,000, some 21,587 people needed character that was
> not listed in JIS.
This long interview seems to be about,
How about U+10?
It is a non-character, which gives it a high (unassigned character) weight in the UCA.
It is the highest code point = "the last character".
It cannot be a Private-Use character, so few people will be tempted to tailor it to
something other than its default UCA weight.
It als
On Saturday, March 16, 2002, at 06:08 , Kenneth Whistler wrote:
> http://www.horagai.com/www/moji/code4.htm
>
> is a rather out-of-date diatribe against Unicode, dated 1997 (but
> possibly
> touched a little since then), by Kato Koiti, a known Unicode detractor.
> It is flogging the truly dead ho
On Saturday, March 16, 2002, at 05:34 , John H. Jenkins wrote:
> Can you document this? You know, there's a prize offered for the first
> person to document the existence of someone whose Japanese name cannot
> be represented by Unicode 3.2.
Hey, who's offering the price? Would you tell me
On Saturday, March 16, 2002, at 06:00 , Jungshik Shin wrote:
>> situation. There are so many Watanabe-sans, Saito-san, and others
>> whose
>> name cannot be spelled in Unicode.
>
> and any of Japanese character sets, right?
Sadly yes. Japanese government should definitely spend more resour
On Friday, March 15, 2002, at 01:25 PM, David Starner wrote:
> On Sat, Mar 16, 2002 at 05:33:30AM +0900, Dan Kogai wrote:
>> And the whole discussion of Tengwar sounds to
>> me like Webster, Oxford, or American Heritage should add "kwijibo" into
>> their vocabulary because Bart Simpson has used
John H. Jenkins wrote:
> On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote:
>
> > There are so many Watanabe-sans, Saito-san, and others whose name cannot
> > be spelled in Unicode.
> >
>
> Can you document this? You know, there's a prize offered for the first
> person to document the exis
On 03/13/2002 01:33:36 PM "John H. Jenkins" wrote:
>> There is ICU, which handles most of the backend stuff. If you want to
>> get Apple or Microsoft to donate a renderer or fonts such that everyone
>> can use them, that would be greatly appreciated
>OTOH FreeType is, I hear, working on OpenType
At 11:13 AM 3/15/02 -0800, you wrote:
>Once again, if you want a *character* to
>correspond to that highest weight, then you have to tailor the
>table to do so. But then, of course, you could assign any character
>you want to have that highest weight value, including a private
>use character or ev
On 03/15/2002 05:15:12 AM "Florian Boehme" wrote:
>Hi there,
>
>is anyone able to tell me, if it is possible to use XSLT to convert data
>from one XML schema, to another one, e.g. ebXML to cXML?
OT for this list, but that's not the first time...
This is a two-part question:
1) XSLT is certain
>
> Can anyone with a bit of Kanji knowledge check this page:
>
> http://etext.lib.virginia.edu/japanese/manyoshu/AnoMany.unavailable.html
>
> and tell whether all of the "missing" Kanji are in Unicode 3.2? A bunch
> of them are shown as not available in the Dai Kanwa, but I'm under th
On Sat, Mar 16, 2002 at 05:33:30AM +0900, Dan Kogai wrote:
> And the whole discussion of Tengwar sounds to
> me like Webster, Oxford, or American Heritage should add "kwijibo" into
> their vocabulary because Bart Simpson has used (I am relieved to find
> there is no "Unidict" !)
Rather more li
Dan Kogai said:
>Sorry. I am just a network consultant by trade (and Just Another Perl
> Encode Hacker by accident :) and I know of these classical writing no
> more than you do. I am just repeating what those who KNOW have told me.
Or those who *CLAIM* to know.
>If you can grok Jap
On Sat, 16 Mar 2002, Dan Kogai wrote:
>For instance, there are at least 31 (official) way to spell 'Wata' of
> Watanabe, a very popular Japanse family name. Only a couple of which is
> in JIS0208-1990, one of many charsets Unicode based upon. Well, in this
This does not necessarily m
As long as we're on the subject of Han... [EMAIL PROTECTED] wrote:
> Well, I can even give up on classical writings (after all my knowledge
> on classical writings, East or West, is too limited to discuss in
> depth). But it strikes me to face the fact that some of you can't even
> spell yo
On Friday, March 15, 2002, at 11:38 AM, Dan Kogai wrote:
> There are so many Watanabe-sans, Saito-san, and others whose name cannot
> be spelled in Unicode.
>
Can you document this? You know, there's a prize offered for the first
person to document the existence of someone whose Japanese nam
[EMAIL PROTECTED] wrote:
> I've got a question I asked about on a couple of other lists, but didn't
> get much response, so I thought I'd try here.
>
> One of our developers has asked me for input on a certain problem: "Do I
> need to be able to work with numbers represented using digits/numbe
On Saturday, March 16, 2002, at 05:12 , <[EMAIL PROTECTED]>
wrote:
> I have to hasten to add that I have no knowledge of Eastern classics,
> unfortunately. I just jumped to the conclusion that the
> above-mentioned would have been one of those cases where new Chinese
> characters were created
On Sat, Mar 16, 2002 at 04:38:16AM +0900, Dan Kogai wrote:
> Only a couple of which is
> in JIS0208-1990, one of many charsets Unicode based upon.
And what about Unicode? JIS X0208 is merely a subset of Unicode. Have
you actually looked through Unified Han and Han Extension A and B?
> IMHO,
: : > Tengwar be added BEFORE Ciao-Ciao's poetries and Man-Yo-Shu become
: : > encodable in Unicode.
: :
: : Why should a great Western author have to wait on a great Eastern author?
:
: My reaction exactly: why author A's special characters (*) would be more special
: than author B's special c
> IMHO, Unicode Consortium should make these people happy before Tolkien
> fans and Trekkies.
This is true. But unless someone formally proposes the missing characters, they
won't appear
in the Unicode. Us Tolkien fans cannot be blamed for Japanese standards
organisations, I think :-)
Kenneth Whistler wrote:
> In the Unicode Collation Algorithm (UTS #10), there is no explicit
> weight assigned corresponding to , but a primary weight
> assignment of 0x is guaranteed to be higher than that of
> any Han character.
Well, then I am proposing to introduce such a character. U+FFF
On Saturday, March 16, 2002, at 02:58 , Rick McGowan wrote:
> This line that Everson quoted from Kogai made my ears prick up:
>
>> ... if Tengwar be added BEFORE Ciao-Ciao's poetries and
>> Man-Yo-Shu become encodable in Unicode.
>
> Huh? Can you send us a list of the precise characters lacking in
>
> In fact, there are no characters defined in ISO 8859-8 for those code
> points. If you encounter 0xBF in text that purports to be ISO 8859-8,
> it is an error.
>
Another example showing that it would be very useful to have 128 (possibly
256) codepoints that would be reserved for such purpos
On Saturday, March 16, 2002, at 03:45 , <[EMAIL PROTECTED]>
wrote:
> : > I confess I enjoyed this thread of whether Tengwar should be
> include
> : > in Unicode. It's fun. It's cute. But isn't this too much for
> those
> : > who accepted the compromise for UNIcode? Tengwar should wait til
-Original Message-
Date/Time:Fri Mar 15 12:22:10 EST 2002
Contact: [EMAIL PROTECTED]
Report Type: General question
Text of the report is appended below:
-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --
I am trying to establish text style guides for new web site and wish to
On Friday, March 15, 2002, at 10:29 AM, David Starner wrote:
> In any case, we aren't talking live languages here. The set of
> Chinese and Japanese ideographs in modern use have been encoded. The
> characters left are obscure historical ones.
>
>
Would it were so. There are still characters mi
Lars Kristan asked:
> Is there a character (codepoint), that is guaranteed to be sorted (collated)
> after all other codepoints?
>
> Like:
>
> _WantThisOneOnTop
> Able
> Baker
> NoMatterWhat
> ^WantThisOneOnBottom
> ^^and_so_on
>
> Where _ is the underscore, which is usually collated 'quite hi
On Friday, March 15, 2002, at 09:15 AM, Dan Kogai wrote:
> You may say this can be resolved by regarding each Kanji not as a
> character but a word (lexically speaking this does make sense) then use
> some sort of ligature to represent one. That way you can reduce the
> number of code poin
At 02:15 +0900 2002-03-16, Dan Kogai wrote:
>I confess I enjoyed this thread of whether Tengwar should be include
>in Unicode. It's fun. It's cute. But isn't this too much for
>those who accepted the compromise for UNIcode? Tengwar should wait
>till more critical issues are resolved. Man
On Sat, Mar 16, 2002 at 02:15:22AM +0900, Dan Kogai wrote:
> How will Unicode cope with this kind of dynamically
> changing character set?
Everything I've read says that the vast majority of Kanji are from a
limited set. Just like math characters, characters will have to be added
every so often.
Is there a character (codepoint), that is guaranteed to be sorted (collated)
after all other codepoints?
Like:
_WantThisOneOnTop
Able
Baker
NoMatterWhat
^WantThisOneOnBottom
^^and_so_on
Where _ is the underscore, which is usually collated 'quite high'.
And ^ is the hipothetical character I am
Since collation depends on the language and not the code point or encoding
or anything else, there is no absolute last character that would be the last
character in every possible collation?
MichKa
Michael Kaplan
Trigeminal Software, Inc. -- http://www.trigeminal.com/
- Original Message -
There are two quite separate issues here:
1) How can one format a numeric value as a string in the localised
representation?
2) How can one parse a string in the localised representation into a numeric
value?
I believe that the standard C/C++ library provides no support whatever for
either oper
This line that Everson quoted from Kogai made my ears prick up:
> ... if Tengwar be added BEFORE Ciao-Ciao's poetries and
> Man-Yo-Shu become encodable in Unicode.
Huh? Can you send us a list of the precise characters lacking in Unicode
3.2 to encode the Man-Yo-Shu? That's an important work, a
On Friday, March 15, 2002, at 08:48 , Marco Cimarosti wrote:
> O, no! At least one of them has a (super)natural origin: CJK ideographs
> came
> carved on the shell of a gigantic turtle which appeared in dream to Cang
> Jie. :-)
That reminds me of a fact that Hanzi (or Kanji in Japanese) is
eq
I've got a question I asked about on a couple of other lists, but didn't
get much response, so I thought I'd try here.
One of our developers has asked me for input on a certain problem: "Do I
need to be able to work with numbers represented using digits/numbering
systems other than the Europea
Actually, the code pages in the 5 are not simply table based, they are
algorithm-based.
MichKa
Michael Kaplan
Trigeminal Software, Inc. -- http://www.trigeminal.com/
- Original Message -
From: "Doug Ewell" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Cc: "Michael (michka) Kaplan" <
Michael (michka) Kaplan <[EMAIL PROTECTED]> wrote:
> Not sure which converter you are referring to? Windows 2000 and
Windows XP
> support code pages to do the conversion via MultiByteToWideChar:
>
> ISCII Assamese 57006
> ISCII Bengali 57003
> ISCII Devanagari 57002
> ISCII Gujarathi 57010
> ISCI
Not sure which converter you are referring to? Windows 2000 and Windows XP
support code pages to do the conversion via MultiByteToWideChar:
ISCII Assamese 57006
ISCII Bengali 57003
ISCII Devanagari 57002
ISCII Gujarathi 57010
ISCII Kannada 57008
ISCII Malayalam 57009
ISCII Oriya 57007
ISCII Panja
Dear All,
Sorry to say that I lost the web address some one posted on forum concerning
to ISCII to Unicode Conversion. It would be highly appreciated, if some one
provides me. I wanted to check the conversion, how far it works?
regards,
rajesh
I encountered a problem with the Code Page settings for MS Windows NT 4.0
within MS-DOS.
The Code Page was set to 437 and this unabled me to use certain ALT codes,
e.g. ALT-0248 which should show 'ø',
but instead it showed 'o'.
This is due to the default Code Page set for MS Windows NT 4.
The re
Reposted from the TYPO-L list
-
Chairman of the Board of Directors- Society of Typographic Aficionados
http://www.typesociety.org
---
TypeCon2002 July 12-14 Toronto
http://www.typecon2002.com
--
Dear Sirs,
Allow me to introduce myself: My name i
At 13:08 -0800 2002-03-14, Michael \(michka\) Kaplan wrote:
>My point was that the connotation of mystical, imaginary things, things not
>of the real world, is inescapeable. Even *in* The Lord of th Rings, Sam and
>Boromir are both shown as examples of Hobbits and Men who know little of
>elves an
Doug Ewell wrote:
> Sampo has just articulated my favorite argument about so-called
> "artificial" scripts. All writing systems are created by man; they do
> not occur in nature, like mountains and trees and cats, [...]
O, no! At least one of them has a (super)natural origin: CJK ideographs came
Hi there,
is anyone able to tell me, if it is possible to use XSLT to convert data
from one XML schema, to another one, e.g. ebXML to cXML?
A reply is greatly appreciated.
Thank you!
Kind regards,
Florian
Mit freundlichen Grüßen,
Florian Boehme
Consultant / Berater
SourcingContent GmbH
K
58 matches
Mail list logo