x27;s languages digitally accessible.
What's more, when implementations support emoji features, such as ZWJ
sequences, variation selectors, etc., they are also building necessary
mechanisms for supporting complex scripts (many of which are recently
encoded or on the roadmap).
Addison P
n the industry in thinking that educational
establishments could do a better job of preparing developers with at least the
basics of Unicode, character encodings, and internationalization.
Addison Phillips
Principal SDE, I18N Architect (Amazon)
Chair (W3C I18N WG)
Internationalization is not a
I agree, although I note that sometimes the additional (redundant) specificity
of "non-7-bit-ASCII characters" is needed when talking to people unclear on
what "ASCII" means.
Addison
> -Original Message-
> From: Unicode [mailto:unicode-boun...@unico
What you might be looking for would be the CLDR project’s “exemplar sets” (see
for example [1]), which describes which characters are customarily used for a
given language and which are sometimes used. However, this is not the same
thing as statistical distribution. One of the points of Unicode
ng from the Universal Character Set as a
whole). Or at least this is how I have already read it.
Addison
> -Original Message-
> From: Unicode [mailto:unicode-boun...@unicode.org] On Behalf Of Richard
> Wordingham
> Sent: Tuesday, May 27, 2014 3:18 PM
> To: unicode@
achieve the desired goal is to use the 'margin' and 'text-align' CSS
styles.
Addison
Addison Phillips
Globalization Architect (Amazon Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
> -Original Message-
> From: U
Actually, that's my bad: I meant to type scalar value.
Stephan Stiller wrote:
On 9/15/2013 3:07 PM, Phillips, Addison wrote:
Not if the limit is counted in characters and not in bytes. Twitter, for
example, counts code points in the NFC representation of a tweet.
"character&quo
Not if the limit is counted in characters and not in bytes. Twitter, for
example, counts code points in the NFC representation of a tweet.
Doug Ewell wrote:
Andre Schappo wrote:
> U+2026 is useful for microblogs when one is looking to save characters
Not if the microblog is in UTF-8, as almos
What kind of document do you mean?
For Web formats (HTML, etc.), the answer is "no".
Addison
Addison Phillips
Globalization Architect (Amazon Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
> -Original Message-
> F
0), you'll be
pointed at "DC00", which is a trailing ("low") surrogate. The pointer needs to
"back up" (decrement) by one to position 3 (0xD800) to find the start of the
character (each of the other code units refers to a single code point).
Addison Phillips
s. There is also the problem that you're unlikely to be
running your software in a Marshallese locale (or have another such external
hint). The real issue is the need for language metadata in file formats and
other places to enable language-specific rendering to take place. Lacking
an't count on it. And far from being dead, other
normalization forms like NFD are useful for various kinds of processing.
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
> -Original
up is not available.
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
java/lang/Character.html
Or:
http://perldoc.perl.org/5.14.0/perlunicode.html#Unicode-Character-Properties
Or:
http://userguide.icu-project.org/strings/properties
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an archit
The block is actually named for it. See:
http://www.unicode.org/charts/PDF/U0080.pdf
This FAQ talks about it:
http://www.unicode.org/faq/blocks_ranges.html
Finally, p217 of the standard actually says so explicitly:
http://www.unicode.org/versions/Unicode6.2.0/ch07.pdf
Addison
ing is depends on what you're using it for. It
is no better, in any case, than an approximation. But it may still be a useful
approximation.
Your nail, tree, or jelly may vary in consistency.
Addison
ript and provide for this (as
well as other use cases).
Cf.
http://norbertlindenberg.com/2012/02/ecmascript-internationalization-api/index.html
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N WG)
Internationalization is not a feature.
It is an architecture.
That's for search analysis, not rendering.
Sent from my iPhone
On Oct 8, 2011, at 7:45 AM, "Andreas Prilop" wrote:
> On Fri, 7 Oct 2011, Gerrit wrote:
>
>> So if somebody from Google reads this,
>> [...]
>> Additionally, if the standard Android web browser could then
>> use the html “lang” ta
ntation ("code units"), rather than using non-specific terminology such
as "character".
If you want to use UTF-32, which uses 32-bit code units, one per code point,
you can use a 32-bit data type instead. Those are always 4 bytes wide.
Addison
Addison Phillips
Globalization Arc
with cases such as this. See UAX#15 for more details.
Regards,
Addison
Addison Phillips
Globalization Architect (Lab126)
Chair (W3C I18N, IETF IRI WGs)
Internationalization is not a feature.
It is an architecture.
> -Original Message-
> From: unicode-bou...@unicode.org [mailto:u
Following draft-langtags (and CLDR usage), it would be "tl-Tglg-PH"
Addison
Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com
Chair, W3C Internationalization Working Group
http://www.w3.org/International
Internationalization is an architecture.
I
ady written
(please excuse my sarcasm, which isn't directed at you).
PS> Flarn isn't the reference I think it is, is it?
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com
Chair, W3C Internationalization Working Group
http://
mark? Then list members might be able to comments with precision on your
request.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com
Chair, W3C Internationalization Working Group
http://www.w3.org/International
Internationalization is an
ding contains about 9,000 characters. Unicode
(and thus its encoding UTF-8) contains about 93,000 characters. There
will not be a mapping to SJIS for every Unicode character.
Hope that helps.
Best Regards,
Addison
Addison P. PhillipsDirector, Globalization
Architecturehttp://www.webMethods.
: RE: My Querry
(B > From: [EMAIL PROTECTED] > [mailto:[EMAIL PROTECTED]]
(B On Behalf Of Addison Phillips [wM] > Sent: Tuesday,
(B November 23, 2004 9:14 AM
(B > One of the nice things about UTF-8 is that the ASCII
(B bytes > from 0 to 7F hex (including the C0 control
(B
to 7F hex. That is, amoung other things
UTF-8 was designed specifically to be compatible with C language strings.
Addison
Addison P. Phillips
Director, Globalization Architecture
http://www.webMethods.com
Chair, W3C Internationalization Working Group
http://www.w3.org/International
website
http://www.inter-locale.com. I can't remember, but I think this one was a substitute
paper at IUC22, so it probably isn't in the program proceedings.
Hope this helps,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
h
sidered for the Working
Group charters. Feedback, suggestions and comments are welcome (please use the [EMAIL
PROTECTED] list for public feedback). Member-only or team-only comments should be
directed to Martin DÃrst or Richard Ishida.
Best Regards,
Addison Phillips, Chair W3C-I18N-WG
Martin DÃr
the average byte count
per character. If you are worried about storage (disk, database, etc.), then the
specifics of your situation will determine what the "right answer" may be for
you.
Best Regards,
Addison
Addison P. PhillipsDirector, Globalization
Architectur
instructions in my little document "Learn to Type Japanese and Other
Languages" here:
http://www.inter-locale.com/whitepaper/LearnToType.pdf
(Acrobat PDF format)
Addison P. PhillipsDirector, Globalization
ArchitecturewebMethods | Delivering Global Business Visibilityhttp://www.webMethods
f you want to find other
fonts, you can use the Character Map utility to explore the character ranges
supplied by various fonts on your system.
Best Regards,
Addison
Addison P. PhillipsDirector, Globalization
ArchitecturewebMethods | Delivering Global Business Visibilit
umber should go first, IMO.
But if the file is in some other format, that's fine, so long as the format is stable.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
http://www.webMethods.com
Chair, W3C Intern
means correct errors. Spelling or nomenclatural (non-substantive) changes in
the descriptions are errata. But I view changes, additions, and deletions to/from the
data tables as changes to the standard and they should, in my opinion, be treated as
such even if they are only to correct errors.
Be
quot;).
So, there are very valid reasons why applications need to transfer locale preferences.
Check out our group's document (and the forthcoming requirements document) and see if
you don't agree... but we should be wary of very broad global statements (both "all
language tags are also
is affected by your runtime
locale). Writing out files as UTF-8 (instead of System.out.println()) will
let you use the native2ascii utility or other programs to investigate the
actual codepoints you are retrieving.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webM
the documentation).
You might want to be aware of the W3C's Internationalization mailing list
(See http://lists.w3.org/Archives/Public/www-international/) and
of the FAQs at http://www.w3.org/International/geo
(alas, the FAQ on this topic hasn't been published yet!)
Best Regards,
..", but it helps to look in the
right places :-).
Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
http://www.webMethods.com
Chair, W3C Internationalization (I18N) Working Group
Chair, W3C-I18N-WG, Web Services Tas
thread "over there" from escaping back to the Unicode list. I HAVE posted a response
to your message to you privately, copy that list.
Thanks again for the comments.
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visib
Regards,
Addison
Addison P. PhillipsDirector, Globalization
ArchitecturewebMethods | Delivering Global Business Visibilityhttp://www.webMethods.comChair, W3C Internationalization
(I18N) Working GroupChair, W3C-I18N-WG, Web Services Task Forcehttp://www.w3.org/InternationalInternationalization
.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
http://www.webMethods.com
Chair, W3C Internationalization (I18N) Working Group
Chair, W3C-I18N-WG, Web Services Task Force
http://www.w3.org/International
Internation
sections
> that deal with UTF-16.
>
I am under the impression that they mean combining sequences.
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
http://www.webMethods.com
Chair, W3C Internationalization (I18N) Working Group
Ch
Pub date = 1 April 2003.
I think that's the salient part.
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
http://www.webMethods.com
Chair, W3C Internationalization (I18N) Working Group
Chair, W3C-I18N-WG, Web Services Task Force
Maybe I should have said it the other way first: yuck. I don't want any
characters beyond the planes ever. Nor a change in encoding rules.
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
432 Lakeside Drive, Sunnyvale, CA, U
27;t all have to muck with our implementations to support this
stuff.
Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
432 Lakeside Drive, Sunnyvale, CA, USA
+1 408.962.5487 (office) +1 408.210.3569 (mobile)
mailto:[EMA
I'm looking for an authoritative mapping table of this old VMS encoding to Unicode and
can't find one anywhere. Anyone got one handy?
Thanks in advance!
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods | Delivering Global Business Visibility
432 Lake
cessing the formatting
fucctions of your programming API correctly. Most programming
environments provide a way to format a value using separate locale rules
(for grouping and decimal separators) and currency.
More information about what you're trying to do would help in
recommending a solu
ny Asian fonts do not contain a full set
of glyphs for these styles. You'll have to substitute the plain fonts in
order to see the characters, but they won't be bold or italic or
what-have-you.
Hope this helps.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architec
don't have them bookmarked on this
machine.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods, Inc. -- "Global Business Visibility"
432 Lakeside Drive, Sunnyvale, CA, USA
+1 408.962.5487 (office)
+1 408.210.3569 (mobile)
mailto:[EMAIL PR
paper that describes some of these issues on the webMethods
website (it's "Creating Multi-lingual and Multi-locale Databases") here:
http://www.webmethods.com/cmb/solutions/solutions_white_paper_email_entry/
Best Regards,
Addison
--
Addison P. Phillips
Director, Globalizati
at 161K. Building updated data tables is actually easier
with the old source code...
In any event, you really ought to try the newer versions of ICU4J out.
They are a lot easier to work with. And a "light" version isn't that
hard to create, if that's what you want.
Best Re
uilding updated data tables is actually easier
with the old source code...
In any event, you really ought to try the newer versions of ICU4J out.
They are a lot easier to work with. And a "light" version isn't that
hard to create, if that's what you want.
Best Regards,
Add
ds as they
evolve. You may also wish to join our public mailing list (public-i18n-ws @ w3.org).
Instructions on joining the Task Force and mailing list are located on our home page,
located here: http://www.w3.org/International/ws.
Best Regards,
Addison
Addison P. Phillips
Director, Globali
o join at any of these
levels of involvement, you should check our how public page here:
http://www.w3.org/International/ws
Best Regards,
Addison
[1] http://www.w3.org/Consortium/Member/List
Addison P. Phillips
Director, Globalization Architecture
webMethods, Inc.
eally.
In this context, UCS-2 vs. UTF-16 is unimportant.
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods, Inc.
432 Lakeside Drive
Sunnyvale, California, USA
+1 408.962.5487 (phone)
+1 408.210.3569 (mobile)
-
Internation
9.pdf
Hope that helps.
Best Regards,
Addison
Addison P. Phillips
Director, Globalization Architecture
webMethods, Inc.
432 Lakeside Drive
Sunnyvale, California, USA
+1 408.962.5487 (phone)
+1 408.210.3569 (mobile)
-
Internationalization
ot;));
If the original byte[] is actually correctly formed and you want to get UTF-8, Marco's
code is correct:
byte[] utf8bytes = myString.getBytes("UTF-8");
Note that I have omitted try/catch blocks for clarity, but the compiler will insist on
them...
Hope that helps.
Bes
Regards,
Addison
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
> Behalf Of Marco Cimarosti
> Sent: Thursday, September 12, 2002 5:12 AM
> To: '[EMAIL PROTECTED]'; '[EMAIL PROTECTED]'
> Subject: RE: Problems converting from
How about "I'll" or "it's".
Regards,
Addison
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
> Behalf Of John Cowan
> Sent: Tuesday, August 20, 2002 4:40 AM
> To: Marco Cimarosti
> Cc: '[EMA
to set up your UNIX environment to display any
Russian characters that you retrieve from the database, so you'll probably end
up setting both (again, UTF-8 locales in UNIX work nicely with Unicode encoded
databases).
Hope
that helps.
Best
Regards,
Addison
-Original Message--
haracters (hence,
this may be the example that proves the rule, although I can't think of
anything else that looks more like a click than a bang or an octothorpe ;-).
Best Regards,
Addison
languages is more a limiter here probably than
active engineering work preventing it.
Addison
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
> Behalf Of [EMAIL PROTECTED]
> Sent: Monday, July 29, 2002 3:02 PM
> To: [EMAIL PROTECTED]
>
question isn't whether something is or isn't possible, but
rather whether something is or isn't a good idea or desirable. If you've had
an orthography since 1937 based on locally available typewriters, then you
probably won't want to change. If you have NO writing tradition, you're
better off avoiding unnecessary headaches, IMO.
Addison
are locale driven. The first stop in creating a
new orthography (or computerizing an existing one, perhaps from the days of the
typewriter), for my money would probably be to get ISO-639 to issue the language a
2-letter code so you can have locale (and Unicode character database) data tagged with
it
ser-def locales, so heck...)
Addison
> -Original Message-
> From: David Starner [mailto:[EMAIL PROTECTED]]
> Sent: Saturday, July 27, 2002 12:07 AM
> To: Addison Phillips [wM]; Curtis Clark
> Cc: [EMAIL PROTECTED]
> Subject: Re: REALLY *not* Tamil - changing scripts (long
a platform with user
defined locales (approximately no platforms conveniently do this).
Obviously I'm not an expert in these linguistic areas (and hence rarely comment on
them), but it seems to me that the lack of other mechanisms makes Unicode an
attractive target for criticism in this
Hi Martin,
I install the Chinese Unicode keyboard myself...
When I confronted this specific problem recently in our products, the main
solution I adopted was to allow \u notation as input (of course, our
products are for developers...)
Hope that helps.
Addison
Addison P. Phillips
://ftp.unicode.org/Public/MAPPINGS/OBSOLETE/EASTASIA/JIS/
~Addison
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
> Behalf Of Stefan Persson
> Sent: 2002年6月15日 8:35
> To: Addison Phillips [wM]; [EMAIL PROTECTED]
> Subject: Re: Shi
ting your own. I say
"natural" because the conversions will be similar to other programs running in that
environment, providing the behavior that users expect.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 La
>
> Small correction here: MUI is not required in order to change the default
> system locale to Japanese and keep the user interface as English
> on a Win2K
> enu machine.
Not in a Java program ::sigh::. Some parts of Swing and AWT use the user locale,
especially older JREs...
Addison
>
;
your problems also, since your default windows code page will switch to 932 when you
do this. However, you might regard programs that display ??? and mojibake text as
"broken" and gently remind the manufacturer of the defects ;-).
Good luck,
Addison
Addison P. Phillips
Globaliza
and-match
fonts to see all of your characters (this is what Java tries to do and what
IE and Netscape do).
Hope that helps.
Thanks,
Addison
Addison P. Phillips
Globalization Architect
webMethods, Inc.
432 Lakeside Drive
Sunnyvale, California, USA
+1
ng for a living.
Best Regards,
Addison
> -Original Message-
> From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
> Behalf Of Roozbeh Pournader
> Sent: Tuesday, May 28, 2002 5:08 AM
> To: Unicode List; unicoRe
> Subject: To hell with Unicode ;)
>
>
>
> Quotin
Hi Lars,
Some information below...
Addison
Addison P. Phillips
Globalization Architect
webMethods, Inc.
432 Lakeside Drive
Sunnyvale, California, USA
+1 408.962.5487 (phone)
+1 408.210.3569 (mobile)
-
Internationalization is an architecture.
It
Whups. Typo.
Try http://www.inter-locale.com/IUC19.pdf (IUC is International Unicode Conference.
ICU is (IBM) International Classes for Unicode).
Addison
> -Original Message-
> From: J M Sykes [mailto:[EMAIL PROTECTED]]
> Sent: 2002年5月12日 3:29
> To: Addison Phillips
www.inter-locale.com/ICU19.pdf
that may help you a little.
Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3659 (mobile)
rrectly, since the rendering is actually the hard bit.
Somehow I suspect that a BugParade issue would get rejected as "Unicode
3.0++ not supported in JDK1.4", even if everyone on this list went and voted
for it ;-).
Regards,
Addison
Addison P. Phillips
Globalization Architect
webM
;re interested in. But GUI
apps are going to be very problematic until Swing or AWT are fixed.
Hope that helps.
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale
n only obliquely related, but which may
help you with your confusion about 16- or 8-bit flavors of Unicode in databases
(http://www.inter-locale.com/IUC19.ppt)
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Dri
The server changed to www-124.ibm.com
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3659 (mobile
n out to be a number when creating I18n domain names. The
early implementers have mostly ignored these issues and the interplay
between the ultimate standard and existing registrars should be interesting.
Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization En
mpatibility
I hope that helps.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. | The Business Integration Company
432 Lakeside Drive, Sunnyvale, California, USA
+1 408.962.5487 (p
Shift-JIS..
Note that adding a META tag to your page is a very good idea if you decide
to use UTF-8 as the encoding.
You can see that this works here:
http://www.geocities.com/apphillips2000/index.html
You will note that I included a META tag. Otherwise you have to manually
select UTF-8
t;?", and so on. But you can make things work in most cases with a little
care.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. | The Business Integration Company
432 Lakeside Drive, Sunnyvale, California, USA
+1
According to Lunde (p. 205), the range is through F9FC. There are real characters in
the range FA40 -> FC4B, at least in CP932, which may be causing you some confusion,
since these have concrete mappings to Unicode (not just a mapping in the U+E000 range).
Best Regards,
Addison
Addiso
..
One exception that I know well (I did some I18N work
for them) is AltaVista. If you go to the search page and click on "Customize
Settings", you can set the search engine to use UTF-8 as your search encoding
(both input and output). The back end is entirely Unicode.
Regards,
Addiso
Hi William,
The third case rests on how the compiler interpreted the string at compile time.
What's the encoding of your source file? What was the encoding of the locale at
compile time?
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods
8 in this case).
If you want to see the locale encoding, you will need to call
wctombs() and then iterate across the char* you get back from that.
Best Regards,
Addison
===
Addison P. Phillips Globalization Archi
Don't forget that file systems and shells have a part to play in your
command line excursions.
Hope this helps.
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3569 (mob
provide Delphi specific information.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3569 (mobile)
-
In
eyond 1.4, or about a year from now. We'll
know by then what solution has been adopted. It should be interesting.
Best Regards,
Addison
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]]On
Behalf Of Carl W. Brown
Sent: Thursday, October 04, 2001 5:37 PM
Cc: [EMAIL PROTECT
s probably font releated. If you check
out http://www.unicode.org/help/display_problems.html
you will see information on how to turn those infuriating hollow boxes into the
characters you expected to see the first time.
Addison
Addison P. PhillipsGlobalization Architect / Manager,
.Character
and java.lang.String would have to change internal representation or add
methods or something to allow surrogate pairs to be evaluated.
Addison
-Original Message-
From: Yung-Fong Tang [mailto:[EMAIL PROTECTED]]
Sent: Wednesday, October 03, 2001 4:17 PM
To: Addison Phillips [wM]
But then, it's my day to be an idiot...
Of course an int can store more than 16 bits. It's char that's defined at
0..65535 in Java. int's will work fine in the APIs. It's the chars that are
a problem.
Must be the heat. ;-)
Addison
Addison P. Phillips
Globaliz
interesting
problem. Perhaps they'll redefine an int to be a 32-bit value and switch
Java to UTF-32 (yeah, sure.)
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, C
me the ammunition to oppose its adoption in practice).
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3659 (mobile)
ewton" that
looked just like an "N", but with a different semantic meaning prior to the creation
of Unicode. And there should not be one created now, because the letter "N" contains
all of the useful information necessary for that purpose.
Best Regards,
Addison
Addison P.
Hi Sameer,
The "font list" in your JVM comes from the operating
system. The list you see is what the JVM thinks you have installed locally. To
add a font to the list, you have to install the font.
Best Regards,
Addison
Addison P. PhillipsGlobalization Architect
coding. That's "EUC-JP with characters outside EUC-JP
represented as NCRs", and our parser handles that quite well...
Addison
-Original Message-
From: Ayers, Mike [mailto:[EMAIL PROTECTED]]
Sent: Thursday, August 30, 2001 10:00 AM
To: 'Addison Phillips [wM]'
Cc:
apan, use the non-Unicode encodings.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods, Inc. 432 Lakeside Drive, Sunnyvale, CA
+1 408.962.5487 (phone) +1 408.210.3659 (mobile)
-
I
enerally considered quite rare,
but you should be aware of it as a potential objection that Asian users
might have to a pure Unicode approach.
Good luck with your implementation.
Best Regards,
Addison
Addison P. Phillips
Globalization Architect / Manager, Globalization Engineering
webMethods,
1 - 100 of 194 matches
Mail list logo