See
http://www.hclrss.demon.co.uk/unicode/fonts.html
which goes through each subrange and lists (free or almost free) fonts that
provide support.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Nat Langs" <[EMAIL PROTECTED]>
To
ISCII is not used on the web, mostly. However, some pages are in Unicode.
Refer to
http://www.trigeminal.com/index.asp?1081
for one example.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Tom Emerson" <[EMAIL PROTECTED]>
To: "
;Antoine Leca" <[EMAIL PROTECTED]>
> To: "Unicode List" <[EMAIL PROTECTED]>
> Sent: Tuesday, October 31, 2000 07:49
> Subject: Re: Fonts that support the ORNL rendering of Tamil?
>
>
> > Michael (michka) Kaplan wrote:
> > >
> > >
> Monotype have the glyph in its repertoire:
> http://www.monotype.com/non-latin/wt_glyphs/gr_tamil.html>
>
Yep -- but they do not seem to use it any of their fonts!
> The bottom line of the discussion on this subject some weeks ago was that
> it should be available as an option, so that the stan
It is more than reasonable to assume he is referring to scripts/subranges.
He actually was answered, mostly with questions for more information. He has
not yet responded, unless it was privately?
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Messa
Well, it does if you run it and create a big table of info, such as:
http://www.trigeminal.com/samples/localeinfo.asp
doesn't it? :-)
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>;
Replace "tusk" with "trunk" :-)
> Since Tamil has been encoded in Unicode, the ORNL rendering (which is
> described in 9.6.5 of TUS 3.0) has been the only one "officially"
described,
> although most fonts, from Arial Unicode MS to Latha and others seem to not
> support it. I just tried Code2000
Most of this happens to be in the Windows NLS database. See GetLocaleInfo in
MSDN for details:
http://msdn.microsoft.com/library/psdk/winbase/nls_34rz.htm
Or more specifically, LCTypes like LOCALE_SGROUPING for this function,
listed at
http://msdn.microsoft.com/library/psdk/winbase/nls_8rse.htm
Since Tamil has been encoded in Unicode, the ORNL rendering (which is
described in 9.6.5 of TUS 3.0) has been the only one "officially" described,
although most fonts, from Arial Unicode MS to Latha and others seem to not
support it. I just tried Code2000 and several other fonts suggested to me,
t
Hello Shawn,
Only if you are willing to completely ignore Kanji (translation: NO, it
cannot). It would be roughly equivalent from a "size" perspective of asking
whether English could be represented if you skipped all the letters C
through X. :-(
michka
a new book on internationalization in VB a
well, most of the post is opinion but I agree with a lot of it so that's
okay. :-)
> This came with Win 95. Unfortunately MS was competing with IBM's
> OS/2 at the time and wanted good performance on a 4MB system. While they
> use Unicode internally they dropped Unicode support from the user AP
Actually, for the most part the Win9x code base has more in common with its
16-bit cousins than its Unicode brothers NT and Win2000. It is very
non-Unicode in many cases very 16-bit based, unfortunately.
Think of the Win9x 32-bitness being a bit like painting your house. It may
look pretty but it
ROTECTED]>
To: "Michael (michka) Kaplan" <[EMAIL PROTECTED]>
Sent: Tuesday, October 24, 2000 12:10 AM
Subject: Re: Help
> Dear Michael
>
> I send you a demo of this problem with this mail.The default.htm has a
> simple search form.
> It passes a farsi stri
Can you give some more detail, such as where you are trying to do this from
(i.e. from ASP, JSP, etc.) and maybe some of the script you are using? Is
the default system locale of the web server Arabic?
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original
When they say Windows 95, they actually mean ALL versions of Win9x... Win95,
Win98, and Millenium.
If you want to be using the Unicode tools, you must be using a Unicode
operating system... NT4 or Windows 2000.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
-
When they say Windows 95, they actually mean ALL versions of Win9x... Win95,
Win98, and Millenium.
If you want to be using the Unicode tools, you must be using a Unicode
operating system... NT4 or Windows 2000.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
-
Even in Windows 2000 where notepad support UTF-8 and wil try to auto-detect
it, BOM-less files will not be assumed to be UTF-8 if there is no reason why
it cannot be represented as a non-Unicode text file using the default system
code page.
If notepad was being used, and it was saved with a BOM,
t;Ted Peck" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Cc: "'Michael (michka) Kaplan'" <[EMAIL PROTECTED]>
Sent: Friday, October 13, 2000 9:32 AM
Subject: RE: A binary that runs on Win9X and WinNT
> Pardon the delay in my respo
Thats okay. Even if it could be. most Klingon speakers would not understand
it anyway. :-)
michka
Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/
- Original Message -
From: "Michael Everson" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Friday, Oc
Markus,
I assume that Chris was referring to the fact that there were not yet
surrogate pairs (language tags notwithstanding) that were defined until
Boston and Athens.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Markus Sche
I would sooner rip my lungs out then speak ill of a much needed technology,
but one drawback to Cheops is that *all* Unicode Win32 API calls go through
Cheops, even when you are running on NT4 or Windows 2000. At least that was
true in the version they were promoting at the last Unicode conferenc
From: "Chris Pratley" <[EMAIL PROTECTED]>
> 3. The application teams are not in the business of Win32 developer
support
> (that is what Windows does)
Well, its actually the Platform SDK, which does draw a lot from Windows. But
developer tools are another division, entirely (per the new structure
on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Erik van der Poel" <[EMAIL PROTECTED]>
To: "Michael (michka) Kaplan" <[EMAIL PROTECTED]>
Cc: "Unicode List" <[EMAIL PROTECTED]>
Sent: Wednesday, October
From: "Lars Marius Garshol" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Wednesday, October 11, 2000 1:19 PM
Subject: Re: A binary that runs on Win9X and WinNT
> | Microsoft has a white paper on writing a Unicode app for all
> | platforms, see the articles section on
> | htt
Microsoft has a white paper on writing a Unicode app for all platforms, see
the articles section on http://www.microsoft.com/globaldev/ for details.
Note that applications such as Word, Excel, and Access in Office 2000 use
such a model.
michka
a new book on internationalization in VB at
http://
You can look at http://www.trigeminal.com/ which has a few pages in Arabic,
Farsi, Hebrew, and English. Arabic pages are at:
http://www.trigeminal.com/frmrpt2dap.html?1025
http://www.trigeminal.com/frmrpt2dap_readme.htm?1025
At a minimum, the language list comes from a database; other pages have
UTF-8 supports everything that UTF-16 does. They are two different encodings
of the same set of code points.
michka
- Original Message -
From: "George Zeigler" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Thursday, October 05, 2000 3:15 PM
Subject: UTF-8 and UTF-16
And of course just so people who are not familar with the PSDK can know what
this means the Platform SDK is updated four times a year. So the "next
release" does not refer to the 1-2 year cycle of a typical product. It comes
out at least four times a year (more often with things like VS releas
There are three "settings" for a page that you care about on the ASP pages.
Once the page is compiled, no matter how it is saved, it will be using
UCS-2/UTF-16. Getting in and out of there is the tricky part.
1) The @CODEPAGE directive, which can be put at the top of the page, is the
code page to
The assumption here is that the function will be run on Unicode text.
Therefore, the various industrial and other code pages are irrelevant.
Microsoft does not convert the characters it has in the control code range
to those same code points in Unicode, does it? Indeed, a MultiByteToWideChar
call
Well, I actually give two workarounds for getting UTF-8 working on IIS 4.0:
1) Install the Option Pack after the Service Pack (circumvents the Microsoft
"turn off the evil UTF-8" fix!)
2) Convert it yourself and then use Response.BinaryWrite (the Microsoft
method that says "leave my @*^$ string
://www.i18nWithVB.com/
- Original Message -
From: "Sandeep Krishna" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Cc: "Jianping Yang" <[EMAIL PROTECTED]>; "Simon K. Wong"
<[EMAIL PROTECTED]>; "Bob Verbrugge" <[EMAIL PR
Sandeep,
No, that is not what you need to do at all. How data is sent down to the
client from the server is entirely dependent on what you are doing on the
server.
For example, if you are using a Windows 2000 server, you can set the
Session.CodePage property to 932 for Japanese, 949 for Korean,
And of course, if you are explicitly setting fonts on the page in CSS, you
do not have to worry as much about people seeing the pages, anyway. I know
this is generally considered somewhat evil by some web designer types, but
you can put in several choices and reasonably hope to choose at least one
From: "Roozbeh Pournader" <[EMAIL PROTECTED]>
> On Tue, 3 Oct 2000, Michael (michka) Kaplan wrote:
>
> > But I will forward this all on and see if maybe some understanding of
> > justifications can be reached. Outright agreement or conversion of
opinio
nader" <[EMAIL PROTECTED]>
To: "Michael (michka) Kaplan" <[EMAIL PROTECTED]>
Cc: "Unicode List" <[EMAIL PROTECTED]>
Sent: Tuesday, October 03, 2000 8:08 AM
Subject: Re: lag time in Unicode implementations in OS, etc?
>
>
> On Tue, 3 Oct 2000, Mich
From: "Mark Davis" <[EMAIL PROTECTED]>
> If there are specific areas where the BIDI algorithm has flaws, that
should
> be communicated to the UTC bidi subcommittee, ideally with a proposal to
fix
> the problem.
My understanding is that this was done? Before my time at UTC meetings, mind
you, so
For people who want the source to this page, feel free to e-mail me.
michka
Michael Kaplan
Trigeminal Software, Inc.
http://www.trigeminal.com/
> You can see the Mozilla behavior by going to the following URL:
>
> http://64.38.165.18/FarsiVsArabic.asp
>
> This takes two Arabic locales and one
From: "Roozbeh Pournader" <[EMAIL PROTECTED]>
> On Mon, 2 Oct 2000, Michael (michka) Kaplan wrote:
>
> > The slash character "/" is in the Unicode Bidi algorithm classified as
> > a "European Separator" which means that text would
> > be
No, its not that at all. It is just that many products have a long history
of connection with the people who use the product who also happen to have a
bidirectional language as their native one. Many other products have a
development team with that expertise.
One example can be found in Mozilla (
Hello Rami,
I am not sure I understand your question. If the phone uses the Unicode
standard, then you do not need the hex codes for Unicode characters; you
simply need the pages in question (HTML, XML, or otherwise) to be in a
Unicode encoding such as UTF-8.
Having a file in such an encoding is
ome out earlier in 1999 it would have been
> difficult to implement late in the development cycle unless the changes
were
> minor.
>
> Carl
>
> -Original Message-
> From: Michael (michka) Kaplan [mailto:[EMAIL PROTECTED]]
> Sent: Monday, October 02, 2000 10:15 AM
I do not think Excel 2000 will handle these files properly. The Jet 4.0 text
IISAM will, for what its worth.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Peck, Jon" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Se
There is no central way to do it in all applications, no. In fact, most do
not support a way to do it per application. :-(
It gets better or worse in Windows 2000 (depending on your point of view) as
it grabs whole clusters when you work per letter, thus Tamil TTA + I and
other ligatures are sele
Windows NT's latest version, Windows 2000, does not support Unicode 3.0.
There are many scripts for which no keyboards exist, and which do not even
have fonts or shaping rules for rendering.
When it comes out is a generic question, so I will give you a generic
answer: when they get the work done!
From: "George Zeigler" <[EMAIL PROTECTED]>
> When? 5 years from now? As for using Unicode without realizing it,
what
> do you mean? If a Russian's browser is set to CP1251, what happens if the
site
> is in Unicode? At present he gets garbage. I've tried the setting that
> automatically c
Hello Sandeep,
One important issue you have to account for there: if you are using ASP and
your script is writing to a database, then UTF-8 is not involved in your
script. ASP *code* is always in UCS-2/UTF-16 encoding. Period. If you set
the @CODEPAGE/Session.CodePage to 65001 it will affect data
From: <[EMAIL PROTECTED]>
> That's a good question. Internet Explorer 5 is relatively advanced in the
> area of handling different encodings. It seems to honor the encoding
> ("charset") as advertized in HTTP headers, and it seems to try to make
> an educated guess (based on the actual content of
- Original Message -
From: "Yung-Fong Tang" <[EMAIL PROTECTED]>
> Does Netscape6 on Win2K have this problem ? If so, can you put together
> a test page for us? We know there are problem when we try to select the
> conjuncts. However, since we use TextOutW, in theory the TextOutW should
>
Sandeep,
Can you explain exactly what you are doing to get the data from ASP into the
Oracle database? Perhaps post the ASP code? Like most scriptoing languages,
VBScript and JScript both support UCS-2, and it is really usually the Oracle
ODBC or OLE DB driver that has the job of converting the t
From: "Chris Pratley" <[EMAIL PROTECTED]>
> For whatever reason, it is pretty hard to obtain the
> second [South Asia] one, since it sells only in
> Thailand, India, and neighboring countries.
I have been able to arrange getting it to specific customers who ask through
their own premier suppor
Correct, but SA Word 2000 uses a slightly different dll to do the job (it
sits in saext.dll). I was told that it ships in all versions for Word 10 by
a usually reliable source who might be able to chip in with a more thorough
explanation of what this dll does for Word that goes beyond the MS
Searc
Actually, I have been running the South Asia version of Word 2000 for some
time now, and have no problems at all with Thai word breaking and other
issues (I originally installed it for the Vietnamese "VNI" conversion
tools). This worked well both on US NT4 and on US Windows 2000.
The Thai Word br
Hello Magnus,
The problem here is that VB's forms package and most of its controls are not
Unicode enabled. Thus anything not on the default system code page will be
converted to a question mark.
The only real workaround for this is a Unicode control, such as the MS Forms
controls (which cannot
From: "Elaine Keown" <[EMAIL PROTECTED]>
> I'm interested in using the more recent Unicode Hebrew versions on Web
sites. These versions have about 30 more symbols for Hebrew Bible text than
the original Unicode from the early 90s.
>
> But the UTF-8 versions I found on the Web only seem to have t
From: "James Kass" <[EMAIL PROTECTED]>
> > IE 5.5 support all of the Unicode Indian scripts.
> > I just tried it on a couple of Devanagari sites
> > because the English Windows comes with mangal
> > true type font.
>
> May we see links to some of those pages?
Here are a few such pages:
http://w
This is due to the fact that there is simply no font linking info in either
IE or Windows 2000 to handle certain scripts. If you explicitly use
particular fonts (say Slyfaen or Code 2000) then it works very well.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
From: "Antoine Leca" <[EMAIL PROTECTED]>
> The argument is not very strong, since ISO/IEC 10646 is the preferred code
page
> to use for 15897; and furthermore the fact that it is code page based
allow
> to write e.g. a Klingon locale, which may be quite difficult to do if
Unicode-
> based (one ha
It is not that simple... what if someone else registers the domain that uses
the common orthographic variants?
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Cc: <[EMAIL
Not just there, but certainly for languages such as Turkish that have
different casing rules for the letter "I" (lots of other languages, like
Latin Azeri). Without a fuzzy searching logic, some strings would never be
found!
AFAIK these are not forbidden characters, but since I and i cannot alway
Of course, C++ has case sensitive identifiers, so this argument does not
necessarily make as much sense
FWIW I agree with them on the cantillation marks... who is going to be
chanting their code during code reviews? :-)
If one is case sensitive, then those problems go away, but then fuzzy
se
A noble thought, Mike. But how exactly would you suggest legislating the
feeling of what is important in the minds of others?
My overall impression is that people ask here because they are looking for
the slant that they would get from this group. And lets face it... if there
were not other local
From: "John Cowan" <[EMAIL PROTECTED]>
> > Besides I can not
> > take any standard that implements i-klingon as a human language too
> > seriously.
>
> Why not? Human beings speak it (some more fluently than others), and
> write texts in it. Just follow the links from www.kli.org. It is not
>
Well, to cover THAT level of variation, there is only the Ethnologue that I
have ever seen. But the specific question was about language differences
that ISO *can* cover.
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
- Original Message -
From: "Carl W. Br
Most seem to be okay with the addition of the country/region tag from
ISO-3166 for determing the difference between languages spoken in several
places -- this is usually what is done for English, Arabic, Portuguese,
French, and Chinese, as well.
Under Windows, they just tack on a new sublanguage
From: "John Cowan" <[EMAIL PROTECTED]>
> On Sat, 16 Sep 2000, Doug Ewell wrote:
> > SRC is the code for 'Bosnian', 'Croatian', and 'Serbo-Croatian', which
> > means that there is a many-to-one mapping from ISO 639-1 'bs', 'hr',
> > 'sr' to Ethnologue 'SRC'. This is likely to cause much more wides
rom: "Gary Deleel" <[EMAIL PROTECTED]>
To: "'Michael (michka) Kaplan'" <[EMAIL PROTECTED]>
Sent: Thursday, September 14, 2000 6:17 AM
Subject: RE: compliance with sql
> Michka,
>
> Well thank you for your reply. I guess I did forget to spec
Understandable... but you might want to start by taking smaller bites, such
as one major language platform at a time? :-)
I'll start with the one I know more about... Visual Basic. There is really
not much to think about, since the language tries to hide the details from
you. The strings you work
From: "Antoine Leca" <[EMAIL PROTECTED]>
Subject: Re: Tamil glyphs
> We agree this is an area where we really need some light, and a firmer
guide
> of implementation from the Unicode consortium. What is the way to request
> a more strong rule of interpretation?
I will be submitting something fo
Gary,
Do you mean Sybase SQL Server or Microsofy SQL Server? And in either case,
what version do you mean?
SQL Server is a product that spans many versions and two different
companies, and the answer would be different for each of them. Further
detail could allow someone to be helpful here. :-)
FWIW, This is indeed the collation supported by LCID 0x0C0A (3082) under
Windows, names "Spanish - Modern" under most versions of Windows and
"Spanish - International" under Windows 2000.
The other collation is "Spanish - Traditional" and its LCID is 0x040A
(1034).
All functions under Windows th
Well, there are two collations... the "traditional" one and the "modern" or
"international" one. The primary difference between them being that just
about all of the exceptions in the traditional one have been elminated in
the modern one.
This was not a recfent change though, it was several years
Hmmm, I hope this is tongue in cheek, the math flashbacks are scary here!
We could have some fun in the future with "hostile characters", "psychotic
characters", "apathetic characters", and other variations. :-)
michka
a new book on internationalization in VB at
http://www.i18nWithVB.com/
You have stated that you are using FoxPro, but not what data access method
you are using. FoxPro is not Unicode, but it does have a code page setting
that has a table level granularity, and most data access methods such as
ODBC >=3.7 respect the setting in conversions to Unicode.
Also, what metho
Actually, characters would never be *required* to equal bytes in non-Unicode
situations take DBCS, for example. UTF-8 becomes another multibyte
encoding.
It is *easier* and sometimes required for things like buffer size to pass a
count of bytes. But there are also times that you might care ab
From: <[EMAIL PROTECTED]>
Actually, this is an interesting possibility... I was under the impression
that ZWNJ and ZWJ did not allow the usual vowel reordering. If they do, then
I suppose it is just a matter of finding a suggested standard for font
designers to use for the support of the two forms
From: "Antoine Leca" <[EMAIL PROTECTED]>
> Sorry, what is "the other form"? As I see things, in Tamil Nadu the
current
> use is write NNAI exactly the same as, for example, KAI (that is, without
> the "elephant-trunk" form that TUS appears to require).
There are two forms, with and without the "
pport UTF-8 but they chose not to since they had that LCID-based model.
:-(
michka
- Original Message -
From: "Mikko Lahti" <[EMAIL PROTECTED]>
To: "Michael (michka) Kaplan" <[EMAIL PROTECTED]>; "Unicode List"
<[EMAIL PROTECTED]>
Sent: Th
I do not think there is one, actually.
Are you looking to convert files or strings you send via the command line?
michka
- Original Message -
From: "Mikko Lahti" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Thursday, September 07, 2000 6:31 PM
Subject: Win32: Comman
NS 4.x is simply not very good at this sort of thing. The only real solution
is to use an encoding that will support the characters, such as UTF-8.
michka
- Original Message -
From: "Gary P. Grosso" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Thursday, September 07,
ly for the purposes here. :-(
michka
- Original Message -
From: "Antoine Leca" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Thursday, September 07, 2000 2:22 AM
Subject: Re: Tamil glyphs
> Michael (michka) Kaplan wrote:
> >
&g
From: "Rick McGowan" <[EMAIL PROTECTED]>
> > However, it cannot currently be handled by Unicode. You must choose the
> > proper font to display NNA AI, NNNA AI, LA AI, or LLA AI. The Monotype
font
> > and Latha in Windows 2000 are the way that my client got both display
types.
>
> I suppose if you
Monotype font
and Latha in Windows 2000 are the way that my client got both display types.
michka
- Original Message -
From: "Rick McGowan" <[EMAIL PROTECTED]>
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Wednesday, September 06, 2000 2:39 PM
Subject: Re: Tam
From: "Rick McGowan" <[EMAIL PROTECTED]>
> > There are other characters too in the Tamil alphabet that cannot
> > be represented.
>
> These are probably ligatures; the basic alphabet is certainly represented.
Have you read the block introduction on Tamil in Unicode 3.0?
There is one thing missing
d code from a web-page must be changed from its
> UTF-8 encoding to UCS-2 for storage in SQL server? If so are there any
> converters out there?
> Can UCS-2 be used as the encoding for a web-page, or must conversion be
done
> between the two encodings.
>
> >From: "Mic
em
with
> > this - possibly it doesn't recognise the character that it should be
> > converting (Just a mad stab in the dark)
> >
> > Because UTF-8 is a sequence of bytes, does that mean that it could be
> > treated and stored as ASCII, and that the sequence wou
Subject: Re: Unicode in web pages
> I am using JSP on the server side, and am using the TomCat server.
>
>
> >From: "Michael \(michka\) Kaplan" <[EMAIL PROTECTED]>
> >Reply-To: "Michael \(michka\) Kaplan" <[EMAIL PROTECTED]>
> >To: "Steph
UTF-8 is indeed the characterset you want to use for the page encoding;
although some browsers will support UTF-16, etc., not all will.
But the real issue has to do with what technology you are using to connect
to the db. Is it ASP on the server side? Or something else? And what is the
server?
m
To: "Unicode List" <[EMAIL PROTECTED]>
Sent: Sunday, September 03, 2000 2:18 PM
Subject: RE: Same language, two locales
> Michael (michka) Kaplan wrote:
> >
> > Note that Word 2000 will let you tag text with different languages and
> > assuming you have the
For completeness sake, I will mention the Windows behavior here:
There is no way to extend locale IDs under Windows beyond what they are, but
it is possible to change the settings of the ones you choose at any time, in
order to customize them. Thus you can change things such as formats as much
as
From: "Sean O Seaghdha" <[EMAIL PROTECTED]>
> I know this is now rather off-topic, but if some kind soul would like to
> explain to me how to add the necessary NLS information to Windows NT
4/Win2000
> off list, I would be very grateful. I have figured out how to add
languages
> for proofing, so
Due to some massive confusion between Michael Kung, myself, and several
people on the SQL Server team, the incorrect statement was made that SQL
Server 2000 supports UTF-16 and is surogate aware. This is NOT true.
SQL Server 2000, like SQL Server 7.0, supports UCS-2. As such, it is
surogate safe.
And then if you look at the Windows platform, the supported languages are
assigned a locale ID, a number that is documented in Platform SDK as
containing information about language, country, and sorting information.
The locale ID can be used for many purposes, the most important being
collation.
Notice that under Windows, a GetLocaleInfo call with the 0x414 and 0x814
LCIDs for the LOCALE_ISO3166CTRYNAME will return 'NO' in both cases
(LOCALE_ISO639LANGNAME will return 'no' in both cases).
This seems like a Windows bug if there are indeed ISO3166 tags that are
different for nyorsk and bok
Other than UTF-8, generally speaking, this does not happen, since many
browsers will not support it. IE 5.01 and 5.5 seem to, for what its worth...
but not even IIS will.
For UTF-8, there are many such pages. One is
http://www.trigeminal.com/samples/provincial.html
and it has links at the botto
Actually, the Platform SDK docs are really inadequate here. Although the
wparam is listed as "Specifies the character set of the new locale" it is
impossible to get past the fact that WM_INPUTLANGCHANGEREQUEST talks about
INPUTLANGCHANGE_SYSCHARSET as meaning "The new input locale's keyboard
layou
admit
"stealing" the layout from Windows 2000, but I believe they got it from
standard Georgian usage, anyway. :-)
michka
- Original Message -----
From: "Michael (michka) Kaplan" <[EMAIL PROTECTED]>
To: "Magda Danish (Unicode)" <[EMAIL PROTECTED]>; &
Windows 2000 actually has a Georgian keyboard layout, and a font that many
people in Georgia (and also I) find more visually pleasing than Arial
Unicode MS (Sylfaen). See
http://www.microsoft.com/globaldev/keyboards/keyboards.asp
for the layout. Note that the keyboard layout and font only suppor
Yes, there is a way to extract this info using GetLocaleInfo. Pass the LCID
or langid with one of the following params:
LOCALE_IDEFAULTANSICODEPAGE
LOCALE_IDEFAULTCODEPAGE
LOCALE_IDEFAULTEBCDICCODEPAGE (win2k only)
LOCALE_IDEFAULTMACCODEPAGE
michka
Michael Kaplan
Trigeminal Software, Inc.
http:
Once again, if collation info is what you want, see
http://www.unicode.org/unicode/reports/tr10/
Beyond that, it is unclear what you are looking for, really. But if you were
to actually read and try to understand that document, I am fairly certain
that one of two things will happen:
1) You will
501 - 600 of 715 matches
Mail list logo