Microsoft Outlook 2003 that is part of Office 2003 (not the free Outlook
Express) is a very decent Unicode-compliant e-mail application, that also
includes a powerful organizer, PIM etc. The entire Office 2003 is simply an
excellent example of Unicode implementation: very complete, very exhausti
From: "Jörg Knappen" <[EMAIL PROTECTED]>
> I have scanned a sample of the german -burg abbreviature. It is from
> Diercke Weltatlas, 165. Auflage, Georg Westermann Verlag,
> Braunschweig 1972, card page 14.
Very interesting! It would be even more interesting if you told us the URL
so we can actua
Type font development application:
http://groups.msn.com/fontlab/tipsandtricks.msnw?action=get_message&mview=0&ID_Message=3403
I would appreciate comments on the validity of the statements I made there.
Adam Twardoch | http://www.twardoch.com/adam/
Fontlab Ltd. (scripting products & mark
year will fully support Unicode 4.0.x.
Regards,
Adam Twardoch <[EMAIL PROTECTED]>
Scripting Products and Marketing Manager
Fontlab Ltd.
From: "Mark Davis" <[EMAIL PROTECTED]>
> In one sense, the using "Dvorak" in English for "DvoÅÃk" is little
different
> than using "Cologne" in English for "KÃln". Both are transcriptions into a
> form that has become more or less customary.
If at all, "KÃln" is a German and "Cologne" is a French
> >Even "Sputnik" is better known...
Hey, "Sputnik" is actually a cool name for that sign, never heard of it and
never thought of it, but I guess I'll adapt it for my private use :)
Adam
Scientifically speaking, neither "character" nor "script" has a serious
definition in the Unicode standard. That is, there is no methodology that
can be used to determine whether something is a script, or a character,
without arbitrary decisions being involved.
Very recently at the Polish TeX Use
From: "John Hudson" <[EMAIL PROTECTED]>
> 'Careful hairsplitting' always takes place when people care about
typography.
How very true.
On one hand, there's people who put a cedilla under "a" when typesetting
Polish, on the other hand, there's people who adjust the vertical position
of hyphens whe
From: "D. Starner" <[EMAIL PROTECTED]>
> As far as I can tell, there are no documents that
> use both of these ligated th's, so the cases aren't parallel.
Is the lack of knowledge of a certain individual a sufficient criterion for
deducting absence?
Adam
> By that logic, both the proposed 0246 and your i.t.a. ligature can be
> unified with U+00F0, which is the character used in the IPA convention
> to represent a voiced th in English.
Indeed. And 01C6 could be unified with 006A because they both are used to
represent the same sound in two differen
From: "Peter Kirk" <[EMAIL PROTECTED]>
> Just wait a few years! Then at least you will care that the fonts are
> properly hinted.
It's possible that in a few years, the LCD monitors will have a higher
resolution so that larger font sizes (in pixels) will be used for typical
screen presentation. Mo
> But can someone explain to me why a ligature such as ct which CANNOT be
> accurately decomposed into individual characters (at least, it can't if
> it's designed PROPERLY) shouldn't be encoded in its own right?
I don't think the design of the glyph has anything to do with the issue of
encoding.
- Original Message -
From: "Asmus Freytag" <[EMAIL PROTECTED]>
> Ligatures in use in Fracktur are entirely missing. For example, ch is
> a mandatory ligature there.
It doesn't matter whether a ligature is "mandatory" or not. Ligatures should
not be encoded _at all_, and these encoded in
> The name Unicode is now in mailboxes around the world - is this a good or
> bad thing?
I guess neither good nor bad. Most people don't know what Unicode is and
most people don't English anyway, so I guess they won't really care what the
message says.
Adam
Michael, Peter, and others:
Thank you for the responses.
I agree with Michael that the simplistic approach I have envisioned would be
rather incomplete -- I'm willing to accept that limitation. I am aware of
many issues involving IMEs, "chaining" dead keys etc. I would be willing to
leave them ou
mmers) 0105AltGr+A
WindowsPolish Polish (Programmers) 0041A
...
where I could, for example, look up which exactly keyboard layouts let the
user input, say, a with acute, and how he can do that.
Thank you in advance,
Adam Twardoch
> > For the same reason, why is the German "ess-tsett" (sharp S) given a
> > compatibility decomposition as instead of ?
>
> Don't know. But there are instances of sharp s (Ã) that look like a
ligated
> long-s (Å) and ezh (Ê).
That is correct.
Before a consistent spelling using "Ã" was introduce
From: "John Hudson" <[EMAIL PROTECTED]>
> No, the core fonts in Windows were all hinted for b/w rendering, because
> that is the rendering that MS used when those fonts were developed. MS are
> not currently shipping any fonts that are specifically hinted for
ClearType
> rendering, although that w
; -- they're no different than base 17 or base 22 numbers.
Seriously: since no writing system "natively" uses hexadecimal digits
(except for a bunch of crazy programmers), there is no reason encoding them.
As I mentioned beofre, in international mathematrics, they are represented
From: "Michael Everson" <[EMAIL PROTECTED]>
> >[EMAIL PROTECTED]
This merely means that somebody has a virus who had both Michael and Roozbeh
in his/her address book.
People who believe that e-mails with a particular name in the From field
must come from that very person can be called, ehem, naii
> From: Doug Ewell [mailto:[EMAIL PROTECTED]]
> On a somewhat related note, here's a utility I'd like: something that
could
> look inside a TrueType or OpenType font and tell me what Unicode code
points
> it covers (i.e. has one or more glyphs for).
Try FontExpert 2003 for Windows:
http://www.pro
> > How to get unicode values for special characters in Java?
> > I have a set of Czech special characters?
Use SC Unipad: http://www.unipad.org/
You can paste Unicode text into it or convert from any other encoding using
the File / Import option. Then, you can use Edit / Select All and Edit /
Co
From: "Chris Jacobs" <[EMAIL PROTECTED]>
> Does unicode have a code point for the former dutch currency symbol?
> It would really surprise me if it had, we don't have dutch currency any
more, so what would we use it for?
If De Telegraaf publishes a DVD edition of their complete archive, and all
t
> seems to say that the code point for it is
> 025219 (decimal)
> which I think would be U+6283 (hex)
Shouldn't it rather be U+0192 (Alt+0402)?
Adam
Frank da Cruz wrote:
> > By the way, the German phrase is mine. I seem to have discovered a
German
> > word (the name of a town, Ã"echtringen) that has an acute accent. It's
> > listed in the Postleitzahlenbuch:
It looks very much like a "ghost" acute accent that would probably turn out
as a pr
From: "Radovan Garabik" <[EMAIL PROTECTED]>
> Just a wild guess: isn't the acute accent used to indicate that
> Oe does not form a digraph Ö, but is to be pronounced separately?
This "acute" really looks like a printing error to me, not like a real
acute. But if you were to pronounce o-e distnict
Do you think it would make sense (or maybe this probposal has already been
filed) to introduce tagging for OpenType layout features into Unicode? So
far, no tagging for OTL tags exists.
Adam
I'm looking for reference/guidelines on positioning of Vietnamese
diacritics. Any hints?
Adam
> Do we again need an intelligent font that understands language tagging?
This should be achievable with OpenType, no?
> Do we now have different flavors of Unicocde, one for English, one for
Icelandic, one for French, one for German ... ?
In most of the cases described be you, you can still hav
> I just installed inDesign 1.5 and noticed that it doesnt support Unicode
> characters (Pasting from W2k's CharMap and using Keyman).
>
> Can anybody tell, please, which version of inDesign do support Unicode,
> if any?
Antonio,
InDesign 1.5 does support Unicode, it just supports neither pastin
Does anyone know an English-language summary/overview of typesetting rules
in Spanish (in particular: quotation and punctuation marks, dashes), or
could somebody summarize them to me please?
Thank you in advance,
Adam
Michael, John, Kenneth,
thank you very much for the prompt answers. This is exactly how I thought it
is (except for 02CA, where I wasn't sure).
Adam
I have a very basic question. What would be the implementation differences
of diacritics marks in a font? For example, we'd consider:
U+00B4 acute accent
U+02CA modifier letter acute accent
U+0301 combining acute accent
What are the common recommendations regarding the glyphs in a font
(TrueType
It's not exactly Unicode-related, but should be interesting for people
working with multilingual applications.
I've just discovered a great online demo of ScanSoft's text-to-speech
technology. I've tested it with Polish, German, Russian, French and English,
and it all delivered amazing results!
> Some people might find this exhibit of interest. I don't have any web
> address, but I received the following announcement (edited for brevity).
Rick,
this is indeed quite interesting.
The website of the Gutenberg Museum in Mainz is:
http://www.gutenberg.de/museum.htm
There's no information
- Original Message -
> > How many characters does the current version of the Unicode Standard
> > enumerate?
>
> 95,156.
Oh, thank you! I needed that figure to make a point why you cannot make a
single TrueType font covering all of the Unicode range. I knew it was way
more than 65,536, b
How many characters does the current version of the Unicode Standard
enumerate?
BTW: I think this information would be useful if it were always included in
the summary of earch revision.
Adam
> A number of people asked me for HTML or PDF versions of the Unicode
> Myths slides. Since I didn't want to pop $300 to do PDF versions,
(...)
Why $300? http://www.pdf995.com/download.html is free.
@
> And where can i find the charts. i searched the unicode site, but in vain.
Ehem... Go to http://www.unicode.org/ and click on "Code Charts"?
There, browse all the sections with "CJK" in the name.
@
- Original Message -
From: "Marcin 'Qrczak' Kowalczyk" <[EMAIL PROTECTED]>
> I don't understand. I'm talking about characters above U+, not
> about characters from the range U+D800..DFFF. They are represented
> as themselves in UCS-4. But the said routine represents them as pairs
> of
Dear Ken,
thank you for your reply. This sheds some light onto what I've discovered so
far. Especially the information that CID font 0 won't work on Windows
NT/2000! This was actually unclear to me. Can you please confirm which
latest Windows ATM versions for which Windows systems do support the
> It looks like a great solution for folks with dedicated servers. My
problem
> however, is that my provider will not allow server extensions.
I recollect Borware talking about an option to have them host websites.
Might be an interesting opportunity.
Adam
e specialists on that list who might be
willing to respond on just one or two items from my list -- because that
would happen to be their specialty area.
Regards,
Adam Twardoch
able in English language and may be purchased by customers in Europe
and North America.
I would mostly appreciate explicit answers, but also links, references,
corrections and additions to my understanding sketched above etc.
Thank you in advance,
Adam Twardoch
From: "Carl W. Brown" <[EMAIL PROTECTED]>
> If you always render each language in its own language
> you will have real font problems. This is where graphic representations
> come in handy.
You can always use Fairy, and have multilingual text and no font problems at
all :)
( http://fairy.borware
e European languages,
for a start.
Technically, a list of language tags (whatever system) plus a
Unicode-to-language tag mapping list would do perfectly. Collating info
would be a welcome addendum...
Adam Twardoch
re there some standardized "transliteration" mechanisms for such
situations, or are those conversions Microsoft-specific?
Adam Twardoch
47 matches
Mail list logo