xkcd: ‮LTR

2012-11-21 Thread Deborah Goldsmith

http://xkcd.com/1137/ 

Finally, an xkcd for Unicoders. :-)

Debbie



Re: Best smart phones apps for diverse scripts?

2010-10-29 Thread Deborah Goldsmith
iPhone 4 supports Unicode in SMS messages. Furthermore, the SMS standard 
provides for Unicode in messages:

http://en.wikipedia.org/wiki/SMS

I haven’t encountered any problems sending Unicode SMS messages on ATT in the 
US.

Debbie

On Oct 29, 2010, at 8:13 AM, Ed wrote:

 That's an interesting question Don.
 
 I recently bought a so-called ChiPhone (Chinese phone) which has
 message catalogs and input methods for English, Français, Español,
 Português, Italiano, Deutsch, Bahasa Melayu, Bahasa Indonesia, Türçe,
 Tiếng Việt, русский язык, Arabic, Persian, Romanian, ไทย, 繁體中文 and of
 course 简体中文.
 
 The phone has a side slide-out QWERTY keyboard which is very
 convenient.  The input method for 简体中文 is decent enough.  However,
 overall the software on the phone sucks, and a number of the other
 language input methods are awkward or bordering on unusable.  The lack
 of Japanese is also annoying.
 
 And there is another big problem: at least here in the U.S., it looks
 like at least some major carriers refuse to accept Unicode text
 messages outside of the ASCII range.  I wish I knew more specifically
 what is or is not accepted.  I know I have had problems trying to send
 Chinese text messages with T-Mobile: the carrier refused to accept
 messages containing symbols.  Very annoying.
 
 Does anyone on this list know specifically what limitations carriers
 in the U.S. impose on unicode SMS messages?  Are there specific
 encoding issues?
 
 I think it would be especially valuable to know if the iPhone4 using
 ATT in the U.S. deals with Unicode properly?  The reason I single out
 the iPhone4 is because its high-resolution screen is very much
 superior to a typical smart phone, especially when it comes to reading
 scripts with many strokes like Chinese, or with many small diacritical
 marks, like Vietnamese or Thai.  (If you have not done so yet, try
 reading a Chinese web page on your typical smart phone, and then do
 the same on an iPhone4 to see the difference).
 
 - Ed
 
 On Fri, Oct 29, 2010 at 8:20 AM, Don Osborn d...@bisharat.net wrote:
 What do users of this list find to be the most Unicode friendly smart
 phones? Apps for those phones? Best input systems for texting beyond ASCII
 (and potentially multiscriptly)?
 
 
 
 Thanks in advance for any feedback. I’m back in the US and in the market for
 a new phone, and if I pay for high-end, don’t want to be limited to ASCII.
 
 
 
 Don
 
 





Cake Wrecks: My Thai Font

2010-06-05 Thread Deborah Goldsmith
http://www.cakewrecks.com/2010/06/my-thai-font.html

It’s not often you get computers and wedding cakes in the same post…

Debbie



Re: Cake Wrecks: My Thai Font

2010-06-05 Thread Deborah Goldsmith
If you’re amazed by that, you probably don’t read Cake Wrecks regularly. ;-)

Debbie

On Jun 5, 2010, at 12:30 PM, Clark S. Cox III wrote:

 
 On Jun 5, 2010, at 11:33 AM, Deborah Goldsmith wrote:
 
 http://www.cakewrecks.com/2010/06/my-thai-font.html
 
 It’s not often you get computers and wedding cakes in the same post…
 
 I'm actually amazed that the decorator went through the trouble to accurately 
 reproduce the mojibake gibberish.
 
 -- 
 Clark S. Cox III
 clarkc...@me.com
 



Re: Nicest UTF

2004-12-04 Thread Deborah Goldsmith
On Dec 3, 2004, at 2:54 AM, Andrew C. West wrote:
I strongly agree that all Unicode
implementations should cover all of Unicode, and not just the BMP, and 
it really
annoys me when they don't; but suggesting that you need to implement 
supra-BMP
characters because they are going to start popping up all over the 
place is
wrong in my opinion (not that Doug suggested that, but that's my 
extrapolation
of his point). Software developers need to implement supra-BMP 
characters
because some users (probably very few) will from time to time want to 
use them,
and software should allow people to do what they want.
Actually, about 10% of the glyphs in the Japanese fonts that ship with 
Mac OS X are represented by characters in plane 2. The main reason they 
are there is because they are used in names (people, places, and 
companies). So there are real customers who want to use characters 
outside the BMP. I would not characterize it as very few. That's true 
of the vast majority of SMP characters, but not all of them.

Deborah Goldsmith
Internationalization, Unicode Liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Looking for a C library that converts UTF-8 strings from their decomposed to pre-composed form

2004-11-08 Thread Deborah Goldsmith
I think he's saying he wants to convert to NFC *from* Mac OS X data, in 
which case the fact that Mac OS X's file system normalization is not 
strict NFD doesn't really matter. Also, he says he's running on 
Solaris, which would make it a tad difficult to call a Mac OS X API. 
ICU should do the trick.

It's worth pointing out that there is no such thing as precomposed 
Unicode. Normalization form C (NFC) could be called as precomposed as 
possible. There are some sequences of Unicode that can only be 
expressed using combining marks.

Deborah Goldsmith
Internationalization, Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Nov 8, 2004, at 5:17 PM, Markus Scherer wrote:
Tay, William wrote:
Is there any C library available that converts the decomposed UTF-8 
byte
streams into the pre-composed equivalent?
MacOS X does decompose filenames, but it does not use standard Unicode 
normalization (because it was
designed before Unicode's normalization was finalized.) I suggest you 
search the mailing list
archive for this list for more details. You probably need to use a 
MacOS system function.

ICU has options for normalization (some defined with internal 
constants only) which may or may not
match, or get close to, MacOS filename normalization: 
http://oss.software.ibm.com/cgi-bin/icu/nbrowser

markus



Re: Grapheme clusters

2004-10-05 Thread Deborah Goldsmith
UAX 29 provides for language-specific tailoring of break behavior, and 
this seems like a situation where you'd want grapheme break to be 
tailored. See section 3 of UAX 29 for a discussion of this.

Which language are we discussing here?
Deborah Goldsmith
Internationalization, Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Oct 5, 2004, at 9:09 AM, Chris Harvey wrote:
Ysgrifennodd Christopher Fynn [EMAIL PROTECTED] ar y 05-10-2004 am 10:42:
It would of course be possible to have these pair combinations 
replaced by a single ligature glyphs using the locl feature in 
OpenType under a specific language tag.
Are ligatures what Im looking for? The letters of the consonant 
cluster like kw are not joined together visually in any way. Also, 
when I use OpenType for ligatures like ffi st etc. the parts of the 
ligature are deleted one at a time. I think the UAX#29 or UAX#28 
discusses the differences between grapheme clusters and ligatures.

Is there a locale setting for this language? - Many applications now 
automatically tag documents with the current input locale.
There is no locale setting. The Native American language has a very 
small speaker base.

You can tell these people that the PUA is no real solution since you 
can get some very unexpected glyphs displayed for PUA characters. 
(Microsoft   Windows automatically maps a bunch of non-BMP CJK 
characters to PUA codepoints and sometimes these will display instead 
of the glyphs in your font.
In fact, I have been continuously sending them lists of all the 
reasons not to use PUA, the CJK problem being only one of many. The 
problem is, they feel the PUA solves their two biggest issues, 
backspacing the clusters and collation, without realising that a whole 
new and bigger batch of problems arises.

Thanks,
Chris Harvey
--
Gwlad heb iaith, gwlad heb galon
www.languagegeek.com





Re: Problem with accented characters

2004-08-23 Thread Deborah Goldsmith
FYI, by far the largest source of text in NFD (decomposed) form in Mac 
OS X is the file system. File names are stored this way (for historical 
reasons), so anything copied from a file name is in (a slightly altered 
form of) NFD.

Also, a few keyboard layouts generate text that is partly decomposed, 
for ease of typing (e.g., Vietnamese).

Deborah Goldsmith
Internationalization, Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Aug 23, 2004, at 11:51 AM, Doug Ewell wrote:
Problem with accented charactersWilliam Tay wrote:
Can anyone explain why an accented character is sometimes represented
as a base character plus its accent?  For example, the utf-8
representation for é is 65 CC 81, which is the utf-8 representation
for e and the accent, instead of C3 A9?  I find that this is how MacOS
X represents accented characters.
The two characters U+0065 and U+0301 (é) are canonically equivalent to
the single character U+00E9 (é).  That is, the two-character combining
sequence is supposed to be considered equivalent to the single
precomposed character.  Apparently MacOS X, or at least one application
running under it, does use the combining sequence.
How can a C application that receives such utf-8 encoded characters
handle them correctly?  Appreciate your comments.
It must understand normalization.  See TUS 4.0, section 5.6 for more
information.
-Doug Ewell
 Fullerton, California
 http://users.adelphia.net/~dewell/





Re: MacOS character sets

2004-07-12 Thread Deborah Goldsmith
The native character set for Mac OS X is Unicode. Earlier versions of 
Mac OS used Apple-proprietary character sets, and some applications 
still use those character sets on Mac OS X, though their use is 
deprecated.

The mappings for Apple's old character sets are available at:
http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/
Deborah Goldsmith
Internationalization, Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Jul 12, 2004, at 7:39 AM, Tay, William wrote:
Hi,
 
I'd like to understand what character encoding an application that 
runs on MacOS uses.  Just as Windows applications generally use code 
pages and UNIX applications use ISO-8859-X character set, what about 
MacOS applications?
 
Is there any website that shows the encoding of characters of the 
MacOS character sets?
 
Thanks.
 
Will 




Hebrew modifier question

2004-05-24 Thread Deborah Goldsmith
I'm in the process of grooming some data for the CLDR 1.1 release and 
have run into an issue with use of a modifier letter in Hebrew.

There appears to be a usage of a modifier letter or punctuation to 
annotate transcriptions of non-Hebrew words. This is appearing in the 
country and language data. Here are some examples using U+0027 
APOSTROPHE:

AZ { ' }
CL { ' }
CZ {  ' }
GS {  ''   '  }
cs { ' }
I have two questions:
1. Is this considered punctuation or a modifier letter? I.e., would the 
proper character come from U+2xxx (punctuation) or U+02xx (modifier 
letters)?

2. What is its proper typographic shape? Is it really a straight mark 
like U+0027, or does it look like U+2019, U+2018, or something else?

I'd appreciate any information anyone has on this mark.
Thanks,
Deborah Goldsmith
Internationalization, Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Panther PUA behavior

2004-02-05 Thread Deborah Goldsmith
On Feb 4, 2004, at 4:42 PM, Dean Snyder wrote:
Doing font substitution on PUA code points was causing problems,
because we have found a lot of fonts have garbage entries in their
cmaps in the PUA, due to the implementation details of certain
font-editing applications (which use the PUA part of the cmap as a
scratch area).
Did this cause real problems for users?
Yes. There are an alarming number of fonts out there that have garbage 
entries in the PUA region of their cmap. Not to mention the GB 18030 
and HK-SCS fonts that have PUA entries that *cannot* be removed until 
the relevant characters are encoded in Unicode (don't get me started on 
that decision...). We cannot decree what font vendors do: the fonts are 
out there, and some of the foundries that produced them aren't even in 
business any more. We cannot force end users to upgrade their entire 
font collection to eliminate less-than-squeaky-clean fonts (there would 
be a lot of empty Fonts folders). Users tend to consider such problems 
to be OS problems, not font problems, however unfair that may be.

The system was basically picking a random font in many
cases. Rather than have a font substitution mechanism that picks a
random font, we disabled font substitution for PUA characters.
But I suggest that that randomness can actually be controlled by 
users
- by font installation or code point re-assignment.
We always try to keep the needs of non-expert users foremost. We 
continue to refine our implementation of font substitution; we will 
keep this issue in mind and look for ways to accommodate expert users 
of the PUA.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Panther PUA behavior

2004-02-04 Thread Deborah Goldsmith
On Feb 2, 2004, at 9:20 PM, Dean Snyder wrote:
I hope Apple re-thinks this, because it makes PUA useless in plain 
text.
Doing font substitution on PUA code points was causing problems, 
because we have found a lot of fonts have garbage entries in their 
cmaps in the PUA, due to the implementation details of certain 
font-editing applications (which use the PUA part of the cmap as a 
scratch area). The system was basically picking a random font in many 
cases. Rather than have a font substitution mechanism that picks a 
random font, we disabled font substitution for PUA characters.

I agree this causes problems for things like file names in the Finder, 
but I worry whether that is an appropriate use of PUA characters. When 
Cuneiform is encoded, all these file names will have to be reentered.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: corporate/users PUA ranges (was: Cuneiform - Dynamic vs. Static)

2004-01-14 Thread Deborah Goldsmith
On Jan 13, 2004, at 7:10 PM, Philippe Verdy wrote:
For Panther:
user part of PUA == everything not in the corporate part of the PUA
corporate part of PUA == the set of corporate PUA characters that
Apple defines on Mac OS
Which is?
Obviously, it's:
http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/CORPCHAR.TXT
So I wonder how Panther can do want you said:
FYI, Panther was changed to not do font substitution in the user part
of the PUA (it still does it in the corporate part).
without using some arbitrary limit in the middle of the PUA range of 
the
BMP, by assuming that Apple has not assigned (will not assign) 
corporate
characters in these PUA code points.
Because the assignments in the corporate area only change when we 
release a new version of Mac OS, not in between. The set of corporate 
characters is fixed for Panther.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Cuneiform - Dynamic vs. Static

2004-01-14 Thread Deborah Goldsmith
On Jan 13, 2004, at 7:38 PM, Doug Ewell wrote:
Sorry to belabor this, but:

Based on
http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/CORPCHAR.TXT, this
zone could start at U+F700 if you include the NeXT function key
mappings, or U+F800 (actually U+F803) if you don't.  Should we assume
that Apple draws the line at U+F700?  The UTC has always been careful
never to draw the line themselves.
It's always best to work up from the bottom of the PUA. If you want 
your font to work on Mac OS X, it's best to avoid the values in the 
file you cite. Apple adds new corporate PUA characters very 
infrequently; we try to avoid PUA characters at all costs.

Beyond those statements I can't make any guarantees...

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Cuneiform - Dynamic vs. Static

2004-01-14 Thread Deborah Goldsmith
On Jan 13, 2004, at 7:38 PM, Doug Ewell wrote:
Sorry to belabor this, but:

Based on
http://www.unicode.org/Public/MAPPINGS/VENDORS/APPLE/CORPCHAR.TXT, this
zone could start at U+F700 if you include the NeXT function key
mappings, or U+F800 (actually U+F803) if you don't.  Should we assume
that Apple draws the line at U+F700?  The UTC has always been careful
never to draw the line themselves.
It's always best to work up from the bottom of the PUA. If you want 
your font to work on Mac OS X, it's best to avoid the values in the 
file you cite. Apple adds new corporate PUA characters very 
infrequently; we try to avoid PUA characters at all costs.

Beyond those statements I can't make any guarantees...

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Cuneiform - Dynamic vs. Static

2004-01-13 Thread Deborah Goldsmith
FYI, Panther was changed to not do font substitution in the user part 
of the PUA (it still does it in the corporate part). This was because 
different fonts can use the same PUA code point for different things 
(and do; this was not a hypothetical problem but one we have seen in 
practice). The idea going forward is that use of PUA code points needs 
to be accompanied by an explicit font specification. Picking the first 
font you find for a PUA code point does not seem like the right 
approach to us.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Jan 13, 2004, at 1:23 PM, Dean Snyder wrote:

  (RESPONSE) Not being a font designer, I called a font designer friend
of mine and he DID say there are tool problems and operating system
problems associated with non-code-point-specified glyphs in OpenType. 
He
specifically mentioned Volt and FontLab. For what it's worth, I have 
seen
a difference between Jaguar and Panther in how Mac OS X treats 
characters
in the PUA - in Panther they commonly show up with the indeterminate
glyph symbol even when a suitable font, that worked in Jaguar, is 
installed.




Re: Cuneiform - Dynamic vs. Static

2004-01-13 Thread Deborah Goldsmith
On Jan 13, 2004, at 4:58 PM, Philippe Verdy wrote:
Where is the limit between the user part of the PUA and the 
corporate
part of the PUA?
For Panther:
user part of PUA == everything not in the corporate part of the PUA
corporate part of PUA == the set of corporate PUA characters that 
Apple defines on Mac OS

Also what is the status of planes 15 and 16? are they all in the user 
part
Actually, I don't think planes 15 and 16 were covered by this change. 
Time to write a bug...

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Panther PUA behavior (was RE: Cuneiform - Dynamic vs. Static)

2004-01-13 Thread Deborah Goldsmith
On Jan 13, 2004, at 5:47 PM, John Hudson wrote:
If you have an existing document that already specifies a font, it 
should be fine. I believe Deborah's point is that Apple will no longer 
try to guess the best font to use when PUA codepoints are entered 
using another font or are found in a plain text document.
Yes, that's exactly right.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: New MS Mac Office and Unicode?

2004-01-12 Thread Deborah Goldsmith
On Jan 6, 2004, at 11:48 AM, Tom Gewecke wrote:
MS Mac Office 2004 was announced at MacWorld SF today.  Does anyone 
know
whether this update finally brings the Unicode capabilities of the 
WinXP
version to the Mac OS X world?
I can now tell you that Mac Office 2004 does offer enhanced support for 
Unicode, in that it can input, edit, and display Unicode characters 
that are not part of any Mac OS legacy character set. I can't say yet 
to what extent the various components of Office support complex shaping 
behavior or bidirectional scripts (e.g., Arabic, Thai, Hindi), because 
I don't know. However, at the very least you will see access to 
expanded CJK repertoires, and access to languages like Icelandic and 
Greek.

Mac Office 2004 will also include fonts with larger repertoires than 
previous versions of Mac Office.

Here are some highlights from the PR information I received:

- Can input, print, and display more than 30 languages
- Larger font repertoires (e.g., Arial 296 glyphs - 1192, MS Mincho 
~9000 glyphs - 16,031)
- Japanese fonts included (MS P Mincho and Gothic)

I've been told more details will be discussed in the coming months 
before MS Mac Office 2004 is released. Perhaps some of the Microsoft 
folks on this list can add more details? :-)

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: MS Windows and Unicode 4.0 ?

2003-12-04 Thread Deborah Goldsmith
Microsoft Office v. 10 for Mac OS X uses the same document format as 
Office for Windows, so the documents are indeed stored as Windows. 
However, since Office v. 10 uses Apple's obsolete QuickDraw API set for 
text rendering, it cannot render text that is not in a Mac OS legacy 
character set (e.g., MacRoman, MacJapanese, etc.). There have been APIs 
available since Mac OS 8.5 for rendering Unicode text directly, but the 
applications Michael Everson mentions are not using them.

In Office v.10, characters it can't render via QuickDraw show up as 
underscores. They're still there, but you can't see or edit them.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Dec 4, 2003, at 9:41 AM, Raymond Mercier wrote:

 Is it really the case that characters in Word in OS X are not stored 
as
 Unicode, even though they are so stored in Word in Windows NT (and 
later)
on
 a PC ?
 If not stored as Unicode on a Mac, then how are they stored ?




Re: MS Windows and Unicode 4.0 ?

2003-12-03 Thread Deborah Goldsmith
Actually, Mac OS X 10.3 Panther includes a set of Cherokee fonts.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Dec 3, 2003, at 8:59 AM, D. Starner wrote:

It wouldn't
hurt you at all to release them in some form, and it would help the
Cherokee (for instance) to actually use Unicode for their language.




Re: MS Windows and Unicode 4.0 ?

2003-12-03 Thread Deborah Goldsmith
Of far more value than Apple employees pressuring Apple's management to 
cough up the money for new fonts would be Apple's *customers* telling 
Apple's management they want to see such fonts.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Dec 3, 2003, at 7:55 AM, John Jenkins wrote:

Personally, I'd rather pressure Apple's management to pay Michael for 
new fonts for additional scripts than to pay him to draw new localized 
versions of the LR font.




Re: Free Fonts

2003-12-03 Thread Deborah Goldsmith
As far as I know there is no legal issue with adding hints to fonts. 
Any legal issue around font hinting is going to relate only to the 
software which takes fonts and produces rendered glyphs on a display 
device, not to the fonts themselves.

Apple does not ask font developers to pay royalties for anything 
related to font development or distribution. So if you add hints to a 
font, no, you do not owe Apple any royalties, nor is there any other 
legal issue I'm aware of. We *want* people to produce high-quality 
fonts, including high-quality cross-platform fonts.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Dec 3, 2003, at 10:38 AM, Raymond Mercier wrote:

Philippe Verdy writes
Simple: for now the fonts are in beta, and do not include the hinting
instructions. This may be in development, but faces some legal issues
with Apple patents. So until there's a patent-free hinting mechanism,
for use in fonts, or Apple agrees with a royaltee-free license on
hinting mechanisms, hinted fonts cannot be freely distributed.
What is the legal position if these fonts are taken into Fontlab and
rehinted ?
Surely if I make my own hinted font in Fontlab I do not owe royalties 
to
Apple.

Raymond Mercier







Unicode support in Mac OS X (was: Re: How can I have OTF for MacOS

2003-12-02 Thread Deborah Goldsmith
Mac OS X currently supports Unicode in all Cocoa applications. Unicode 
support in Carbon applications depends on the application; the Finder 
and iTunes are examples of Carbon applications that support Unicode. 
Apple is migrating all of its applications to Unicode and strongly 
encourages developers to do the same.

Apple does not currently provide tools for adding OpenType capabilities 
to fonts; such tools are available from Adobe and Microsoft. Apple does 
provide tools for adding AAT tables to fonts; as has been pointed out 
earlier, AAT and OpenType tables can coexist in the same font. Mac OS X 
does not currently recognize OpenType tables at the system level, 
though many applications do. To get system level support it is 
necessary to add AAT tables to a font.

The tools and documentation are available from:

http://developer.apple.com/fonts

How to add keyboard layouts to Mac OS X is documented in Technical Note 
2056, which I mentioned yesterday:

http://developer.apple.com/technotes/tn2002/tn2056.html

Keyboard layouts can either be the same binary form as was used on Mac 
OS 9, or a new XML format which is documented in the tech note. Two 
resources for creating the new XML-style keyboards without editing the 
XML directly are:

http://homepage.mac.com/poorant79/software/
http://wordherd.com/keyboards/
Finally, information like date formats, collation order, and the like 
come from ICU, the open source library (as of Mac OS X 10.3, Panther):

http://oss.software.ibm.com/icu/

The best way to add new locale data to Mac OS X is to contribute it to 
ICU via the Common Locale Data Repository project (see the ICU page).

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Dec 2, 2003, at 3:22 AM, Mustafa Jabbar wrote:

Dear Sir,
Can you let us know about how we can have support of Unicode in 
MacOSX? What are the tools to create OTF for MacOSX and What are the 
tools for developing Keyboard for Uniocde under MAC OS X. What are the 
tools Apple is providing like VOLT and MSKLC?
What are the applications supports Unicode?
We have Bangla solution for Mac. But how we can update it for MacOSX? 
Fonts and the Keyboard?
Apple should speak up.
Mustafa Jabbar

At 04:36 PM 01-12-03 -0700, John Jenkins wrote:

On Dec 1, 2003, at 4:24 PM, Frank Yung-Fong Tang wrote:

John What 'cmap' format Apple use in the MacOS X
Devanagari and Bangla fonts?
The formats are irrelevant; the Mac supports all the 'cmap' subtable 
formats for all subtables.  For rendering complex scripts, however, 
the font can only be rendered through ATSUI (or Cocoa), because the 
old way to support complex scripts  via an 'itl5' resource in the 
suitcase with the 'FOND' and 'sfnt' resources  is not supported on 
X.

Apple really, really wants everybody to move to using Unicode in 
their applications for all their text, and Apple really, really, 
*really* wants people to do it for complex scripts.


John H. Jenkins
[EMAIL PROTECTED]
[EMAIL PROTECTED]
http://homepage..mac.com/jhjenkins/


--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.






Re: How can I have OTF for MacOS

2003-12-01 Thread Deborah Goldsmith
On Nov 24, 2003, at 6:47 PM, John Jenkins wrote:
Keyboards are just XML files.  If you're on Mac OS X 10.3, you can 
find samples inside 
/System/Library/Fonts/Unicode.bundle/Contents/Resources/*.keylayout.
Documentation on the XML keyboard file format for Mac OS X can be found 
in Apple Tech Note 2056:

http://developer.apple.com/technotes/tn2002/tn2056.html

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Unicode support for Khmer

2003-10-29 Thread Deborah Goldsmith
On Oct 27, 2003, at 12:30 PM, Sue and Maurice Bauhahn wrote:
Nothing of the sort is available on Macintosh (partly because Mac
applications that support ATSUI appear to be even more rare;-().
Unicode support in applications is widespread on Mac OS X. In 
particular, all the built-in applications should be able to handle 
Khmer with a properly constructed font. The system already supports 
Thai out of the box, via Unicode. It's true that neither a font nor a 
keyboard for Khmer are included with the OS.

Deborah Goldsmith
Manager, Fonts / Unicode liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: About that alphabetician...

2003-09-25 Thread Deborah Goldsmith
I already wrote this up internally as a bug.

Thanks,
Deborah
On 2003/09/25, at 14:05, Tom Gewecke wrote:

About the c-cedilla, it appears that OS X Safari does not  pick  up 
the charset on this page.  If the default is set to UTF-8, the c 
disappears altogether.  The  correct character is displayed only if 
the browser is set by default  or manually to Latin 1.






Re: Last Resort Glyphs (was: About the European MES-2 subset)

2003-07-19 Thread Deborah Goldsmith
Apple's version of the Last Resort font is a (relatively) normal font. 
It just has a cmap that maps lots and lots of characters to the same 
glyph. :-)

Deborah Goldsmith
Manager, Fonts / Unicode Liaison
Apple Computer, Inc.
[EMAIL PROTECTED]
On Saturday, July 19, 2003, at 12:15  PM, Michael Everson wrote:

Um, that's what the Last Resort font does, outside of Unicode encoding 
space. (I don't think PUA characters are used, actually, but I could 
be wrong.





Re: Normalized text in OS X?

2003-03-10 Thread Deborah Goldsmith
On Monday, March 10, 2003, at 05:07  AM, Michael Everson wrote:

What does confuse me is that I then tried to search for the strings, 
and I found that the Find engine considered the two strings to be 
different. Should it?
No, and the bug was filed a while back. :-)
(I also found that when I saved and closed the file and reopened it, 
the combining ogonek was stripped out, which I suppose must be a bug.)

Yes, that's a known bug, too.

Deborah Goldsmith
Manager, Fonts / Unicode Liaison
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Unicode keyboard layouts oddity in OS X 10.2.4

2003-02-19 Thread Deborah Goldsmith
There are two problems we have seen with keyboard preferences.

1. Bringing up the force-quit dialog (command-option-escape) can 
sometimes disable keyboards in ~/Library/Keyboard Layouts. This can be 
worked around by moving them to /Library/Keyboard Layouts. Please let 
me know if this is part of the problem.

2. Sometimes other keyboards will not remain enabled over logoff/logon, 
even if they are not in ~/Library/Keyboard Layouts.

Please do the following in Terminal:

defaults read com.apple.HIToolbox Keyboard Menu

The normal result is:

The domain/default pair of (com.apple.HIToolbox, Keyboard Menu) does 
not exist

If you get a different response, please contact me by private e-mail.

Thanks,

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Wednesday, February 19, 2003, at 05:34  AM, Kino wrote:

Greetings

I have created several Unicode keyboard layouts for OS X 10.2.x which 
are available at
	http://quinon.com/files/keylayouts/
Usually I have activated two of them: LatinTL and ArabicQWERTY.

After updating to OS X 10.2.4, Unicode keyboard layouts checked in 
Input Menu tag of Internet Preferences do not stick anymore. I.e. with 
each restart, they vanish from Flag menu and become unchecked in Input 
Menu.

My settings in Input Menu tag of Internet Preferences have not always 
been retained even before 10.2.4. Sometimes one of checked keyboard 
layouts vanished or was replaced with another, e.g. my ArabicQWERTY 
replaced with Apple's Arabic. But these glitches were not always 
reproductible at least with 10.2.1-10.2.3.

Now, even common keyboard layouts such as Unicode Hex Input do not 
seem to stick. I have not tested extensively with Apple keyboard 
layouts though.

Of course, I suspected my system installation. So I clean installed OS 
X 10.2 on another partition and created a new user. I have not tested 
with each updater, but OS X 10.2.1 retains Unicode keyboard layouts I 
have chosen whereas 10.2.4 does not.

Is this a bug? Or something is wrong with my keyboard layouts?

Yusuke Kinoshita


Yusuke Kinoshita








Re: BOM's at Beginning of Web Pages? Mac IE's Euro

2003-02-17 Thread Deborah Goldsmith
I can't explain why IE 5.2.2 is displaying the UTF-8 BOM as a Euro. 
It's important to understand that IE 5.2.x does not use Unicode for 
rendering. It takes the following approach:

1. Convert the text from the specified character set to runs of text in 
various Mac OS encodings.
2. Draw each text run using a font appropriate for that Mac OS encoding.

Any character that cannot be converted into a Mac OS encoding will be 
converted into ?. If you look at the source for the Unicode home page 
in IE, you'll see that there is indeed a ? at the beginning, as there 
is no mapping from U+FEFF to any Mac OS encoding. I have no idea why 
that is rendering as €.

I've already written a bug against Safari that it should handle BOMs 
(browsers need to handle lots of things that are not legal HTML).

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: How is glyph shaping done?

2003-02-03 Thread Deborah Goldsmith
For information on how this is handled on Mac OS, please see:

http://developer.apple.com/fonts/

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Friday, January 31, 2003, at 11:03  AM, John Hudson wrote:


On Windows, the shaping engines for complex scripts are part of 
Uniscribe (usp10.dll) and make use of OpenType font technology. An 
Arabic OpenType font will contain layout features for Initial init, 
Medial medi and Final fini substitutions (and possibly Isolated 
isol, e.g. to handle contextual variation of the letter heh). 
Uniscribe analyses strings of Arabic text, keeps track of the position 
of letters and their neighbours, and implements the appropriate layout 
feature for each letter.

For more information, see 
http://www.microsoft.com/typography/developers/opentype/default.htm, 
and the MS Arabic font specification at 
http://www.microsoft.com/typography/specs/default.htm






Re: Quick ATSUI question

2002-11-20 Thread Deborah Goldsmith
You should always keep ATSUStyle objects around as long as possible.

Whether it's faster to keep ATSUTextLayout objects persistently or to 
create and then destroy them depends on your application. If you redraw 
the same paragraphs repeatedly, it's to your advantage to retain the 
ATSUTextLayout objects for those paragraphs in a cache. If you draw 
text once and then don't draw it again for a long time, you can either 
recycle the ATSUTextLayout object or create a new one; it's not that 
expensive to create the object.

The things that are expensive are creating ATSUStyle objects and going 
through the layout process. For more information, see

http://developer.apple.com/qa/qa2001/qa1027.html

I strongly suggest you ask questions like this on the 
Carbon-Development mailing list rather than here on the Unicode list. 
You'll find many more people able and willing to help.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Wednesday, November 20, 2002, at 10:07 AM, Theodore H. Smith wrote:

Hi list,

I'd like to know which scheme turns out noticably faster. If neither 
is noticably faster, then thats good to know.

1) Using ATSUCreateTextLayoutWithTextPtr, then disposing the object, 
and creating a new one for each paragraph. This means I have to reset 
line controls and layout controls for each object.

2) Using ATSUCreateTextLayout, then setting the styles one by one using
ATSUSetRunStyle? I will reuse the line controls and layout controls.

Thanks for your answer. I'm guessing the answer will be 1, or 2, 
or neither :o). In other words, I don't need a detailed answer, but 
if the whim takes you then do so.

--
Theodore H. Smith - Macintosh Consultant / Contractor.
My website: www.elfdata.com/








Re: ATSUI text length parameters

2002-11-19 Thread Deborah Goldsmith
On Tuesday, November 19, 2002, at 08:49 AM, Theodore H. Smith wrote:

I want to pass some text to an ATSUI function. One of the params is 
UniCharCount iTextTotalLength. Does this include surrogates? IE, is 
this truely a CharCount as it's name? Or simple a ushort count, 
hence the size of the buffer / 2.

I have a strange feeling that it is the ushort count, and not a char 
count like it claims.

UniCharCount is, as its name implies, a count of UniChars. UniChar is a 
16 bit data type representing one UTF-16 code unit. UniChar is a bit of 
a misnomer, but then again I think it dates from before surrogates were 
invented, so it can be forgiven.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: ATSUI for MacOS9

2002-11-19 Thread Deborah Goldsmith
ATSUI is supported under Carbon on Mac OS X, Carbon on Mac OS 9, and 
native Mac OS 9. The native Mac OS 9 version and Carbon Mac OS 9 
versions are frozen (the native one at an earlier level); the OS X 
version continues to evolve.

I don't know why the demo doesn't do hit testing properly, but ATSUI is 
supposed to work on 9. I'd strongly recommend using the CarbonLib 
version.

Speaking of which, run, do not walk, to:

http://www.lists.apple.com/mailman/listinfo/carbon-development

and sign up for the carbon-development mailing list. You will find it a 
more fruitful place to ask questions about Apple technologies.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Tuesday, November 19, 2002, at 07:33 AM, Theodore H. Smith wrote:

Hi list,

I hope ATSUI questions are allowed. ATSUI involves drawing of Unicode 
text on screen or into off-screen graphics buffers.

I'd like to know if ATSUI can be used for MacOS9. The ATSUI demo for 
OSX works perfectly, but the ATSUI demo for OS9, can't do horizontal 
hit testing. :o(

Why not? Is this a bug in the demo, or a bug in ATSUI for OS9? Does 
ATSUI for Carbon on OS9 work if ATSUI for Classic OS9 doesn't?

If anyone knows ATSUI well, could you please contact me so I can ask a 
few more questions? Thanks a lot.

--
Theodore H. Smith - Macintosh Consultant / Contractor.
My website: www.elfdata.com/








Re: Info: Apple OSX Font Tools Suite 1.0.0 Released

2002-11-12 Thread Deborah Goldsmith
Faster: option-click the link. It will force a download.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Tuesday, November 12, 2002, at 09:20 AM, John H. Jenkins wrote:


Try control-clicking on the link and then selecting Save link to 
disk from the popup menu.

On Tuesday, November 12, 2002, at 09:55 AM, Dean Snyder wrote:

At 4:49 PM John H. Jenkins wrote:


Cupertino 11/8/02: Today the Apple Font Group released its new suite 
of
Unix command line font tools for OSX.

These can be downloaded free from http://developer.apple.com/fonts/.

The actual download URL is:

http://developer.apple.com/fonts/FontToolsv1.0.dmg

But I can't get it to download with any browser I've tried (IE, Opera,
Mozilla) - they all display the binary disk image as garbled text 
instead
of downloading it to disk. (I've fiddled with download helper 
preferences
for .dmg files but that hasn't helped. Is the .0.dmg file name
termination confusing the browsers?)


Respectfully,

Dean A. Snyder
Scholarly Technology Specialist
Center For Scholarly Resources, Sheridan Libraries
Garrett Room, MSE Library, 3400 N. Charles St.
The Johns Hopkins University
Baltimore, Maryland, USA 21218





Re: entering JIS 0213, HKSCS and GB 18030 characters

2002-11-05 Thread Deborah Goldsmith
On Thursday, October 31, 2002, at 01:05  AM, Eric Muller wrote:

We have a very hard time assembling the following information: on 
MacOS X and Windows XP, how do users practically enter JIS 0213, HKSCS 
and GB 18030 characters? We are interested by both OS provided IMEs 
and third party IMEs. Of course, we are interested in the more 
recent characters in those sets, e.g. those in 0213 and HKSCS that 
map to Plane 2.

Can anybody help? I'll summarize to the list.

On Mac OS X, all of these require an application that supports Unicode 
input, such as TextEdit, Mail, the Finder, iPhoto, etc. Mac OS X 10.2 
includes font support for the full JIS X 0213 and GB 18030 repertoires; 
there is not currently full font support for HKSCS.

1. JIS X 0213

- Our Japanese input method, Kotoeri, has words with JIS X 0213 
characters in its dictionary, such as moriougai. The second candidate 
for ougai is a JIS X 0213 character, as indicated by the little green 
triangle in the candidate window. In this way JIS X 0213 characters 
show up just as any other characters.

- Using the Character Palette (new in Mac OS X 10.2), JIS X 0213 
characters can be entered either using the Japanese view (e.g., via 
radical/stroke lookup) or the Unicode view, which lists characters by 
Unicode blocks or code points, including Extension B.

2. GB 18030

- The Simplified Chinese input method has a direct GB 18030 code point 
entry method.

- The Character Palette can be used as for JIS X 0213, and the 
Simplified Chinese view includes GB 18030 characters in the 
radical/stroke tab.

- The SC input method is extensible, and we include a sample file with 
Pinyin support for GB 18030: /Applications/Utilities/Asia Text 
Extras/Plugin_Text_Sample/AllGB18030-PinYinPlugin.dat. If this file is 
copied to ~/Library/ChineseInputMethodPlug-in/, when you next login you 
will have this support.

3. HKSCS

- At this time, the only way to enter all HKSCS characters is via the 
Character Palette. The subset that corresponds to MacTraditionalChinese 
can be entered via the Traditional Chinese input method.

I hope this helps...

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: Unicode Character names on Mac OS X 10.2

2002-10-14 Thread Deborah Goldsmith

No, the names are the names from the Unicode character database, and so 
are always in English even when the system is running in French.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Monday, October 14, 2002, at 08:14 AM, Patrick Andries wrote:

 Have the Unicode character names been localized,  as MS Windows has 
 done (at
 least for French and German which I have personally seen) ?

 In other words, have the official ISO 10646-1 and -2 French names been 
 used
 in the French version of this tool, or must French readers see an 
 English
 name for characters they know under a French name (all latin 
 characters,
 arrows, etc.) ?






Tech note on Installable Keyboard Layouts

2002-09-09 Thread Deborah Goldsmith

I'm happy to report that the Apple tech note on installable keyboard 
layouts for Mac OS X 10.2 has been published:

Tech Note 2056, Installable Keyboard Layouts
http://developer.apple.com/technotes/tn2002/tn2056.html

Please report any problems directly to me.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Mercury News: Hawaiian on a Mac

2002-09-05 Thread Deborah Goldsmith

On Thursday, September 5, 2002, at 08:32 AM, Doug Ewell wrote:
 Does everyone (Apple included) agree that the kahakō is U+0304 
 COMBINING
 MACRON and the ʻokina is U+02BB MODIFIER LETTER TURNED COMMA?

Our Hawaiian keyboard generates precomposed vowels with macron, not a 
combining macron. But yes, the kahakō is a macron. And ʻokina is U+02BB.

The following was typed with the Hawaiian keyboard if anyone is curious:

āēīōūĀĒĪŌŪʻ

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Mercury News: Hawaiian on a Mac

2002-09-05 Thread Deborah Goldsmith

On Thursday, September 5, 2002, at 03:43  PM, John Delacour wrote:
 Is there any news yet of that tech note on keyboards?  I've knocked 
 together a polytonic Greek keyboard using the old technology but I'm 
 keen to get working on the xml version and it would save me a lot of 
 time if there was some guidance available regarding all the different 
 actions etc.

There was a problem in getting it posted. I'm trying to get that 
resolved now.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Summary of Unicode/language features in Mac OS X 10.2 Jaguar

2002-08-28 Thread Deborah Goldsmith

On Wednesday, August 28, 2002, at 04:09 AM, Lars Marius Garshol wrote:
 That function also applies the bidirectional algorithm to the text it
 displays. However, since the application needs to do all manner of
 strange formatting (colouring, interspersed images, first-line
 specials, first-letter specials, base-line adjustments, ...) it calls
 the method word for word, doing the formatting itself.

This is why ATSUI (Apple's Carbon API for drawing Unicode text) is 
designed to deal with a paragraph at a time, supporting multiple fonts, 
colors, cross-stream shifts, with-stream shifts, baseline adjustments, 
leaving space for embedded images, etc. The new (to Jaguar) ATSUI 
Direct Access API allows direct program intervention in the low-level 
layout process as well.

An API that renders Unicode text correctly only when it's all in one 
font isn't terribly useful. :-)

Documentation on ATSUI Direct Access will be available whenever we 
update our online documentation to reflect 10.2, which should be over 
the coming few months, I believe. The header files are available now 
with 10.2, of course.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Summary of Unicode/language features in Mac OS X 10.2 Jaguar

2002-08-28 Thread Deborah Goldsmith

On Wednesday, August 28, 2002, at 01:55 PM, Lars Marius Garshol wrote:
 We just need a single display function that does glyph shaping and no
 more.  Is that available somewhere?

It's possible to do that with the direct access API.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Summary of Unicode/language features in Mac OS X 10.2 Jaguar

2002-08-27 Thread Deborah Goldsmith

On Tuesday, August 27, 2002, at 04:01 PM, Lars Marius Garshol wrote:
 Thank you very much for this. This kind of succinct no-nonsense
 summary is worth its weight in gold. (Collecting the same information
 through the normal sources might easily consume weeks.)

Hmmm. Since it was sent electronically, I'm not quite sure what its 
weight in gold would be. :-)

 | 3. Unicode
 |
 | Support for text in bidirectional and complex writing systems
 | (e.g. reordering) is now available in all Unicode applications,
 | including Cocoa applications.

 It would be nice to know what this means in term of the rendering
 API. Is it possible to display RTL scripts without having the display
 system reverse the text? I hope it is, since without this capability
 proper bidi support is near-impossible to achieve.

By default, RTL scripts are displayed in compliance with the Unicode 
bidirectional algorithm. It is of course possible to override this 
through use of directional override characters. It's also possible to 
use advanced APIs to disable bidi processing.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Mac OS X Keyboard Layouts (was Re: new version of Keyman)

2002-08-15 Thread Deborah Goldsmith

There is lots of good news about keyboards in Mac OS X 10.2, none of 
which I'm allowed to discuss until August 24, unfortunately. If you 
have signed an Apple non-disclosure agreement, write me privately and 
I'll blab about all of it. :-) I will be discussing all this and more 
at the San Jose Unicode conference, which, thankfully, is after August 
24.

I will try to post something on August 24 giving the basics.

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]

On Thursday, August 15, 2002, at 02:13 PM, Alex Eulenberg wrote:

 On Thu, 15 Aug 2002, Michael Everson wrote:

 Apple have made no official announcements about what keyboards will
 be included in the next or future releases of OS X.
 ...
 More good news: I hope to be releasing soon an OS X Unicode keyboard
 layout for Ogham, and of course I am developing OS X keyboard layouts
 with support for Welsh, Scottish Gaelic, and Cornish.


 Michael,

 What do you (plan to) use to create your Unicode layouts for Mac OS?
 ResEdit only lets you edit 8-bit layouts ('KCHR' resource). The only 
 tool
 I know of to create Mac OS Unicode layouts ('uchr' resources) is my own
 (free) web application, at

 http://wordherd.com/keyboards/

 On a related note I wonder if anyone can tell me whether the new Mac OS
 10.2 Jaguar lets you drag and drop, or otherwise easily add custom
 keyboard layouts, like with version 9 and previous, or whether you 
 still
 have to jump into Terminal and make an unsupported hack to the Human
 Interface Toolbox system files.

 --Alex













Language name questions

2002-05-24 Thread Deborah Goldsmith

Hi,

I am trying to determine the names of a few languages in their own 
language. This is for a list of language names that a user can select, 
like:

English
Français
日本語

and so on.

I need answers to some particular questions, but if someone could point 
me at a book or web site, then that would be even better.

Here are the languages I'm trying to pin down:

Hungarian: magyar or magyarul?
Slovak: Slovenský?
Slovenian: Slovenski? Slovensko?

Any help would be gratefully appreciated!

Deborah Goldsmith
Manager, Fonts  Unicode
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Language name questions

2002-05-24 Thread Deborah Goldsmith

On Friday, May 24, 2002, at 05:43 PM, Mark Davis wrote:

 http://oss.software.ibm.com/cgi-bin/icu/lx/en_US/utf-8/?_=sk

This has Slovenina, but we've also seen Slovensk.

Deborah





Re: display issue on mac

2002-04-30 Thread Deborah Goldsmith

On Tuesday, April 30, 2002, at 01:35  PM, Tom Gewecke wrote:
 Russian characters have an extra spacing on Mac in both browsers (no
 problem on pc).
 T h e   c h a r a c t e r sl o o k   l I k e   t h I s there is no
 actual space between them. I have been testing on OS X using IE 5.1 and
 NS 6.2. This page has the same issue
 http://www.unicode.org/unicode/standard/translations/russian.html .
 I have spent more than 10 hours trying to figure out the problem, this
 is my last hope.

 It's because your browser is using the Japanese Hiragino font, with its
 double-width characters, for Cyrillic.  Deactivate or remove this and it
 should look normal. Only a problem for OS X I believe.

That is the problem, but it's more general. Shift JIS contains Cyrillic, 
and IE and Netscape on the Mac do not give a way to control the sequence 
of fonts used for UTF-8 display. It's getting to Japanese fonts before 
Cyrillic fonts. This is not specific to either Hiragino or OS X; it 
happens with any Japanese font and both OS 9 and OS X.

One workaround is to specify a Cyrillic font for display of UTF-8 pages, 
but that requires the end user to configure their browser. Another 
workaround is to remove all Japanese support from the OS, but that is 
pretty draconian, and is not supported on OS X (/System/Library/Fonts on 
OS X is only modifiable by the superuser).

Deborah Goldsmith
Manager, Fonts
Apple Computer, Inc.
[EMAIL PROTECTED]





Variant locales?

2002-04-22 Thread Deborah Goldsmith

I had a recent inquiry from inside Apple as to whether there was a 
registry of variants of the standard ISO locales, e.g. ja_JP.kana for 
Japanese written only with kana. Does anyone know if there is any 
standard that attempts to describe such things?

Thanks,

Deborah Goldsmith
Manager, Fonts  Languages
Apple Computer, Inc.
[EMAIL PROTECTED]





NTT DoCoMo i-Mode graphic characters contact?

2002-04-12 Thread Deborah Goldsmith

Apple is investigating whether and how to propose the encoding in 
Unicode of the graphic characters used on i-Mode phones in Japan. In 
doing so it would be very helpful to have a contact at NTT DoCoMo who is 
involved in the definition of these characters and who might want to 
participate in a standardization effort.

If anyone knows of any such contacts, I would very much appreciate it if 
they could put me in touch with them.

Deborah Goldsmith
Manager, Fonts  Languages
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: GB 18030 question

2002-02-05 Thread Deborah Goldsmith

Thanks to everyone for researching this question!

Deborah

On Monday, February 4, 2002, at 06:25 PM, Qingjiang (Brian) Yuan wrote:

 Frank and Deborah,
   After I saw the e-mail from Deborah, I asked our Beijing office to
 contact the CESI. The follow is the information we got:

 --
 Have contacted with CESI. It is really a glyph bug. They have fixed it,
 but they did not notify us!

 CESI will not give us the updated fonts until tomorrow morning. It was
 said that there are serial glyph have been updated in the new version of
 the bitmap fonts.
 --

 Thanks.
 Brian.

 Yung-Fong Tang Wrote:

 I looks like both Mac/Linux/Window N6.2 and current Mozilla map that to
 FFE3. Looks like IE on winXP do the same way.

 We, mozilla i18n group, got the GB18030 mapping table from sun. B Yuan,
 any comment?

 Michael Everson wrote:

 At 11:23 -0800 2002-02-01, Deborah Goldsmith wrote:

 There is an error on page 10 of the GB 18030-2000 standard, in that
 the character with code point A3FE maps to U+FFE3 (FULLWIDTH MACRON),
 but is shown with a glyph that corresponds to U+FF5E (FULLWIDTH
 TILDE). The position of the character in its code block would also
 seem to indicate that tilde was intended.

 Does anyone have any idea of which should be considered correct, the
 glyph or the Unicode mapping value?


 Glyphs are informative in JTC1. I can only assume that the GB
 standards would follow suit.






GB 18030 question

2002-02-01 Thread Deborah Goldsmith

There is an error on page 10 of the GB 18030-2000 standard, in that the 
character with code point A3FE maps to U+FFE3 (FULLWIDTH MACRON), but is 
shown with a glyph that corresponds to U+FF5E (FULLWIDTH TILDE). The 
position of the character in its code block would also seem to indicate 
that tilde was intended.

Does anyone have any idea of which should be considered correct, the 
glyph or the Unicode mapping value?

Deborah Goldsmith
Manager, Fonts  Language Kits
Apple Computer, Inc.
[EMAIL PROTECTED]





Re: Kana and Case (was [totally OT] Unicode terminology)

2000-11-28 Thread Deborah Goldsmith

On Monday, November 27, 2000, at 06:55 PM, [EMAIL PROTECTED] wrote:

 You mean this?
 
 _|_ ||
  |_
 /| \/
 \|_/\

Yes.

 How do you *say* it?

I haven't a clue...

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Kana and Case (was [totally OT] Unicode terminology)

2000-11-27 Thread Deborah Goldsmith

on 11/22/00 3:00 PM, Kenneth Whistler [EMAIL PROTECTED] wrote:

 It isn't really nonce usage, but rather the adoption of the formal
 spelling mechanism of Katakana into Hiragana to indicate prosodic
 length. The place you'll see this usage of the prolonged sound
 mark fairly frequently is in Japanese comics, which are rather
 loose and inventive in their use of spellings and "paraspellings"
 to convey tone of voice and other prosodic information.

Another example is the use of dakuten on characters they're not normally
applied to (e.g. U+3042 U+3099).

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: Open-Type Support (was: Greek Prosgegrammeni)

2000-11-27 Thread Deborah Goldsmith

on 11/22/00 4:19 AM, Marco Cimarosti [EMAIL PROTECTED] wrote:

 - AAT/ATSUI (see in http:/www.apple.com).
 Most of the "intelligence" is in the font itself, which also includes a
 state machine to operate substitution. The behavior of the smart fonts may
 be influenced by external user settings.

John Jenkins already related most of the relevant information, but I'll just
add that:

http://fonts.apple.com/

is a much better place to go if you're looking for information on AAT, and

http://www.apple.com/developer/

is the place to go for information on ATSUI.

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: Mac support of UCAS in Unicode 3.0

2000-10-27 Thread Deborah Goldsmith

On Friday, October 27, 2000, at 12:15 PM, Aki Inoue wrote:

 OmniWeb is one of few Web browsers that display UTF-8 encoded Web pages.  As 
 long as you have UCAS font with Unicode cmap, you should be able to display 
 it with the browser. 
  

More precisely, all Mac web browsers can display UTF-8 encoded pages, but all of them 
are limited to the subset of Unicode supported by the union of Mac OS encodings, 
except for OmniWeb.

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]



Re: Mac support of UCAS in Unicode 3.0

2000-10-26 Thread Deborah Goldsmith

on 10/26/00 3:55 PM, Nesbitt, Gavin [EMAIL PROTECTED] wrote:

 To focus this advice, perhaps I can ask whether Unicode syllabic encoded web
 pages would be technically viewable on Macs? If so, what would be required
 to do so and if not, what might be required to get it supported...?

All currently available Mac web browsers, to the best of my knowledge,
render web pages using Mac OS character sets. If they receive Unicode
characters, they convert them to Mac OS encodings first, then display the
results of that conversion. The only exception I am aware of to this rule is
the OmniWeb application which runs only on Mac OS X.

Some web browsers use the Mac OS Text Encoding Converter to convert from
Unicode to Mac OS encodings. Theoretically, you could write a TEC plugin to
handle UCAS. However, I don't know if these browsers will recognize
additional encodings provided via a plugin and make them available, since I
haven't tried it.

There has been an API (ATSUI) in Mac OS since Mac OS 8.5 which can render
all of Unicode. No web browser that runs on Mac OS 8.x/9.x uses this API, to
the best of my knowledge. I think that Mozilla did at one point, but I don't
know if that support is still enabled. Developers are starting to adopt
ATSUI and the text editing API based on it (MLTE), but not any browser
vendors I'm aware of.

Basically, there is not a lot of software on OS 8 and 9 that supports
Unicode directly (i.e., all of Unicode, not just the subset Mac OS could
already handle). Mac OS X has lots of applications that use Unicode
directly, including the Finder and the built-in mail program and text
editor.

If you want web browsers to support UCAS on Mac OS 8.x/9.x you will need to
talk to the browser vendors: Microsoft, Netscape, and iCab being the most
well-known.

Feel free to contact me directly for more information.

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: Mac Questions

2000-10-09 Thread Deborah Goldsmith

on 10/9/00 10:19 AM, Yung-Fong Tang [EMAIL PROTECTED] wrote:

 I tried MLTEDEMO, it seems it support TSM 1.5 so I can finally see how
 "Extended Roman (U)" and "Unicode Hex Input" keyboard layout work. Does
 this app support 'utxt' clipboard format ?
 Any app does ?

MLTE Demo and SUE ought to handle 'utxt', since the clipboard is handled by
MLTE.

You can tell by copying from either of them and pasting into the Scrapbook.
The scrapbook lists all the scrap types that have been pasted in.

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]




Re: Mac/PC interoperability

2000-08-23 Thread Deborah Goldsmith

On Saturday, August 19, 2000, at 08:50 AM, RenE J.V. Bertin wrote:

 I have a preliminary version of the fonts ready. It works, but there seems to be an  
 issue with the conversion of the "currency" character and the character replacing 
it, 
 in documents coming from Mac. All other characters are translated correctly, and 
some 
 programs also convert that currency character correctly (i.e. from 0xdb on Mac, to 
0xa4 
 on PC), but others (e.g. RagTime5) do not. This happens with Word documents, but 
also 
 with RTF files. As far as I have been able to determine, the reverse direction, PC 
- Mac 
 goes without problems. 
 In addition, Word97 on PC does not perform any of the necessary conversions on 
Macintosh 
 RTF 
 at all! 
  
 Is there anyone who has experience with this kind of problem in general? What is the 
 matter with that one specific character? I contacted the RagTime help desk (they 
were 
 very kind, even though I only have a demo version), and they suggested me to ask 
"somebody 
 at unicode", since their program uses unicode internally. 
  

In Mac OS 8.5, the definition of 0xDB in the MacRoman character set changed from 
U+00A4 (currency sign) to U+20AC (Euro). Mac OS 8.5 and later treat 0xDB as the Euro.

If you are omitting characters that don't translate, then perhaps you should omit this 
one...

I hope this helps.

Deborah Goldsmith
Manager, International Toolbox Group
Apple Computer, Inc.
[EMAIL PROTECTED]