Re: FW: A product compatibility question

2001-10-17 Thread Otto Stolz

Gary P. Grosso had written:
 I sometimes wonder if XML or some other standard will evolve toward
 some standard use of markup to denote different languages.


Mark Davis wrote:
 XML (and HTML) already give you the capability of marking language. Look at
 xml:lang. If you are using XML, you should definitely not use the language
 tags. See http://www.unicode.org/unicode/reports/tr20/. (Martin Dürst
 ([EMAIL PROTECTED]) and Asmus Freytag ([EMAIL PROTECTED]) are preparing an
 update which will be out some day.)

See also Chapter 8, Language information and text direction,
http://www.w3.org/TR/html401/struct/dirlang.html from the
HTML 4.01 specification (which also holds for standard XHTML 1.0
documents, cf. line 4 and 5 of
http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd).

Best wishes,
   Otto Stolz





Re: FW: A product compatibility question

2001-10-09 Thread Gary P. Grosso

Because of Unicode's Han unification, I was under the impression that
to get both Traditional Chinese and Simplified Chinese to really look
right would require using different fonts for each.  To have different
fonts for the same characters in a single document would seem to
require use and recognition of language tagging.

Am I just showing my ignorance on this subject?


We are working with a client who is a publisher of Chinese medical
textbooks.
Our goal is to set up a configuration that will allow layout of English,

Simplified Chinese, and Traditional Chinese characters in a single
document.

---
Gary Grosso
[EMAIL PROTECTED]
Arbortext, Inc.
Ann Arbor, MI, USA





Re: FW: A product compatibility question

2001-10-09 Thread Asmus Freytag

At 01:43 PM 10/9/01 -0400, Gary P. Grosso wrote:
Because of Unicode's Han unification, I was under the impression that
to get both Traditional Chinese and Simplified Chinese to really look
right would require using different fonts for each.  To have different
fonts for the same characters in a single document would seem to
require use and recognition of language tagging.

Am I just showing my ignorance on this subject?


If you want to show English and Chinese in the same document, unless (or 
even) if the English is strictly for Chinese audiences, you will most 
likely want to use different fonts. Standard office automation suppliers 
like Microsoft have behind the scenes support for that, so that many users 
don't even know that they are actually using a different font for Latin 
than Han.

We are working with a client who is a publisher of Chinese medical
textbooks.
Our goal is to set up a configuration that will allow layout of English,

Simplified Chinese, and Traditional Chinese characters in a single
document.





RE: FW: A product compatibility question

2001-10-09 Thread Ayers, Mike


 From: Asmus Freytag [mailto:[EMAIL PROTECTED]] 
 Sent: Tuesday, October 09, 2001 01:02 PM
 
 At 01:43 PM 10/9/01 -0400, Gary P. Grosso wrote:
 Because of Unicode's Han unification, I was under the impression that
 to get both Traditional Chinese and Simplified Chinese to really look
 right would require using different fonts for each.  To have 
 different
 fonts for the same characters in a single document would seem to
 require use and recognition of language tagging.
 
 Am I just showing my ignorance on this subject?
 
 
 If you want to show English and Chinese in the same document, 
 unless (or 
 even) if the English is strictly for Chinese audiences, you will most 
 likely want to use different fonts. Standard office 
 automation suppliers 
 like Microsoft have behind the scenes support for that, so 
 that many users 
 don't even know that they are actually using a different font 
 for Latin 
 than Han.

Oooh - a swing and a miss!

He was talking about whether or not he needed separate fonts for
Traditional and Simplified Chinese.

Generally speaking, Japanese readers want to see Chinese characters
printed in Japanese fonts when they convey Japanese test (i.e. kanji).  This
is very important to Japanese readers.

Generally speaking, Chinese readers recognize simplified or
traditional texts by the characters themselves, so a single font may be used
for both.

Generally speaking, if you are going to use both traditional and
simplified text in the same document, especially if you are reprinting the
same text in both forms, it is good to use different fonts so that it is
visually obvious which text, traditional or simplified, is in any given
section.  I would suggest a font with heavy serif (brush strokes, triangles,
etc.) for the traditional text and a sans serif rounded font for the
simplified text, but I would *recommend* that you consult a professional
Asian typographer for font selection if you are doing this.

BTW, Han is scriptgeekspeak for Chinese, in case you didn't
know.


/|/|ike

 We are working with a client who is a publisher of Chinese medical
 textbooks.
 Our goal is to set up a configuration that will allow 
 layout of English,
 
 Simplified Chinese, and Traditional Chinese characters in a single
 document.
 
 




Re: FW: A product compatibility question

2001-10-09 Thread Gary P. Grosso

I appreciate these responses.  I am certainly not an expert in Han
unification.  I am trying to reconcile what John says with what
appears at http://www.unicode.org/charts/unihan.html.  For example,
there appear to be stylistic differences, at least, in a character
such as:
http://charts.unicode.org/unihan/unihan.acgi$0x4E9E
between fonts designed for different languages.

Regarding Asmus' contribution, I would assume that such products use
different fonts depending on what block the character is from, as
shown, e.g., at:
http://www.unicode.org/Public/3.0-Update/Blocks-3.txt

Since I don't see any definition at the level of Traditional Chinese
versus Simplified Chinese in the blocks, I don't see how an
application could properly switch fonts in this case.  Perhaps
the answer is it doesn't need to but I'll admit to being a bit
skeptical on that point.  I'm open to being convinced.

At 03:21 PM 10/9/01 -0400, John Cowan wrote:

Gary P. Grosso wrote:

Because of Unicode's Han unification, I was under the impression that
to get both Traditional Chinese and Simplified Chinese to really look
right would require using different fonts for each.


Han unification does *not* unify traditional and simplified
characters.

At 01:02 PM 10/9/01 -0700, Asmus Freytag wrote:

At 01:43 PM 10/9/01 -0400, Gary P. Grosso wrote:
Because of Unicode's Han unification, I was under the impression that
to get both Traditional Chinese and Simplified Chinese to really look
right would require using different fonts for each.  To have different
fonts for the same characters in a single document would seem to
require use and recognition of language tagging.

Am I just showing my ignorance on this subject?


If you want to show English and Chinese in the same document, unless (or 
even) if the English is strictly for Chinese audiences, you will most 
likely want to use different fonts. Standard office automation suppliers 
like Microsoft have behind the scenes support for that, so that many users 
don't even know that they are actually using a different font for Latin 
than Han.

We are working with a client who is a publisher of Chinese medical
textbooks.
Our goal is to set up a configuration that will allow layout of English,

Simplified Chinese, and Traditional Chinese characters in a single
document.



---
Gary Grosso
[EMAIL PROTECTED]
Arbortext, Inc.
Ann Arbor, MI, USA





RE: FW: A product compatibility question

2001-10-09 Thread Asmus Freytag

At 03:43 PM 10/9/01 -0500, Ayers, Mike wrote:
 Oooh - a swing and a miss!

No -- a pretty complete misunderstanding of my posting on your part.

The implication of my statements is that rich text support is required at 
least at some level of your architecture as soon as you want to go beyond 
fairly rudimentary support *even* for the common case of mixing Latin 
(English) with Han (Chinese...).

The goal stated in the original submission was to create 'documents'. If 
this means high quality publications, then plain-text is not the answer.

At 5:05 PM 10/9/01 -0400, Gary P. Grosso wrote:
Regarding Asmus' contribution, I would assume that such products use
different fonts depending on what block the character is from, as
shown, e.g., at:
http://www.unicode.org/Public/3.0-Update/Blocks-3.txt

Since I don't see any definition at the level of Traditional Chinese
versus Simplified Chinese in the blocks, I don't see how an
application could properly switch fonts in this case.  Perhaps
the answer is it doesn't need to but I'll admit to being a bit
skeptical on that point.  I'm open to being convinced.

Gary is correct that if you have a (rich) text display engine but 
plain-text data, the easiest way to do font bindings is by code point 
range. Many browsers are in this situation a lot. However, your font 
binding choices get better if you do a little bit of language recognition, 
which might well allow you to distinguish between mainland Chinese and 
Taiwanese Chinese texts, esp. since the most simplified characters have not 
been unified with traditional characters.

Such language recognition (in the Latin context) would allow one to select 
fonts that match the local typographical conventions more optimally - there 
are definite differences, e.g. in the angle of the acute accent between 
French and Polish, but they are slight enough that ignoring them is a 
viable option.

Finally, if crude output is all that is desired, the differences in the Han 
styles are slight enough that the text remains legible, even if mainland 
Chinese is rendered in a non-mainland font. Furthermore, one commonly 
allows the user to select a default fontstyle that matches their local 
background, and seeing the other language rendered in ones own font tends 
to make it more readable. (Printed publications where the same tradeoffs 
were made exist as well.)

A./