On Thu, Oct 02, 2003 at 05:15:32PM +0900, - Edwin - wrote:
: 
: On Thu, 2 Oct 2003 01:54:43 -0500 Eugene Lee <[EMAIL PROTECTED]> wrote:
: > 
: > There's two issues: input and output.
: > 
: > HTML character references address the problem of displaying certain
: > characters on a web browser.  This is an output issue.
: > 
: > When you get CJKV data, you are most likely getting it in some
: > encoding. Different Asian languages use their own encoding sets. 
: > For example, if you get Japanese text, it will be encoded in JIS,
: > Shift-JIS, EUC, or something Unicode.  You *have* to determine the
: > type of data and its encoding.  This is an input issue.
: 
: Hmm... but the characters in question were already (at least in the
: examples used) in "something" Unicode (&#nnnnn;). So, there's really
: no need to know whether it's JIS, Shift-JIS, EUC, etc.

The Chinese characters in question were already converted to their HTML
numeric character references.  That's because someone made the conscious
and purposeful decision to provide the correct Unicode decimal numbers
for those Chinese characters.  Your concern about accepting CJKV data is
vague because you don't explain the source of the data or the encoding
method of the data.  You must determine these critical bits of info
before you can decide how to display the data.  There's no guarantee
that the user will send you CJKV data in nice HTML numeric character
references.  I'm not even sure exactly what you're trying to do.

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to