On Thu, 2 Oct 2003 01:54:43 -0500
Eugene Lee <[EMAIL PROTECTED]> wrote:

> On Wed, Oct 01, 2003 at 02:46:14PM -0400, Gerard Samuel wrote:
> : 
> : When I say that I don't know what characters Im expecting.
> : Im not talking about normal html entities, like &amp; &nbsp &lt;
> : Im talking about chinese/japanese/korean/taiwanese alphabet,
> numbers : (even punctuation if applicable).
> : Maybe Im thinking too hard, but trying to take far east languages 
> : alphanumeric charaters into account,
> : seems like overkill.
> : Feel free to correct me.
> 
> Okay, I will.  :-)
> 
> There's two issues: input and output.
> 
> HTML character references address the problem of displaying certain
> characters on a web browser.  This is an output issue.
> 
> When you get CJKV data, you are most likely getting it in some
> encoding. Different Asian languages use their own encoding sets. 
> For example, if you get Japanese text, it will be encoded in JIS,
> Shift-JIS, EUC, or something Unicode.  You *have* to determine the
> type of data and its encoding.  This is an input issue.

Hmm... but the characters in question were already (at least in the
examples used) in "something" Unicode (&#nnnnn;). So, there's really
no need to know whether it's JIS, Shift-JIS, EUC, etc.

?

- E -
__________________________________________________
Do You Yahoo!?
Yahoo! BB is Broadband by Yahoo!
http://bb.yahoo.co.jp/

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php

Reply via email to