Hi!

I want to write a method that converts text into HTML-safe characters.
So I have to replace "<", ">", "&" and the double quote with their named entities
(&lt;, &gt;, &amp; and &quot;). That much is clear.

But what about Unicode and characters outside the ASCII range (code 128 and above)?

I think that, whether or not the text contains Unicode characters, it is OK to
substitute every character outside ASCII (code 128 and above) with a numeric
character reference of the form &#xxx;
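
To make the idea concrete, here is a minimal sketch of what I have in mind. It is written in Python only for illustration. The names NAMED and to_html are made up, and I am assuming the fourth special character is the double quote. It uses the named entities for the markup characters and a decimal reference for everything outside ASCII.

```python
# Hypothetical helper, not a library API: escape text for embedding in HTML.
NAMED = {"&": "&amp;", "<": "&lt;", ">": "&gt;", '"': "&quot;"}

def to_html(text: str) -> str:
    out = []
    for ch in text:
        if ch in NAMED:
            # Markup-significant characters get their named entity.
            out.append(NAMED[ch])
        elif ord(ch) > 127:
            # Anything outside ASCII becomes a decimal character reference.
            out.append(f"&#{ord(ch)};")
        else:
            out.append(ch)
    return "".join(out)

print(to_html('1 < 2 & "ä"'))  # prints: 1 &lt; 2 &amp; &quot;&#228;&quot;
```

The number in &#xxx; is the character's Unicode code point, so this only works if the text has already been decoded into Unicode characters.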

But how do I find the right Unicode code point for the characters 128-255?

I think the first 256 Unicode characters are identical to ISO-8859-1 (is this
correct?).
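
One quick way to test that assumption, sketched in Python using its built-in "latin-1" codec (Python's name for ISO-8859-1), is to decode all 256 byte values and compare each resulting code point with the original byte:

```python
# Decode every possible byte value as ISO-8859-1 (Python codec "latin-1").
data = bytes(range(256))
text = data.decode("latin-1")

# True if byte value N always decodes to Unicode code point N.
same = all(ord(ch) == b for ch, b in zip(text, data))
print(same)  # prints: True
```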

And what if I want to substitute Greek characters (U+0370-U+03FF in Unicode,
also covered by ISO-8859-7)?
How do I know what to substitute for each character?
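
My guess is that the bytes first have to be decoded with the charset they were written in, and the reference is then built from the resulting code point. A small sketch of that guess in Python, using its standard "iso-8859-7" codec; the sample bytes and variable names are just for illustration:

```python
# Three bytes that mean "alpha beta gamma" in ISO-8859-7.
raw = b"\xe1\xe2\xe3"

# Decode with the source charset to get real Unicode characters.
text = raw.decode("iso-8859-7")

# Build decimal character references from the Unicode code points.
refs = "".join(f"&#{ord(ch)};" if ord(ch) > 127 else ch for ch in text)
print(refs)  # prints: &#945;&#946;&#947;
```

Decoding the same bytes as ISO-8859-1 would give different characters (and different references), so the source charset has to be known in advance.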


Thanks for any help,
michi

