On Mon, Feb 13, 2012 at 12:42 PM, Ulrike Fischer <ne...@nililand.de> wrote:
> Am Fri, 10 Feb 2012 12:14:15 +0100 schrieb luigi scarso:
>
>> if you mean ASCII with coderange 0-255 *and*  ISO-8859-1 (Latin 1)
>> encoding there is no need to conversion;
>
> This is not true. You are mixing up unicode positions and utf8
> encoding.
>
> E.g. "ä" has the same position in unicode and latin1 (dez 228, hex
> E4). But its utf8 code consist of 16 bits (1100001110100100, hex
> c3a4) while its latin 1 code is 8-bit long (11100100).
ah yes you are right -- I've made the implicit assumption that his
file was already utf-8 encoded .
 I'm using only utf-8 from long time and I almost forget about
! String contains an invalid utf-8 sequence.

system          > tex > error on line 10 in file t1.txt: String
contains an invalid utf-8 sequence ...


(I believe  he met the error during the next tries because he wrote
> I cannot \input the file as this is not a valid ConTeXt source.
)
What I meant was, as I wrote below,
"To allow backward compatibility, the 128 ASCII and 256 ISO-8859-1
(Latin 1) characters are assigned Unicode/UCS code points that are the
same as their codes in the earlier standards"
and this is true only for iso-8859-1 .

-- 
luigi
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the 
Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : http://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________

Reply via email to