On Thursday 03 August 2006 14:44, Lars Gullik Bjønnes wrote: > Right. The internal format (which will be ucs-4, unless somehting > happens) has little or nothing to do with the external format.
I know and that is how it work in python as well, the internal representation is ucs-4. Not that I care. :-) Initially we decode the input encoding to the internal representation and later we encode it to the output representation. When I referred before I was talking about using ucs-4 and ucs-2 for the external format. I propose to leave the inputencoding in the file. After all xml allows it there, so why should not we? > -- > Lgb -- José Abílio