Angus Leeming wrote:
Abdelrazak Younes <[EMAIL PROTECTED]> writes:
Can you try to change the utf-8 to ucs-4 conversion to use either
"UCS-4BE" or "UCS-4LE", instead of "UCS-4"? Also the conversion the
other way ucs-4 -> ucs-2, try with UCS-2BE and UCS-2LE.
And with the attached patch where I have put LE everywhere, the text is
displayed correctly but the inset buttons are not. I guess I have gone
too far... we need a conbination of the second patch
(unicode_little_endian) and this one (unicode_little_endian_full).
Intel-based PCs use Little Endian byte order. Often Windows file formats use
the BOM (http://en.wikipedia.org/wiki/Byte_Order_Mark) to make it trivial for
the executable decoding the UTF-8 file format to decide whether the file was
stored in Big Endian or Little Endian format. The BOM tends not to be used on
Unix machines though as it messes with the sh-bang mechanism and isn't
actually needed at all anyway...
We really need a Microsoft expert for lyx ;-)
I hope you won't live us in the dark once you moved on to the US.
The fact that you need to tell LyX to convert your files from UTF-8 to the
Little Endian flavour of UCS-4 etc suggests that your UTF-8 files are encoded
in Big Endian format.
If that is the case and IIUC it is lyx2lyx fault then, isn't it?
What happens if you run such a file through iconv,
converting explicitly to UTF-8LE? Do the two files compare identical or are
they indeed changed?
I'll try to do that.
The labels (Chapter, Foot, etc) are extracted separately from the layout
files, no? (And translated using the gettext machinery.) That, therefore, is a
secondary issue.
Well, the reference labels (and cross-reference) are part of the lyx
format aren't they?
Abdel.