On Thursday 2008-02-07, Rafaella Braconi wrote:
> Dear All,
>
> please find the latest tmx (based on the 2.4 release) at :
> http://ooo.services.openoffice.org/pub/OpenOffice.org/cws/upload/localization/tmx24/
>
> Great thanks to Frank and Ivo!
>
> Rafaella
>
I had a _very_ quick look at some of them, and it seems that the Help file for zh-CN and the Vietnamese UI and Help files have some encoding problems. All the others that I tried (mostly UI files) seemed OK. (I didn't try all of them.)

Can you perhaps have a look at these files at least? Perhaps some validation against the others would be a good idea to make sure they are OK?

I simply ran pocount over the files to see how big they are, and the parser complained because of incorrect UTF-8 data in the Vietnamese files. The zh-CN file just _looks_ wrong. For example:

    <tu tuid="282032">
      <tuv xml:lang="en-US">
        <seg>Example:</seg>
      </tuv>
      <tuv xml:lang="zh-CN">
        <seg>示0x00E4��:</seg>
      </tuv>
    </tu>

(The two characters before the colon are "U+FFFD REPLACEMENT CHARACTER", which carries the note: "used to replace an incoming character whose value is unknown or unrepresentable in Unicode".)

So I guess something went wrong somewhere.

Furthermore, I think it is important to note that these TMX files do not follow the same unescaping rules as the new conversions done by Translate Toolkit 1.1. Of course, TMX files corresponding to the new unescaping rules can be generated from the set of PO files with po2tmx from the Translate Toolkit:
http://translate.sourceforge.net/wiki/toolkit/po2tmx

Hopefully there are solutions for all preferred working methods.

Keep well
Friedel
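
The kind of validation suggested above could be sketched roughly as follows in Python. This is only an illustration under stated assumptions: the function names check_utf8 and check_tmx_dir are hypothetical and are not part of pocount or the Translate Toolkit, and the check looks only at the raw bytes (strict UTF-8 decoding plus a count of U+FFFD characters) without parsing the TMX structure.

```python
from pathlib import Path

REPLACEMENT = "\ufffd"  # U+FFFD REPLACEMENT CHARACTER


def check_utf8(data: bytes) -> dict:
    """Report whether data is valid UTF-8 and how many U+FFFD it contains.

    Hypothetical helper for illustration only.
    """
    try:
        text = data.decode("utf-8")  # strict: raises on invalid bytes
        error = None
    except UnicodeDecodeError as exc:
        # Drop the invalid bytes so that only U+FFFD characters that are
        # already stored in the file get counted below.
        text = data.decode("utf-8", errors="ignore")
        error = f"invalid byte at offset {exc.start}"
    return {
        "valid": error is None,
        "error": error,
        "replacement_chars": text.count(REPLACEMENT),
    }


def check_tmx_dir(directory: str) -> dict:
    """Run check_utf8 over every *.tmx file in a directory (assumed layout)."""
    return {
        path.name: check_utf8(path.read_bytes())
        for path in sorted(Path(directory).glob("*.tmx"))
    }
```

A file like the Vietnamese ones would presumably show up with "valid" set to False, while a file like the zh-CN Help one would decode cleanly but report a non-zero "replacement_chars" count, since U+FFFD is itself valid UTF-8.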