I've got an RTF document which contains this kind of encoding: \cf2 \'c3\'e5\u769?\'ed\'e5\'f3\'e9\'f2\cf0
That renders the word "Genesis" in the Greek, i.e. \'c3\'e5 is the capital gamma. As seen in another app which uses this RTF natively:
<<inline: rtf-genesis.png>>
I need to find a scriptable way to convert this kind of encoding to UTF-8. I've tried a few things (and Chris has offered a couple more variants) of this general flavor: perl -CO -pe 'use Encode; s/\\\'([0-9a-fA-F]{2})/decode("cp1252", chr(hex($1)))/eg' But at best I seem to get what amounts to a format-shifted identity function (\'ab becomes an actual 0xAB byte) which does me no good. Any ideas?
_______________________________________________ sword-devel mailing list: sword-devel@crosswire.org http://www.crosswire.org/mailman/listinfo/sword-devel Instructions to unsubscribe/change your settings at above page