I've got an RTF document which contains this kind of encoding:

 \cf2 \'c3\'e5\u769?\'ed\'e5\'f3\'e9\'f2\cf0

That renders the word "Genesis" in the Greek, i.e. \'c3\'e5 is the
capital gamma.  As seen in another app which uses this RTF natively:

<<inline: rtf-genesis.png>>

I need to find a scriptable way to convert this kind of encoding to
UTF-8.  I've tried a few things (and Chris has offered a couple more
variants) of this general flavor:

perl -CO -pe 'use Encode; s/\\\'([0-9a-fA-F]{2})/decode("cp1252", 
chr(hex($1)))/eg'

But at best I seem to get what amounts to a format-shifted identity
function (\'ab becomes an actual 0xAB byte) which does me no good.

Any ideas?
_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://www.crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page

Reply via email to