I'm trying to import text from an OpenOffice document (saved as .sxw, then reading the data from content.xml inside the sxw archive using ElementTree and similar tools).
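
For reference, that kind of extraction can be sketched roughly as follows. This is a minimal sketch, assuming Python 3 and only the standard-library `zipfile` and `xml.etree.ElementTree` modules; the names `read_sxw_text` and `make_sample_sxw` are hypothetical helpers made up for illustration, and the sample archive is a stand-in, not a real OpenOffice file. Passing the raw bytes to the parser lets it decode them according to the XML declaration, so the result is already a Unicode string.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def read_sxw_text(source):
    """Return all text found in content.xml of an .sxw archive as one str.

    `source` may be a file path or a binary file-like object.
    (Hypothetical helper, not part of any library.)
    """
    with zipfile.ZipFile(source) as archive:
        raw = archive.read("content.xml")  # raw bytes, not yet decoded
    # The parser decodes the bytes using the encoding named in the
    # XML declaration, so no codec is chosen by hand here.
    root = ET.fromstring(raw)
    return "".join(root.itertext())


def make_sample_sxw():
    """Build a tiny in-memory stand-in for an .sxw file (illustration only)."""
    xml = '<?xml version="1.0" encoding="UTF-8"?><doc><p>\u201cquoted\u201d</p></doc>'
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as archive:
        archive.writestr("content.xml", xml.encode("utf-8"))
    buf.seek(0)
    return buf


text = read_sxw_text(make_sample_sxw())
```

In cp1252 the bytes 0x93 and 0x94 are the left and right curly double quotation marks (U+201C and U+201D), so characters like these would appear in `text` as ordinary Unicode quotes rather than as escape codes.
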
The encoding that gives me the fewest problems seems to be cp1252, but it isn't perfect: the text still contains characters like \93 or \94. Has anyone handled this before? I'd rather not reinvent the wheel and start translating strings 'by hand'.
Anton
--
http://mail.python.org/mailman/listinfo/python-list