On Sun, Feb 8, 2015 at 9:58 PM, Chris Angelico <ros...@gmail.com> wrote: > Those three characters are the CP-1252 decode of the bytes for U+2019 > in UTF-8 (E2 80 99). Not sure if that helps any, but given that it was > an XLSX file, Windows codepages are reasonably likely to show up.
Thanks, Chris. Are you telling me I should have defined the input file encoding for my CSV file as CP-1252, or that something got hosed on the export from XLSX to CSV? Or something else? Skip -- https://mail.python.org/mailman/listinfo/python-list