    Gilles> ======
    Gilles> m = try.search(the_page)
    Gilles> if m:
    Gilles>     # UnicodeEncodeError: 'charmap' codec can't encode characters
    Gilles>     # in position 49-55: character maps to <undefined>
    Gilles>     title = m.group(1).decode('shift_jis').strip()
    Gilles> ======

    Gilles> Has someone successfully accessed Shift-JIS-encoded Japanese
    Gilles> contents with Python?

Have you verified that the characters in position 49-55 are actually valid
Shift-JIS?  In my experience, problems decoding a source string in a given
character set are usually caused by errors in the source, not by errors in
Python.
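
One way to check this is to decode the raw bytes and see where, if anywhere,
the codec gives up.  The following is only a rough sketch, written for
Python 3; the function name first_bad_shift_jis_span and the sample byte
strings are made up for illustration and are not taken from the original
script.

```python
def first_bad_shift_jis_span(raw):
    """Return (start, end, offending_bytes) for the first span of raw that
    does not decode as Shift-JIS, or None if raw decodes cleanly."""
    try:
        raw.decode("shift_jis")
    except UnicodeDecodeError as exc:
        # exc.start and exc.end are byte offsets into raw
        return exc.start, exc.end, raw[exc.start:exc.end]
    return None


# Hypothetical inputs: three kanji encoded as Shift-JIS, and a byte string
# ending in 0xff, which is not a valid Shift-JIS byte.
good = "\u65e5\u672c\u8a9e".encode("shift_jis")
bad = b"abc\xff"

print(first_bad_shift_jis_span(good))
print(first_bad_shift_jis_span(bad))
```

If this returns None for the page contents, the source bytes are probably
fine and the problem lies in a later step.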

OTOH, the characters in position 49-55 look like plain old ASCII to me.
Does Shift-JIS have ASCII as a proper subset?
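
A quick experiment can answer part of that question, at least as far as
Python's own shift_jis codec is concerned.  This is a sketch for Python 3
that only covers letters, digits and some common punctuation.  It
deliberately leaves out bytes 0x5C and 0x7E, which JIS X 0201 assigns to the
yen sign and the overline rather than to backslash and tilde, so those two
need to be checked separately.

```python
import string

# Letters, digits and a little punctuation; 0x5C and 0x7E are excluded
# on purpose because JIS X 0201 defines them differently from ASCII.
sample = string.ascii_letters + string.digits + " .,:;!?-"

as_ascii = sample.encode("ascii")
as_sjis = sample.encode("shift_jis")

same_bytes = as_ascii == as_sjis                       # identical encodings?
round_trips = as_sjis.decode("shift_jis") == sample    # survives a round trip?

print(same_bytes, round_trips)
```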

-- 
Skip Montanaro - [EMAIL PROTECTED] - http://smontanaro.dyndns.org/
--
http://mail.python.org/mailman/listinfo/python-list
