Lars Gustäbel added the comment:

IIRC, tarfile under 2.7 has never been explicitly unicode-safe, support for 
unicode objects is heterogeneous at best. The obvious work-around is to work 
exclusively with str objects.

What we can't do is to decode the utf-8 pathname from the archive to a unicode 
object, because we have no way to detect an archive's encoding. We can either 
emit a warning if the user passes a unicode object to extract() or we 
implicitly encode the passed unicode object using TarFile.encoding, so that the 
os.path.join() succeeds.

Unfortunately, I am not entirely sure if there was possibly a rationale behind 
the current behaviour of extract(). This needs more inspection.

----------

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue17153>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to