New submission from Mingye Wang:

Mappings for 0x81 and 0x8D in multiple Windows code pages diverge from what 
Windows does. Attached is a script that tests for this behavior. (These two 
bytes are not necessary the only problems, but for sure they are the most 
widespread and famous ones. Again, refer to Unicode best fit for something that 
works.)

This problem is seen in Python 2.7.10 on Windows 10b14959, but apparently it is 
known since long ago[1]. Python 3.4.3 on Cygwin also fails 
``b'\x81\x8d'.encode('cp1252')``.
  [1]: https://ftfy.readthedocs.io/en/latest/#module-ftfy.bad_codecs.sloppy

----------
components: Unicode
files: pycp.py
messages: 280914
nosy: Artoria2e5, ezio.melotti, haypo
priority: normal
severity: normal
status: open
title: Non-Windows mappings for a couple of Windows code pages
type: behavior
versions: Python 2.7, Python 3.3, Python 3.4, Python 3.5, Python 3.6, Python 3.7
Added file: http://bugs.python.org/file45497/pycp.py

_______________________________________
Python tracker <rep...@bugs.python.org>
<http://bugs.python.org/issue28712>
_______________________________________
_______________________________________________
Python-bugs-list mailing list
Unsubscribe: 
https://mail.python.org/mailman/options/python-bugs-list/archive%40mail-archive.com

Reply via email to