New submission from Mark Tolonen <[email protected]>:
This is on Windows 7 SP1. Run 'chcp 65001', then start Python from a console.
Note the extra characters printed when the string contains non-ASCII
characters. At a guess, it appears to be using the UTF-8 byte length of the
internal representation instead of the character count.
Python 3.3.0a1 (default, Mar 4 2012, 17:27:59) [MSC v.1500 32 bit (Intel)] on win32
Type "help", "copyright", "credits" or "license" for more information.
>>> print('hello')
hello
>>> print('p\u012bny\u012bn')
pīnyīn
n
>>> print('\u012b'*10)
īīīīīīīīīī
�īīīī
�ī
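The guess above can be illustrated with a small model. This is only a sketch of one way a byte/character length mismatch could produce repeated tails, not a confirmed diagnosis of the console or of Python's I/O code. It assumes that the write call reports the number of characters displayed rather than the number of bytes consumed, and that the caller retries with whatever it believes is left. The function name simulate_console_write is hypothetical.

```python
def simulate_console_write(text):
    """Return the chunks a retry loop would display under the assumed miscount."""
    data = text.encode("utf-8")
    chunks = []
    while data:
        # What a UTF-8 console would show for the remaining bytes; a chunk that
        # starts in the middle of a character shows U+FFFD for the stray byte.
        shown = data.decode("utf-8", errors="replace")
        chunks.append(shown)
        # Assumed faulty behavior: the write reports characters, not bytes,
        # so the caller thinks the tail is unwritten and sends it again.
        reported = len(shown)
        data = data[reported:]
    return chunks


if __name__ == "__main__":
    for sample in ("hello\n", "p\u012bny\u012bn\n", "\u012b" * 10 + "\n"):
        # ascii() keeps the demo printable on any console encoding.
        print([ascii(chunk) for chunk in simulate_console_write(sample)])
```

Under these assumptions an ASCII-only string is written once, while 'p\u012bny\u012bn' is followed by a repeated trailing 'n', which resembles the session above.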
----------
components: IO, Unicode, Windows
messages: 155149
nosy: ezio.melotti, metolone
priority: normal
severity: normal
status: open
title: console w/ cp65001 displays extra characters for non-ascii strings.
type: behavior
versions: Python 3.3
_______________________________________
Python tracker <[email protected]>
<http://bugs.python.org/issue14227>
_______________________________________