[issue9593] utf8 codec readlines error after "\x85 "

2010-08-13 Thread Joseph Copenhaver
Joseph Copenhaver added the comment: It is better, thanks. -- ___ Python tracker <http://bugs.python.org/issue9593> ___ ___ Python-bugs-list mailing list Unsub

[issue9593] utf8 codec readlines error after "\x85 "

2010-08-13 Thread Joseph Copenhaver
Joseph Copenhaver added the comment: I now recognize the issue was in regard to format problems and not python, but the area where this code will be used requires the use of the codecs module. Is there any way to get the efficiency of codecs I/O readlines() chunking behavior and specify a

[issue9593] utf8 codec readlines error after "\x85 "

2010-08-13 Thread Joseph Copenhaver
New submission from Joseph Copenhaver : The IO readlines() facility incorrectly processes utf8 files for some unknown reason. Specifically, the call generates too many entries in the lines array result after a character sequence "\x85 blah" which gets cut as ("\x85 ","