[issue42433] mailbox.mbox fails on non ASCII characters

2020-12-05 Thread Terry J. Reedy
Terry J. Reedy added the comment: 3.7 only gets security fixes. If and when someone merges something, that person will decide whether to backport. -- versions: -Python 3.7, Python 3.8, Python 3.9 ___ Python tracker

[issue42433] mailbox.mbox fails on non ASCII characters

2020-12-05 Thread Florian Klink
Florian Klink added the comment: Based on https://bugs.python.org/issue42433#msg382169 I added back the versions that bug is present. The PR is up to and appropriately linked (I think?) - let me know if there's anything left to be done from my side. -- versions: +Python 3.7, Python

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-30 Thread R. David Murray
R. David Murray added the comment: After thinking about it some more, I think given that when there is no non-ascii mbox will happily treat *anything* as valid on the "From " line, that we should consider blowing up on non-ascii to be a bug. --

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-29 Thread Florian Klink
Florian Klink added the comment: I opened https://github.com/python/cpython/pull/23553 - PTAL. I made this an enhancement for 3.10 - but it could probably also be backported to older versions -- keywords: +patch message_count: 4.0 -> 5.0 pull_requests: +22434 stage: -> patch review

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-27 Thread Terry J. Reedy
Terry J. Reedy added the comment: (The non-ascii chars are “ and ”, versus ascii ".) Florian, although you did not select a 'Type', selecting multiple versions implicitly claims that the current behavior is a bug. I believe R.David has explained that it is not, even if sub-optimal. Do you

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-27 Thread Terry J. Reedy
Change by Terry J. Reedy : -- versions: +Python 3.10 -Python 3.7 ___ Python tracker ___ ___ Python-bugs-list mailing list

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-22 Thread Florian Klink
Florian Klink added the comment: Yeah, not questioning here this might be badly formatted, but given these files are out there, and the parser is somewhat forgiving in other cases, it should be tolerant there as well. -- ___ Python tracker

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-22 Thread R. David Murray
R. David Murray added the comment: The problem with that archive is that it is not in proper mbox format. It contains the following line (5689): From here I was hoping to run something like “dbus-send –system –dest=Test.Me –print-reply /Japan Japan.Reset.Test string:”Hello”” You will

[issue42433] mailbox.mbox fails on non ASCII characters

2020-11-22 Thread Florian Klink
New submission from Florian Klink : I'm importing some mbox archives into my maildirs, and use `mailbox.mbox` to parse archives created by pipermail. Some of these archives seem to contain non-ascii characters, and python just throws a `UnicodeDecodeError` and refuses to process the archive.