On 4/24/20 6:32 PM, Thomas Coradeschi via Mailman-Developers wrote: > > I have a test list (called ’test’) and can see that > /var/lib/mailman/archives/private/test/2020-April.txt and > /var/lib/mailman/archives/private/test.mbox/test.mbox have both been created > and have messages being written to them, but the html archive pages are not > being built. > > manually running '/var/lib/mailman/bin/arch test’ gives me
Note, do not run /var/lib/mailman/bin/arch more than once on a list without the --wipe option. > #00000 <1f9f4dc9-af84-4d49-b1ee-abdc0a612...@skylands.ibmwr.org> > figuring article archives > 2020-April > Pickling archive state into > /var/lib/mailman/archives/private/test/pipermail.pck > Traceback (most recent call last): > File "bin/arch", line 201, in <module> > main() > File "bin/arch", line 189, in main > archiver.processUnixMailbox(fp, start, end) > File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 596, in > processUnixMailbox > self.add_article(a) > File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 640, in > add_article > author = fixAuthor(article.decoded['author']) > File "/var/lib/mailman/Mailman/Archiver/pipermail.py", line 63, in fixAuthor > while i>0 and (L[i-1][0] in lowercase or > UnicodeDecodeError: 'ascii' codec can't decode byte 0xaa in position 26: > ordinal not in range(128) This looks like a manifestation of an issue we've seen before. There are multiple threads on this issue in the archive of the mailman-us...@python.org list The bulk of it is at <https://mail.python.org/pipermail/mailman-users/2019-March/thread.html> in threads with Subject: [Mailman-Users] Uncaught runner exception The bottom line is in <https://mail.python.org/pipermail/mailman-users/2019-March/084280.html>. We could never figure out where it was coming from, but the import from string import lowercase in /var/lib/mailman/Mailman/Archiver/pipermail.py was returning a string that contained many accented characters in addition to the 26 letters a-z, namely the iso-8859-1 encoding of 'abcdefghijklmnopqrstuvwxyzµßàáâãäåæçèéêëìíîïðñòóôõöøùúûüýþÿ' This is some packaging quirk in (I think) Debian's Python. If you follow all through the threads, you'll see several suggested patches for diagnosis and avoidance, and there is also a thread at <https://mail.python.org/pipermail/mailman-users/2019-May/084432.html>. All that notwithstanding, I think this is the best patch for avoiding/fixing the issue. === modified file 'Mailman/Archiver/pipermail.py' --- Mailman/Archiver/pipermail.py 2018-05-03 21:23:47 +0000 +++ Mailman/Archiver/pipermail.py 2020-04-25 02:13:46 +0000 @@ -60,7 +60,7 @@ else: # Mixed case; assume that small parts of the last name will be # in lowercase, and check them against the list. - while i>0 and (L[i-1][0] in lowercase or + while i>0 and (L[i-1][0] in lowercase[:26] or L[i-1].lower() in smallNameParts): i = i - 1 author = SPACE.join(L[-1:] + L[i:-1]) + ', ' + SPACE.join(L[:i]) -- Mark Sapiro <m...@msapiro.net> The highway is for gamblers, San Francisco Bay Area, California better use your sense - B. Dylan _______________________________________________ Mailman-Developers mailing list -- mailman-developers@python.org To unsubscribe send an email to mailman-developers-le...@python.org https://mail.python.org/mailman3/lists/mailman-developers.python.org/ Mailman FAQ: https://wiki.list.org/x/AgA3 Security Policy: https://wiki.list.org/x/QIA9