Re: [Mailman-Developers] [Bug 985149] Add List-Post value to permalink hash input

2012-04-24 Thread Stephen J. Turnbull
On Tue, Apr 24, 2012 at 1:40 PM, Jeff Breidenbach j...@jab.org wrote: Is 4 bytes too short? Four characters is only about a million combinations. First collision is 50% likely at 1200 messages, and multi-million message databases are completely screwed. If we're willing to impose

Re: [Mailman-Developers] Speaking about kitties (or archivers)

2012-04-24 Thread Toshio Kuratomi
On Mon, Apr 23, 2012 at 06:20:18PM -0400, Barry Warsaw wrote: Thanks for posting this Pierre-Yves! On Apr 23, 2012, at 08:17 PM, Pierre-Yves Chibon wrote: archive-core (store the emails and expose them through an API) -- archivers/stats/NNTP The questions are then: - how do we store

Re: [Mailman-Developers] [Bug 985149] Add List-Post value to permalink hash input

2012-04-24 Thread Stephen J. Turnbull
Thanks! On Wed, Apr 25, 2012 at 2:50 AM, Jeff Breidenbach j...@jab.org wrote: 0. Assume a 10 million message archive. 1. What percentage of permalinks need another click? 2. What percentage of permalinks will result in a list of more than 10 matches? Ignoring cross posts, for a 4 character

Re: [Mailman-Developers] [Bug 985149] Add List-Post value to permalink hash input

2012-04-24 Thread Jeff Breidenbach
I apologize, the simulation code had a flaw. I'm embarrassed that I didn't immediately recognize this immediately from intuition. We could get even more accurate results by computing actual SHA-1 of actual message-ids, but I'm not sure it is worth the effort. Here is a revised program and the