Re: [Mailman-Developers] Improving the archives

2007-07-26 Thread Dale Newfield
Jeff Breidenbach wrote: So I just looked at 2 million raw messages from 2007, spread over a few thousand mailing lists (all data is from mail-archive.com). My first question was: when comparing only with messages from the same list, how many times do I see a repeated message-id? The answer

Re: [Mailman-Developers] Improving the archives

2007-07-26 Thread Jeff Breidenbach
If you improve the script or find numbers that lead to different conclusions, now's the time to know! Live and learn! So I just looked at 2 million raw messages from 2007, spread over a few thousand mailing lists (all data is from mail-archive.com). My first question was: when comparing only

Re: [Mailman-Developers] Improving the archives

2007-07-26 Thread Jeff Breidenbach
If you are relying on the sender to do the right thing, then why not force them to create proper message-ids? I think Barry's proposal is essentially a numbers game, i.e. he's hoping for significantly better results using Date in the calculation than not using it.
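
To make the "numbers game" concrete, here is a minimal sketch of the kind of count discussed in this thread: how many messages on the same list repeat a key, with and without Date included in the key. This is not the script mentioned above and not necessarily the exact form of Barry's proposal; the function name `count_repeats`, the tuple layout, and the sample data are all assumptions made up for illustration.

```python
from collections import Counter


def count_repeats(messages, use_date=False):
    """Count messages whose key was already seen on the same list.

    `messages` is an iterable of (list_name, message_id, date) tuples.
    This layout is hypothetical; a real run would parse the headers
    out of the raw archive first.
    """
    seen = Counter()
    for list_name, message_id, date in messages:
        # Only compare within one list, so the list name is part of the key.
        if use_date:
            key = (list_name, message_id, date)
        else:
            key = (list_name, message_id)
        seen[key] += 1
    # Every occurrence after the first one counts as a repeat.
    return sum(n - 1 for n in seen.values() if n > 1)


# Invented sample data, for illustration only.
sample = [
    ("list-a", "<1@example.org>", "Thu, 26 Jul 2007 10:00:00 -0700"),
    ("list-a", "<1@example.org>", "Thu, 26 Jul 2007 10:00:00 -0700"),  # same id, same date
    ("list-a", "<2@example.org>", "Thu, 26 Jul 2007 11:00:00 -0700"),
    ("list-a", "<2@example.org>", "Thu, 26 Jul 2007 12:30:00 -0700"),  # same id, different date
    ("list-b", "<1@example.org>", "Thu, 26 Jul 2007 10:00:00 -0700"),  # other list, not compared
]

repeats_id_only = count_repeats(sample)
repeats_with_date = count_repeats(sample, use_date=True)
print(repeats_id_only, repeats_with_date)
```

Running both variants over the same corpus and comparing the two totals would show whether adding Date actually changes the result by a meaningful amount.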