Re: [Mailman-Developers] Improving the archives

2007-08-07 Thread Dale Newfield
Jeff Breidenbach wrote:
> 5.85 million messages
> That's 0.03% if you count all the messages. It is 0.008% if you
> discard the top three offenders, all of which I have contacted.

I'd say that's a strong argument for just using the Message-ID and simplifying this tremendously... ...Barry, do yo

Re: [Mailman-Developers] Improving the archives

2007-08-07 Thread Jeff Breidenbach
> What we really want to know is how many (non-empty) Message-ID
> collisions are there that *don't* share a Date? This is the number of
> messages that only-messageid loses, and that the composite identifier
> method would not lose.

I took a look at a larger dataset, 5.85 million messages from s
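
For anyone who wants to run the same count on their own archive, here is a minimal sketch of the measurement described in the quoted question. It is not the script used for the numbers in this thread. It assumes the messages are already available as (message_id, date) pairs of header strings; the function name and the sample data are made up for illustration.

```python
from collections import defaultdict


def extra_losses_with_id_only(messages):
    """Count messages that Message-ID-only keying loses but that a
    composite (Message-ID, Date) key would keep.

    `messages` is an iterable of (message_id, date) string pairs.
    Empty Message-IDs are skipped, matching the "non-empty" condition.
    """
    dates_by_id = defaultdict(set)
    for message_id, date in messages:
        if message_id:  # ignore missing or empty Message-ID headers
            dates_by_id[message_id].add(date)
    # An ID seen with d distinct dates keeps 1 message under ID-only keying
    # and d messages under composite keying, so the difference is d - 1.
    return sum(len(dates) - 1 for dates in dates_by_id.values())


# Hypothetical sample: one ID reused with two different dates.
sample = [
    ("<a@example.org>", "Tue, 07 Aug 2007 10:00:00 -0400"),
    ("<a@example.org>", "Tue, 07 Aug 2007 10:00:00 -0400"),  # true duplicate
    ("<a@example.org>", "Wed, 08 Aug 2007 09:30:00 -0400"),  # same ID, new date
    ("<b@example.org>", "Tue, 07 Aug 2007 11:00:00 -0400"),
    ("", "Tue, 07 Aug 2007 12:00:00 -0400"),                 # empty ID, skipped
]
print(extra_losses_with_id_only(sample))
```

Dividing the result by the total message count gives a percentage that can be compared with the figures quoted in this thread. Dates are compared as raw header strings here, so two spellings of the same instant would count as different dates.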