Jeff Breidenbach wrote:
So I just looked at 2 million raw messages from 2007, spread over
a few thousand mailing lists (all data is from mail-archive.com). My
first question was - when comparing only with messages from the
same list - how many times do I see a repeated message-id? The
answer
If you improve the script or find numbers that lead to different
conclusions, now's the time to know!
Live and learn!
So I just looked at 2 million raw messages from 2007, spread over
a few thousand mailing lists (all data is from mail-archive.com). My
first question was - when comparing only
If you are relying on the sender to do the right thing, then
why not force them to create proper message-ids?
I think Barry's proposal is essentially a numbers game - e.g.
he's hoping for significantly better results using Date in
the calculation than not using it.