jdow writes: > I have two copies of the same message content and source sent two > minutes apart. These are the only differences in the messages as > I trimmed out the various verification data and differing times. > > ===8<--- > $ diff first second > 0a1 > > Status: U > 6c7 > < by mx-avoceta.atl.sa.earthlink.net (EarthLink SMTP Server) with SMTP > id > --- > > by mx-jacana.atl.sa.earthlink.net (EarthLink SMTP Server) with SMTP id > 9c10 > < by smtpout02.lax.untd.com with SMTP id > --- > > by smtpout01.lax.untd.com with SMTP id > 72a74,76 > > > > > > > ===8<--- > > Of course the various "id" strings all differ as well. > > The first message scored Bayes 80. The second scored Bayes 95. This > implies that Bayes is training itself on garbage as well as message > content. > > Since other sources of filtering deal with the Received: lines and the > message header id lines should Bayes be paying any attention to them, too? > > Should an id string like L8QWHGMP or an X-UNTD-OriginStamp line such as > below figure into the Bayes algorithm at all? > > X-UNTD-OriginStamp: qTKGdH6+6PX6q6wVyyDAiKpzgjuM3gNrL/xEOWaR9Ko1VNgBJE6wCw== > R
yes, to all questions. ;) Bayes is good with that stuff. --j.