two supposedly identical SA boxes, with slightly different report output -- help find the diff?

snowcrash+sa Tue, 28 Aug 2007 11:02:35 -0700

hi,

grr. i'm at that resorting-to-visine stage of wtf ... :-/


i've

    spamassassin --version
        SpamAssassin version 3.2.4-r564346
          running on Perl version 5.8.8

with, among numerous other ruls/plugins, FuzzyOcr/r330 installed.

i've just updated two supposedly identical boxes, building from clean
sources, and running the same setup scripts on both.

no errors in the installs.

on testing of FuzzyOcr image processing on one of its included test files with,

    spamassassin -D -t -x < FuzzyOcr/samples/ocr-animated.eml

i see in the debug output the following report on one box,

  ...
  Content analysis details:   (38.2 points, 4.0 required)

   pts rule name              description
  ---- ---------------------- --------------------------------------------------
   4.2 MID_DEGREES            MID_DEGREES
   3.7 CTYPE_8SPACE_GIF       BODY: Stock spam image part
'Content-Type' found (8
                              spc)
   0.0 HTML_MESSAGE           BODY: HTML included in message
   1.5 BAYES_50               BODY: Bayesian spam probability is 40 to 60%
                              [score: 0.4467]
   1.7 MIME_HTML_ONLY         BODY: Message only has text/html MIME parts
   2.5 HTML_IMAGE_ONLY_16     BODY: HTML: images with 1200-1600 bytes of words
   1.2 SARE_GIF_ATTACH        FULL: Email has a inline gif
   1.5 MY_CID_AND_STYLE       SARE cid and style
   2.9 DRUGS_STOCK_MIMEOLE    Stock-spam forged headers found (5510)
    16 FUZZY_OCR_KNOWN_HASH   BODY: Image with known hash
                              []
                              [Words found:]
                              ["investor" in 1 lines]
                              ["price" in 2 lines]
                              ["company" in 1 lines]
                              ["alert" in 1 lines]
                              ["valium" in 1 lines]
                              ["trade" in 1 lines]
                              ["banking" in 1 lines]
                              ["news" in 1 lines]
                              [(13.5 word occurrences found)]


and, similarly on the other box,

  ...
  Content analysis details:   (38.5 points, 4.0 required)

   pts rule name              description
  ---- ---------------------- --------------------------------------------------
   3.7 MID_DEGREES            MID_DEGREES
   1.6 CTYPE_8SPACE_GIF       BODY: Stock spam image part
'Content-Type' found (8
                              spc)
   0.0 HTML_MESSAGE           BODY: HTML included in message
   1.5 BAYES_50               BODY: Bayesian spam probability is 40 to 60%
                              [score: 0.4467]
   1.5 MIME_HTML_ONLY         BODY: Message only has text/html MIME parts
   1.5 HTML_IMAGE_ONLY_16     BODY: HTML: images with 1200-1600 bytes of words
   1.2 SARE_GIF_ATTACH        FULL: Email has a inline gif
   1.5 MY_CID_AND_STYLE       SARE cid and style
   3.5 DRUGS_STOCK_MIMEOLE    Stock-spam forged headers found (5510)
    18 FUZZY_OCR_KNOWN_HASH   BODY: Image with known hash
                              []
                              [Words found:]
                              []
                              [(13.5 word occurrences found)]


NOTE the "words found" detail in the second box's debug output :-/

trying to find what's causing the different output, i've pored over
the debug output, googl'd the lists, diff'd the config files, etc.

nada.  to my weary eye, all looks "the same".

obviously, it's not.

any hints/suggestions as to what i might've missed? how to find it?

thanks!

two supposedly identical SA boxes, with slightly different report output -- help find the diff?

Reply via email to