Loren Wilton wrote:
Things are basically working, but you don't seem to have network tests enabled, and you haven't trained enough ham/spam messages yet for Bayes to kick in.

If you are starting SA using spamd, check for a -L parameter on the command line and remove it. That should enable network tests for you, and probably will help a lot.

The exact startup configuration is

/usr/sbin/spamd -D -m 20 -v -u vpopmail -d --round-robin -x -d --pidfile=/var/run/spamd.pid
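
A quick way to double-check a command line like the one above for the local-only switch is sketched below in Python. It assumes that -L and --local are the spellings spamd accepts for local-only mode, and it only looks at standalone tokens, so a bundled short-option group would be missed. The helper name is made up for illustration.

```python
import shlex

# The spamd command line quoted above, copied verbatim.
SPAMD_CMD = (
    "/usr/sbin/spamd -D -m 20 -v -u vpopmail -d --round-robin -x -d "
    "--pidfile=/var/run/spamd.pid"
)


def has_local_only_flag(command_line):
    """Return True if a standalone -L or --local token is present.

    Hypothetical helper: it does not handle bundled short options
    such as -dL, and it does not know which options take arguments.
    """
    tokens = shlex.split(command_line)
    # Skip tokens[0], which is the program path itself.
    return any(tok in ("-L", "--local") for tok in tokens[1:])


print(has_local_only_flag(SPAMD_CMD))
```

For this command line the check comes back False, so if network tests are off, the cause is presumably somewhere other than a -L switch.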



To get Bayes working, you need to train it with at least 200 ham and 200 spam messages. Once it has that many messages it will start to feel confident about adding to the score.
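
The rule described above can be written down as a small Python sketch. The defaults of 200 are the figure given in this message; the local.cf quoted further down in the thread lowers both minimums to 50 via bayes_min_spam_num and bayes_min_ham_num. The function name and the ham count in the example call are hypothetical, since the thread only reports the spam count.

```python
def bayes_ready(nspam, nham, min_spam=200, min_ham=200):
    """Return True once both message counts reach their minimums.

    Illustrative only: this mirrors the rule as described in the
    message, not SpamAssassin's actual implementation.
    """
    return nspam >= min_spam and nham >= min_ham


# 5 spam messages is the count reported in this thread; the ham count
# of 300 is a made-up value for the sake of the example.
print(bayes_ready(5, 300))
print(bayes_ready(5, 300, min_spam=50, min_ham=50))
```

Both calls report that Bayes is not ready: with only 5 spam messages learned, neither the 200 minimum nor the lowered 50 is met.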

       Loren


I have run sa-learn --showdots --spam . in a folder full of spam. I expected it to add entries to the Bayes database, but it still reports only 5 messages in Bayes.
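
One way to see what the database itself reports is sa-learn --dump magic, run as the same user that spamd scans as (vpopmail here, going by the -u option above); sa-learn normally writes to the Bayes files of whichever user runs it. A rough Python sketch for pulling the counters out of that output follows. The sample text is reconstructed, not captured: the column layout is written from memory of what sa-learn prints, and the numbers are the ones that appear in the debug output elsewhere in this thread (db version 3, 5 spam, 105828 tokens). The ham line is left out because the thread never states that count. The function name is hypothetical.

```python
import re

# Reconstructed sample, NOT real captured output (see the note above).
SAMPLE_DUMP = """\
0.000          0          3          0  non-token data: bayes db version
0.000          0          5          0  non-token data: nspam
0.000          0     105828          0  non-token data: ntokens
"""

# Four whitespace-separated columns, then the "non-token data:" label.
LINE_RE = re.compile(r"\s*\S+\s+\S+\s+(\d+)\s+\S+\s+non-token data: (.+)$")


def parse_dump_magic(text):
    """Map each 'non-token data' label to the integer in the third column."""
    counters = {}
    for line in text.splitlines():
        match = LINE_RE.match(line)
        if match:
            counters[match.group(2).strip()] = int(match.group(1))
    return counters


counters = parse_dump_magic(SAMPLE_DUMP)
print(counters.get("nspam"), counters.get("nham"))
```

If nspam does not move after a training run, that would suggest sa-learn is writing to a different database than the one spamd reads.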




----- Original Message -----
From: "Jimmy Stewpot" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Cc: <users@spamassassin.apache.org>
Sent: Wednesday, March 28, 2007 1:37 AM
Subject: Re: spamassassin not checking emails correctly.


--[ UxBoD ]-- wrote:
First things first. Could you run spamassassin -D --lint as the user that is scanning the email, and post the results, please? This will allow people to diagnose the problem more easily.

Regards,

UxBoD

On Wed, 28 Mar 2007 09:12:20 +0100, Jimmy Stewpot <[EMAIL PROTECTED]> wrote:
Hello,

I have recently installed SpamAssassin on my new Ubuntu system from
the apt package. Emails that are obviously spam are not being marked.

X-Spam-Status: No, score=0.0 required=5.0 tests=none autolearn=no
version=3.1.7-deb

That is what appears in the headers of the email, but the body of the
email contains URLs that are on the blacklists. If I forward these
emails to my ISP account, which has its own spam solution, they get
marked.
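
To make the header above easier to read at a glance, here is a small Python sketch that splits an X-Spam-Status value into its fields. The header text is the one quoted above, with the folded line joined. The function name is hypothetical, and the parsing is a simplification that assumes the tests list contains no embedded whitespace.

```python
import re

# Value of the X-Spam-Status header quoted above, unfolded onto one line.
HEADER_VALUE = (
    "No, score=0.0 required=5.0 tests=none autolearn=no version=3.1.7-deb"
)


def parse_spam_status(value):
    """Split an X-Spam-Status value into a verdict and key=value fields."""
    verdict, _, rest = value.partition(",")
    fields = dict(re.findall(r"(\w+)=(\S+)", rest))
    tests = fields.get("tests", "none")
    return {
        "is_spam": verdict.strip().lower() == "yes",
        "score": float(fields["score"]),
        "required": float(fields["required"]),
        # "none" means no rule fired at all.
        "tests": [] if tests == "none" else tests.split(","),
    }


status = parse_spam_status(HEADER_VALUE)
print(status["is_spam"], status["score"], status["tests"])
```

The empty tests list is the notable part: no rule hit at all, not even a local header or body rule.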

Here is the body of the email

=====SNIP=======

Hello, share

 >> Don't have time for a full time relationship?
Many young career minded people don't but still want a physical
relationship,
many of these need sexual encounters but without the frustrating
attachment of a boyfriend or girlfriend.
This means they have time to concentrate on their profession/career and
not worry about
what is going on at home, as essentially they are single.
This is commonly becoming known as a
[geocities URL HERE]
skittle chesapeake boycott ripple grandchildren anglicanism flora
yaounde lawson, offshore inhere.
ampere terse hoofmark computation nero evildoer cause downcast, wolfish
squirehood
bucharest creamy marin, goa strand bulrush january.
fable ultimatum rate, cerise bluebonnet steiner travesty.

Your Tad.

=====SNIP=======

I have removed the geocities URL so that it won't potentially be marked
by the spam protection of this list's users.

My SpamAssassin configuration is fairly basic and looks like this:


=====SNIP=======
cat /etc/spamassassin/local.cf

lock_method flock
required_score 5.0
trusted_networks 127.0.0.1
# clear_headers
# add_header all Flag _YESNOCAPS_
# add_header all Status _YESNO_, score=_SCORE_ required=_REQD_
add_header spam Flag _YESNOCAPS_
add_header all Status _YESNO_, score=_SCORE_ required=_REQD_ tests=_TESTS_ autolearn=_AUTOLEARN_ version=_VERSION_
add_header all Level _STARS(*)_
add_header all Checker-Version SpamAssassin _VERSION_ (_SUBVERSION_) on _HOSTNAME_

rewrite_header Subject **SPAM**

skip_rbl_checks 0
report_safe 1
whitelist_from [EMAIL PROTECTED]
whitelist_from [EMAIL PROTECTED]
whitelist_from [EMAIL PROTECTED] users@spamassassin.apache.org [EMAIL PROTECTED] [EMAIL PROTECTED]
bayes_min_ham_num 50
bayes_min_spam_num 50
bayes_use_hapaxes 1
use_bayes 1
use_auto_whitelist 0
bayes_auto_learn_threshold_spam 1.0
use_razor2 1
use_pyzor 1
ok_locales en


=====SNIP=======


I had previously been using the rules from saupdates.openprotect.com
but I have stopped using that service while I try to diagnose this
problem. With or without the rules I have exactly the same issues.

One line I am constantly seeing in the mail.log file is the following

Mar 28 09:09:34 poopey spamd[21715]: config: copying current conf from backup

Does that have any bearing on the problem?

I also see the following

Mar 28 09:10:23 poopey spamd[21716]: bayes: not available for scanning, only 5 spam(s) in bayes DB < 50
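
For reference, the numbers in that log line can be extracted mechanically. The Python sketch below reads the count and the minimum from the line quoted above and reports how many more messages are needed. The function name is hypothetical, and the ham variant of the message is assumed to use the same wording with "ham" in place of "spam".

```python
import re

# The spamd log line quoted above, joined onto one line.
LOG_LINE = (
    "Mar 28 09:10:23 poopey spamd[21716]: bayes: not available for "
    "scanning, only 5 spam(s) in bayes DB < 50"
)

# Assumption: the ham message has the same shape as the spam one.
SHORTFALL_RE = re.compile(r"only (\d+) (spam|ham)\(s\) in bayes DB < (\d+)")


def bayes_shortfall(line):
    """Return (kind, messages still needed), or None if the line differs."""
    match = SHORTFALL_RE.search(line)
    if match is None:
        return None
    have = int(match.group(1))
    need = int(match.group(3))
    return match.group(2), need - have


print(bayes_shortfall(LOG_LINE))
```

Here the database is 45 spam messages short of the minimum of 50 configured in local.cf.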

I find that a little strange, as I have run sa-learn for both ham and
spam on the folders to which I have moved the messages.

Any advice on resolving or how to diagnose these problems would be
greatly appreciated.

Regards,

Jimmy.





[EMAIL PROTECTED]:~$ spamassassin -D --lint
[25453] dbg: logger: adding facilities: all
[25453] dbg: logger: logging level is DBG
[25453] dbg: generic: SpamAssassin version 3.1.7-deb
[25453] dbg: config: score set 0 chosen.
[25453] dbg: util: running in taint mode? yes
[25453] dbg: util: taint mode: deleting unsafe environment variables, resetting PATH
[25453] dbg: util: PATH included '/usr/local/sbin', keeping
[25453] dbg: util: PATH included '/usr/local/bin', keeping
[25453] dbg: util: PATH included '/usr/sbin', keeping
[25453] dbg: util: PATH included '/usr/bin', keeping
[25453] dbg: util: PATH included '/sbin', keeping
[25453] dbg: util: PATH included '/bin', keeping
[25453] dbg: util: PATH included '/usr/bin/X11', keeping
[25453] dbg: util: PATH included '/usr/games', keeping
[25453] dbg: util: final PATH set to: /usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/bin/X11:/usr/games
[25453] dbg: message: ---- MIME PARSER START ----
[25453] dbg: message: main message type: text/plain
[25453] dbg: message: parsing normal part
[25453] dbg: message: added part, type: text/plain
[25453] dbg: message: ---- MIME PARSER END ----
[25453] dbg: dns: is Net::DNS::Resolver available? yes
[25453] dbg: dns: Net::DNS version: 0.53
[25453] dbg: diag: perl platform: 5.008007 linux
[25453] dbg: diag: module installed: DB_File, version 1.811
[25453] dbg: diag: module installed: HTML::Parser, version 3.48
[25453] dbg: diag: module installed: MIME::Base64, version 3.05
[25453] dbg: diag: module installed: Net::DNS, version 0.53
[25453] dbg: diag: module installed: Net::SMTP, version 2.29
[25453] dbg: diag: module installed: Digest::SHA1, version 2.10
[25453] dbg: diag: module installed: Mail::SPF::Query, version 1.997
[25453] dbg: diag: module installed: IP::Country::Fast, version 604.001
[25453] dbg: diag: module installed: Razor2::Client::Agent, version 2.72
[25453] dbg: diag: module installed: Net::Ident, version 1.20
[25453] dbg: diag: module not installed: IO::Socket::INET6 ('require' failed)
[25453] dbg: diag: module installed: IO::Socket::SSL, version 0.97
[25453] dbg: diag: module installed: Time::HiRes, version 1.66
[25453] dbg: diag: module installed: DBI, version 1.50
[25453] dbg: diag: module installed: Getopt::Long, version 2.34
[25453] dbg: diag: module installed: LWP::UserAgent, version 2.033
[25453] dbg: diag: module installed: HTTP::Date, version 1.46
[25453] dbg: diag: module installed: Archive::Tar, version 1.26
[25453] dbg: diag: module installed: IO::Zlib, version 1.04
[25453] dbg: ignore: using a test message to lint rules
[25453] dbg: config: using "/etc/spamassassin" for site rules pre files
[25453] dbg: config: read file /etc/spamassassin/init.pre
[25453] dbg: config: read file /etc/spamassassin/v310.pre
[25453] dbg: config: read file /etc/spamassassin/v312.pre
[25453] dbg: config: using "/usr/share/spamassassin" for sys rules pre files
[25453] dbg: config: using "/usr/share/spamassassin" for default rules dir
[25453] dbg: config: read file /usr/share/spamassassin/10_misc.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_advance_fee.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_anti_ratware.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_body_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_compensate.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_dnsbl_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_drugs.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_fake_helo_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_head_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_html_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_meta_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_net_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_phrases.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_porn.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_ratware.cf
[25453] dbg: config: read file /usr/share/spamassassin/20_uri_tests.cf
[25453] dbg: config: read file /usr/share/spamassassin/23_bayes.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_accessdb.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_antivirus.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_body_tests_es.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_body_tests_pl.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_dcc.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_dkim.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_domainkeys.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_hashcash.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_pyzor.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_razor2.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_replace.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_spf.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_textcat.cf
[25453] dbg: config: read file /usr/share/spamassassin/25_uribl.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_de.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_fr.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_it.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_nl.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_pl.cf
[25453] dbg: config: read file /usr/share/spamassassin/30_text_pt_br.cf
[25453] dbg: config: read file /usr/share/spamassassin/50_scores.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_awl.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_whitelist.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_whitelist_dk.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_whitelist_dkim.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_whitelist_spf.cf
[25453] dbg: config: read file /usr/share/spamassassin/60_whitelist_subject.cf
[25453] dbg: config: read file /usr/share/spamassassin/65_debian.cf
[25453] dbg: config: using "/etc/spamassassin" for site rules dir
[25453] dbg: config: read file /etc/spamassassin/local.cf
[25453] dbg: config: using "/home/vpopmail/.spamassassin/user_prefs" for user prefs file
[25453] dbg: config: read file /home/vpopmail/.spamassassin/user_prefs
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::RelayCountry from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::RelayCountry=HASH(0x93a62c0)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::URIDNSBL from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x93e5d98)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::Hashcash from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::Hashcash=HASH(0x93bee9c)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::SPF=HASH(0x941edd4)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::DCC from @INC
[25453] dbg: dcc: local tests only, disabling DCC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::DCC=HASH(0x93f94f8)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::Pyzor from @INC
[25453] dbg: pyzor: local tests only, disabling Pyzor
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::Pyzor=HASH(0x9475be0)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::Razor2 from @INC
[25453] dbg: razor2: local tests only, skipping Razor
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::Razor2=HASH(0x947c2c8)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::SpamCop from @INC
[25453] dbg: reporter: local tests only, disabling SpamCop
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::SpamCop=HASH(0x94ad240)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::AWL from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::AWL=HASH(0x94bbd74)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::TextCat from @INC
[25453] dbg: textcat: loading languages file...
[25453] dbg: textcat: loaded 73 language models
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::TextCat=HASH(0x9688ce0)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::WhiteListSubject from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x992f864)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::MIMEHeader from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::MIMEHeader=HASH(0x993056c)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::ReplaceTags from @INC
[25453] dbg: plugin: registered Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x98fd504)
[25453] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[25453] dbg: plugin: did not register Mail::SpamAssassin::Plugin::SPF=HASH(0x98fd57c), already registered
[25453] dbg: config: adding redirector regex: /^http:\/\/chkpt\.zdnet\.com\/chkpt\/\w+\/(.*)$/i
[25453] dbg: config: adding redirector regex: /^http:\/\/www(?:\d+)?\.nate\.com\/r\/\w+\/(.*)$/i
[25453] dbg: config: adding redirector regex: /^http:\/\/.+\.gov\/(?:.*\/)?externalLink\.jhtml\?.*url=(.*?)(?:&.*)?$/i
[25453] dbg: config: adding redirector regex: /^http:\/\/redir\.internet\.com\/.+?\/.+?\/(.*)$/i
[25453] dbg: config: adding redirector regex: /^http:\/\/(?:.*?\.)?adtech\.de\/.*(?:;|\|)link=(.*?)(?:;|$)/i
[25453] dbg: config: adding redirector regex: m'^http.*?/redirect\.php\?.*(?<=[?&])goto=(.*?)(?:$|[&#])'i
[25453] dbg: config: adding redirector regex: m'^https?:/*(?:[^/]+\.)?emf\d\.com/r\.cfm.*?&r=(.*)'i
[25453] dbg: config: adding redirector regex: m'/(?:index.php)?\?.*(?<=[?&])URL=(.*?)(?:$|[&#])'i
[25453] dbg: config: adding redirector regex: m'^http:/*(?:\w+\.)?google(?:\.\w{2,3}){1,2}/url\?.*?(?<=[?&])q=(.*?)(?:$|[&#])'i
[25453] dbg: config: adding redirector regex: m'^http:/*(?:\w+\.)?google(?:\.\w{2,3}){1,2}/search\?.*?(?<=[?&])q=[^&]*?(?<=%20|..[=+\s])site:(.*?)(?:$|%20|[\s+&#])'i
[25453] dbg: config: adding redirector regex: m'^http:/*(?:\w+\.)?google(?:\.\w{2,3}){1,2}/search\?.*?(?<=[?&])q=[^&]*?(?<=%20|..[=+\s])(?:"|%22)(.*?)(?:$|%22|["\s+&#])'i
[25453] dbg: config: adding redirector regex: m'^http:/*(?:\w+\.)?google(?:\.\w{2,3}){1,2}/translate\?.*?(?<=[?&])u=(.*?)(?:$|[&#])'i
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x98fd504) implements 'finish_parsing_end'
[25453] dbg: replacetags: replacing tags
[25453] dbg: replacetags: done replacing tags
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_toks
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_seen
[25453] dbg: bayes: found bayes db version 3
[25453] dbg: bayes: DB journal sync: last sync: 0
[25453] dbg: bayes: not available for scanning, only 5 spam(s) in bayes DB < 50
[25453] dbg: bayes: untie-ing
[25453] dbg: bayes: untie-ing db_toks
[25453] dbg: bayes: untie-ing db_seen
[25453] dbg: config: score set 0 chosen.
[25453] dbg: message: ---- MIME PARSER START ----
[25453] dbg: message: main message type: text/plain
[25453] dbg: message: parsing normal part
[25453] dbg: message: added part, type: text/plain
[25453] dbg: message: ---- MIME PARSER END ----
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_toks
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_seen
[25453] dbg: bayes: found bayes db version 3
[25453] dbg: bayes: DB journal sync: last sync: 0
[25453] dbg: bayes: not available for scanning, only 5 spam(s) in bayes DB < 50
[25453] dbg: bayes: untie-ing
[25453] dbg: bayes: untie-ing db_toks
[25453] dbg: bayes: untie-ing db_seen
[25453] dbg: dns: is DNS available? 0
[25453] dbg: metadata: X-Spam-Relays-Trusted:
[25453] dbg: metadata: X-Spam-Relays-Untrusted:
[25453] dbg: metadata: X-Spam-Relays-Internal:
[25453] dbg: metadata: X-Spam-Relays-External:
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::RelayCountry=HASH(0x93a62c0) implements 'extract_metadata'
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::TextCat=HASH(0x9688ce0) implements 'extract_metadata'
[25453] dbg: metadata: X-Relay-Countries:
[25453] dbg: message: no encoding detected
[25453] dbg: textcat: classifying, skipping: yi sco lv is bs sl la ga sa eu et rm cy eo fy gd lt
[25453] dbg: textcat: language possibly: en
[25453] dbg: textcat: X-Languages: "en", X-Languages-Length: 1342
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::RelayCountry=HASH(0x93a62c0) implements 'parsed_metadata'
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x93e5d98) implements 'parsed_metadata'
[25453] dbg: rules: local tests only, ignoring RBL eval
[25453] dbg: check: running tests for priority: 0
[25453] dbg: rules: running header regexp tests; score so far=0
[25453] dbg: rules: ran header rule __HAS_MSGID ======> got hit: "<"
[25453] dbg: rules: ran header rule __SANE_MSGID ======> got hit: "<[EMAIL PROTECTED]>
[25453] dbg: rules: "
[25453] dbg: rules: ran header rule NO_REAL_NAME ======> got hit: "[EMAIL PROTECTED]
[25453] dbg: rules: "
[25453] dbg: rules: ran header rule __MSGID_OK_HOST ======> got hit: "@lint_rules>"
[25453] dbg: rules: ran header rule __MSGID_OK_DIGITS ======> got hit: "1175070965"
[25453] dbg: eval: all '*From' addrs: [EMAIL PROTECTED]
[25453] dbg: eval: all '*To' addrs:
[25453] dbg: rules: ran eval rule NO_RELAYS ======> got hit
[25453] dbg: rules: ran eval rule __UNUSABLE_MSGID ======> got hit
[25453] dbg: rules: running body-text per-line regexp tests; score so far=-1.001
[25453] dbg: rules: ran body rule __NONEMPTY_BODY ======> got hit: "I"
[25453] dbg: uri: running uri tests; score so far=-1.001
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_toks
[25453] dbg: bayes: tie-ing to DB file R/O /home/vpopmail/.spamassassin/bayes_seen
[25453] dbg: bayes: found bayes db version 3
[25453] dbg: bayes: DB journal sync: last sync: 0
[25453] dbg: bayes: not available for scanning, only 5 spam(s) in bayes DB < 50
[25453] dbg: bayes: not scoring message, returning undef
[25453] dbg: bayes: DB expiry: tokens in DB: 105828, Expiry max size: 150000, Oldest atime: 1174842548, Newest atime: 1175068063, Last expire: 0, Current time: 1175070966
[25453] dbg: bayes: DB journal sync: last sync: 0
[25453] dbg: bayes: untie-ing
[25453] dbg: bayes: untie-ing db_toks
[25453] dbg: bayes: untie-ing db_seen
[25453] dbg: rules: running raw-body-text per-line regexp tests; score so far=-1.001
[25453] dbg: rules: running full-text regexp tests; score so far=-1.001
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x93e5d98) implements 'check_tick'
[25453] dbg: check: running tests for priority: 500
[25453] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x93e5d98) implements 'check_post_dnsbl'
[25453] dbg: rules: running meta tests; score so far=-1.001
[25453] dbg: rules: running header regexp tests; score so far=0.866
[25453] dbg: rules: running body-text per-line regexp tests; score so far=0.866
[25453] dbg: uri: running uri tests; score so far=0.866
[25453] dbg: rules: running raw-body-text per-line regexp tests; score so far=0.866
[25453] dbg: rules: running full-text regexp tests; score so far=0.866
[25453] dbg: check: running tests for priority: 1000
[25453] dbg: rules: running meta tests; score so far=0.866
[25453] dbg: rules: running header regexp tests; score so far=0.866
[25453] dbg: rules: running body-text per-line regexp tests; score so far=0.866
[25453] dbg: uri: running uri tests; score so far=0.866
[25453] dbg: rules: running raw-body-text per-line regexp tests; score so far=0.866
[25453] dbg: rules: running full-text regexp tests; score so far=0.866
[25453] dbg: check: is spam? score=0.866 required=5
[25453] dbg: check: tests=MISSING_SUBJECT,NO_REAL_NAME,NO_RECEIVED,NO_RELAYS,TO_CC_NONE
[25453] dbg: check: subtests=__HAS_MSGID,__MSGID_OK_DIGITS,__MSGID_OK_HOST,__NONEMPTY_BODY,__SANE_MSGID,__UNUSABLE_MSGID
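
A debug dump this long is easier to scan with a filter. The Python sketch below pulls out which facilities reported "local tests only"; the excerpt is copied from the output above and the function name is hypothetical. Note that this run used --lint and still logged "local tests only" without any -L on the command line, so these lines appear to come from lint mode itself and may not reflect how spamd is scanning live mail.

```python
import re

# Lines copied from the lint output above.
LINT_EXCERPT = """\
[25453] dbg: dcc: local tests only, disabling DCC
[25453] dbg: pyzor: local tests only, disabling Pyzor
[25453] dbg: razor2: local tests only, skipping Razor
[25453] dbg: reporter: local tests only, disabling SpamCop
[25453] dbg: rules: local tests only, ignoring RBL eval
[25453] dbg: dns: is DNS available? 0
"""

LOCAL_ONLY_RE = re.compile(
    r"^\[\d+\] dbg: (\w+): local tests only", re.MULTILINE
)


def local_only_facilities(text):
    """Return the debug facilities that logged 'local tests only'."""
    return LOCAL_ONLY_RE.findall(text)


print(local_only_facilities(LINT_EXCERPT))
```

Running the same filter over the debug output of a real scan, for example a saved spam message passed through spamassassin -D, would show whether network tests are skipped there as well.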


