http://issues.apache.org/SpamAssassin/show_bug.cgi?id=3787





------- Additional Comments From [EMAIL PROTECTED]  2005-12-14 21:59 -------
I trimmed out the duplicate utf-8 warns in the debug below to leave 1 warn for 
every unique rule in 70_sare_obfu.cf that triggers the warn.  I wasnt running 
70_sare_adult.cf or 70_sare_specific during this test, so those obfu rules in 
there that trigger are not present in this debug.


# echo $LANG
en_US

# set | grep LC_
#

# perl -e 'use HTML::Parser; print HTML::Parser->VERSION . "\n"';
3.46

# svn info /tmp/spamassassin-trunk/
Path: /tmp/spamassassin-trunk
URL: http://svn.apache.org/repos/asf/spamassassin/trunk
Repository UUID: 13f79535-47bb-0310-9956-ffa450edef68
Revision: 356857

# spamassassin -V
SpamAssassin version 3.2.0-r356425
  running on Perl version 5.8.6

# ls -la /etc/mail/spamassassin/
-rw-r--r--  1 root root 158513 Oct  1 15:00 70_sare_obfu.cf
-rw-r--r--  1 root root    890 Sep 15 13:23 init.pre
-rw-r--r--  1 root root   1208 Sep 15 13:23 local.cf
-rw-r--r--  1 root root   2397 Sep 15 13:23 v310.pre

# ls -la /usr/share/spamassassin/
-rw-r--r--    1 root root   5495 Dec 14 14:02 10_default_prefs.cf
-rw-r--r--    1 root root  14312 Dec 14 14:02 20_dnsbl_tests.cf
-rw-r--r--    1 root root  17642 Dec 14 14:02 20_html_tests.cf
-rw-r--r--    1 root root   2164 Dec 14 14:02 20_net_tests.cf
-rw-r--r--    1 root root   2334 Dec 14 14:02 23_bayes.cf
-rw-r--r--    1 root root    420 Dec 14 14:02 25_accessdb.cf
-rw-r--r--    1 root root   1345 Dec 14 14:02 25_antivirus.cf
-rw-r--r--    1 root root    190 Dec 14 14:02 25_dcc.cf
-rw-r--r--    1 root root   1947 Dec 14 14:02 25_domainkeys.cf
-rw-r--r--    1 root root   2738 Dec 14 14:02 25_hashcash.cf
-rw-r--r--    1 root root    189 Dec 14 14:02 25_pyzor.cf
-rw-r--r--    1 root root   2201 Dec 14 14:02 25_razor2.cf
-rw-r--r--    1 root root   2873 Dec 14 14:02 25_spf.cf
-rw-r--r--    1 root root    352 Dec 14 14:02 25_textcat.cf
-rw-r--r--    1 root root   6544 Dec 14 14:02 25_uribl.cf
-rw-r--r--    1 root root   1116 Dec 14 14:02 60_awl.cf
-rw-r--r--    1 root root   4906 Dec 14 14:02 60_whitelist.cf
-rw-r--r--    1 root root   1726 Dec 14 14:02 60_whitelist_subject.cf
-rw-r--r--    1 root root 101479 Dec 14 14:02 languages
-rw-r--r--    1 root root  18944 Dec 14 14:02 triplets.txt
-rw-r--r--    1 root root   1869 Dec 14 14:02 user_prefs.template

# cat /root/test | spamc
X-Spam-Checker-Version: SpamAssassin 3.2.0-r356425 (2005-12-12) on
        asset.nmgi.com
X-Spam-Level: ****
X-Spam-Status: No, score=4.0 required=5.0 tests=HTML_60_70,HTML_MESSAGE,
        HTML_MISSING_CTYPE,HTML_SHORT_LENGTH autolearn=no
        version=3.2.0-r356425
Content-Type: text/html; charset=us-ascii

<html><body>
TUMS&reg; Smoothies&trade;
</body></html>




# spamd -D -L > spamd.out 2>&1 ^C
# cat spamd.out

[19475] dbg: logger: adding facilities: all
[19475] dbg: logger: logging level is DBG
[19475] dbg: logger: trying to connect to syslog/unix...
[19475] dbg: logger: opening syslog with unix socket
[19475] dbg: logger: successfully connected to syslog/unix
[19475] dbg: logger: successfully added syslog method
[19475] dbg: spamd: creating INET socket:
[19475] dbg: spamd:  Listen: 128
[19475] dbg: spamd:  LocalAddr: 127.0.0.1
[19475] dbg: spamd:  LocalPort: 783
[19475] dbg: spamd:  Proto: 6
[19475] dbg: spamd:  ReuseAddr: 1
[19475] dbg: spamd:  Type: 1
[19475] dbg: logger: adding facilities: all
[19475] dbg: logger: logging level is DBG
[19475] dbg: generic: SpamAssassin version 3.2.0-r356425
[19475] dbg: config: score set 0 chosen.
[19475] dbg: dns: no ipv6
[19475] dbg: dns: is Net::DNS::Resolver available? yes
[19475] dbg: dns: Net::DNS version: 0.49
[19475] dbg: dns: name server: 172.17.1.10, LocalAddr: 0.0.0.0
[19475] dbg: spamd: Preloading modules with HOME=/tmp/spamd-19475-init
[19475] dbg: ignore: test message to precompile patterns and load modules
[19475] dbg: config: using "/etc/mail/spamassassin" for site rules pre files
[19475] dbg: config: read file /etc/mail/spamassassin/init.pre
[19475] dbg: config: read file /etc/mail/spamassassin/v310.pre
[19475] dbg: config: using "/usr/share/spamassassin" for sys rules pre files
[19475] dbg: config: using "/usr/share/spamassassin" for default rules dir
[19475] dbg: config: read file /usr/share/spamassassin/10_default_prefs.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_dnsbl_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_html_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/20_net_tests.cf
[19475] dbg: config: read file /usr/share/spamassassin/23_bayes.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_accessdb.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_antivirus.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_dcc.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_domainkeys.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_hashcash.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_pyzor.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_razor2.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_spf.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_textcat.cf
[19475] dbg: config: read file /usr/share/spamassassin/25_uribl.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_awl.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_whitelist.cf
[19475] dbg: config: read file /usr/share/spamassassin/60_whitelist_subject.cf
[19475] dbg: config: using "/etc/mail/spamassassin" for site rules dir
[19475] dbg: config: read file /etc/mail/spamassassin/70_sare_obfu.cf
[19475] dbg: config: read file /etc/mail/spamassassin/local.cf
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::URIDNSBL from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::URIDNSBL=HASH
(0x8d3a2c0)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::Hashcash from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::Hashcash=HASH
(0x8d28048)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::SPF from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::SPF=HASH(0x8d69c30)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::Pyzor from @INC
[19475] dbg: pyzor: local tests only, disabling Pyzor
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::Pyzor=HASH
(0x8e5e4f4)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::SpamCop from @INC
[19475] dbg: reporter: local tests only, disabling SpamCop
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::SpamCop=HASH
(0x8ed6dd0)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::AWL from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::AWL=HASH(0x8efd740)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::AutoLearnThreshold 
from @INC
[19475] dbg: plugin: registered 
Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH(0x8f08558)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::WhiteListSubject from 
@INC
[19475] dbg: plugin: registered 
Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::MIMEHeader from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::MIMEHeader=HASH
(0x8f1e3fc)
[19475] dbg: plugin: loading Mail::SpamAssassin::Plugin::ReplaceTags from @INC
[19475] dbg: plugin: registered Mail::SpamAssassin::Plugin::ReplaceTags=HASH
(0x8f2e1b0)
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::ReplaceTags=HASH(0x8f2e1b0) 
implements 'finish_parsing_end'
[19475] dbg: replacetags: replacing tags
[19475] dbg: replacetags: done replacing tags
[19475] dbg: bayes: no dbs present, cannot tie DB R/O: /tmp/spamd-19475-
init/.spamassassin/bayes_toks
[19475] dbg: config: score set 0 chosen.
[19475] dbg: message: ---- MIME PARSER START ----
[19475] dbg: message: main message type: text/plain
[19475] dbg: message: parsing normal part
[19475] dbg: message: added part, type: text/plain
[19475] dbg: message: ---- MIME PARSER END ----
[19475] dbg: bayes: no dbs present, cannot tie DB R/O: /tmp/spamd-19475-
init/.spamassassin/bayes_toks
[19475] dbg: dns: is DNS available? 0
[19475] dbg: metadata: X-Spam-Relays-Trusted:
[19475] dbg: metadata: X-Spam-Relays-Untrusted:
[19475] dbg: message: no encoding detected
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) 
implements 'parsed_metadata'
[19475] dbg: rules: local tests only, ignoring RBL eval
[19475] dbg: check: running tests for priority: 0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: plugin: registering glue method for check_hashcash_value 
(Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d28048))
[19475] dbg: plugin: registering glue method for check_hashcash_double_spend 
(Mail::SpamAssassin::Plugin::Hashcash=HASH(0x8d28048))
[19475] dbg: eval: all '*From' addrs: [EMAIL PROTECTED]
[19475] dbg: eval: all '*To' addrs:
[19475] dbg: plugin: registering glue method for check_subject_in_blacklist 
(Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80))
[19475] dbg: plugin: registering glue method for check_subject_in_whitelist 
(Mail::SpamAssassin::Plugin::WhiteListSubject=HASH(0x8f14a80))
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) 
implements 'check_tick'
[19475] dbg: check: running tests for priority: 500
[19475] dbg: plugin: Mail::SpamAssassin::Plugin::URIDNSBL=HASH(0x8d3a2c0) 
implements 'check_post_dnsbl'
[19475] dbg: rules: running meta tests; score so far=0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: check: running tests for priority: 1000
[19475] dbg: rules: running meta tests; score so far=0
[19475] dbg: rules: running header regexp tests; score so far=0
[19475] dbg: plugin: registering glue method for check_from_in_auto_whitelist 
(Mail::SpamAssassin::Plugin::AWL=HASH(0x8efd740))
[19475] dbg: locker: safe_lock: created /tmp/spamd-19475-
init/.spamassassin/auto-whitelist.lock.asset.nmgi.com.19475
[19475] dbg: locker: safe_lock: trying to get lock on /tmp/spamd-19475-
init/.spamassassin/auto-whitelist with 0 retries
[19475] dbg: locker: safe_lock: link to /tmp/spamd-19475-
init/.spamassassin/auto-whitelist.lock: link ok
[19475] dbg: auto-whitelist: tie-ing to DB file of type DB_File R/W 
in /tmp/spamd-19475-init/.spamassassin/auto-whitelist
[19475] dbg: auto-whitelist: db-based 
[EMAIL PROTECTED]|ip=none scores 0/0
[19475] dbg: auto-whitelist: AWL active, pre-score: 0, autolearn score: 0, 
mean: undef, IP: undef
[19475] dbg: auto-whitelist: DB addr list: untie-ing and unlocking
[19475] dbg: auto-whitelist: DB addr list: file locked, breaking lock
[19475] dbg: locker: safe_unlock: unlink /tmp/spamd-19475-
init/.spamassassin/auto-whitelist.lock
[19475] dbg: auto-whitelist: post auto-whitelist score: 0
[19475] dbg: rules: running body-text per-line regexp tests; score so far=0
[19475] dbg: uri: running uri tests; score so far=0
[19475] dbg: rules: running raw-body-text per-line regexp tests; score so far=0
[19475] dbg: rules: running full-text regexp tests; score so far=0
[19475] dbg: check: is spam? score=0 required=5
[19475] dbg: check: tests=
[19475] dbg: check: subtests=
[19475] dbg: config: copying current conf to backup
[19475] info: spamd: server started on port 783/tcp (running version 3.2.0-
r356425)
[19475] info: spamd: server pid: 19475
[19475] info: spamd: server successfully spawned child process, pid 19478
[19475] dbg: prefork: child 19478: entering state 0
[19478] dbg: prefork: sysread(8) not ready, wait max 300 secs
[19475] dbg: prefork: new lowest idle kid: none
[19479] dbg: prefork: sysread(9) not ready, wait max 300 secs
[19475] info: spamd: server successfully spawned child process, pid 19479
[19475] dbg: prefork: child 19479: entering state 0
[19475] dbg: prefork: new lowest idle kid: none
[19475] dbg: prefork: child 19478: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] dbg: prefork: child 19479: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] info: prefork: child states: II
[19475] dbg: prefork: ordered 19478 to accept
[19475] dbg: prefork: sysread(7) not ready, wait max 300 secs
[19478] info: spamd: connection from localhost.localdomain [127.0.0.1] at port 
34629
[19475] dbg: prefork: child 19478: entering state 2
[19475] dbg: prefork: new lowest idle kid: 19479
[19478] info: spamd: setuid to root succeeded
[19478] dbg: info: user has changed
[19478] dbg: bayes: no dbs present, cannot tie DB 
R/O: /root/.spamassassin/bayes_toks
[19478] dbg: config: score set 0 chosen.
[19478] warn: spamd: still running as root: user not specified with -u, not 
found, or set to root, falling back to nobody at /usr/bin/spamd line 1152, 
<GEN5>
line 4.
[19478] info: spamd: processing message (unknown) for root:99
[19478] dbg: dns: name server: 172.17.1.10, LocalAddr: 0.0.0.0
[19478] dbg: bayes: no dbs present, cannot tie DB 
R/O: /root/.spamassassin/bayes_toks
[19478] dbg: metadata: X-Spam-Relays-Trusted:
[19478] dbg: metadata: X-Spam-Relays-Untrusted:
[19478] dbg: message: ---- MIME PARSER START ----
[19478] dbg: message: main message type: text/html
[19478] dbg: message: parsing normal part
[19478] dbg: message: added part, type: text/html
[19478] dbg: message: ---- MIME PARSER END ----
[19478] dbg: message: no encoding detected
[19478] dbg: rules: local tests only, ignoring RBL eval
[19478] dbg: check: running tests for priority: 0
[19478] dbg: rules: running header regexp tests; score so far=0
[19478] dbg: eval: all '*From' addrs:
[19478] dbg: eval: all '*To' addrs:
[19478] dbg: rules: running body-text per-line regexp tests; score so far=0
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xcf) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_PRICE1, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd1) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_PRESCR_SPL1, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd5) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_SOFT2, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xce) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_VICODIN, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd0) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_CIALIS2, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd1) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_PRESCRIP, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xce) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_VISIT1, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd2) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_XANAX, line 1, <GEN5> 
line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xc4) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule SARE_OBFU_GUARANTEE, line 1, 
<GEN5> line 10.
[19478] warn: Malformed UTF-8 character (unexpected non-continuation byte 0x00, 
immediately after start byte 0xd0) in pattern match (m//) 
at /etc/mail/spamassassin/70_sare_obfu.cf, rule __SARE_OBFU_MEDS2, line 1, 
<GEN5> line 10.
[19478] dbg: uri: running uri tests; score so far=0
[19478] dbg: rules: ran eval rule HTML_SHORT_LENGTH ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_512 ======> got hit (1)
[19478] dbg: rules: ran eval rule HTML_60_70 ======> got hit (1)
[19478] dbg: bayes: no dbs present, cannot tie DB 
R/O: /root/.spamassassin/bayes_toks
[19478] dbg: bayes: not scoring message, returning undef
[19478] dbg: bayes: opportunistic call attempt failed, DB not readable
[19478] dbg: rules: ran eval rule HTML_MESSAGE ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_384 ======> got hit (1)
[19478] dbg: rules: ran eval rule __HTML_LENGTH_0000_1024 ======> got hit (1)
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=3
[19478] dbg: rules: running full-text regexp tests; score so far=3
[19478] dbg: check: running tests for priority: 500
[19478] dbg: rules: running meta tests; score so far=3
[19478] dbg: rules: running header regexp tests; score so far=4
[19478] dbg: rules: running body-text per-line regexp tests; score so far=4
[19478] dbg: uri: running uri tests; score so far=4
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=4
[19478] dbg: rules: running full-text regexp tests; score so far=4
[19478] dbg: check: running tests for priority: 1000
[19478] dbg: rules: running meta tests; score so far=4
[19478] dbg: rules: running header regexp tests; score so far=4
[19478] dbg: rules: running body-text per-line regexp tests; score so far=4
[19478] dbg: uri: running uri tests; score so far=4
[19478] dbg: rules: running raw-body-text per-line regexp tests; score so far=4
[19478] dbg: rules: running full-text regexp tests; score so far=4
[19478] dbg: plugin: Mail::SpamAssassin::Plugin::AutoLearnThreshold=HASH
(0x8f08558) implements 'autolearn_discriminator'
[19478] dbg: learn: auto-learn: currently using scoreset 0
[19478] dbg: learn: auto-learn: message score: 4, computed score for autolearn: 
4
[19478] dbg: learn: auto-learn? ham=0.1, spam=12, body-points=3, head-points=0, 
learned-points=0
[19478] dbg: learn: auto-learn? no: inside auto-learn thresholds, not 
considered ham or spam
[19478] dbg: check: is spam? score=4 required=5
[19478] dbg: check: 
tests=HTML_60_70,HTML_MESSAGE,HTML_MISSING_CTYPE,HTML_SHORT_LENGTH
[19478] dbg: check: 
subtests=__HTML_LENGTH_0000_1024,__HTML_LENGTH_384,__HTML_LENGTH_512
[19478] info: spamd: clean message (4.0/5.0) for root:99 in 0.0 seconds, 99 
bytes.
[19478] info: spamd: result: .  4 - 
HTML_60_70,HTML_MESSAGE,HTML_MISSING_CTYPE,HTML_SHORT_LENGTH 
scantime=0.0,size=99,user=root,uid=99,required_score=5.0,rhos
t=localhost.localdomain,raddr=127.0.0.1,rport=34629,mid=(unknown),autolearn=no
[19478] dbg: config: copying current conf from backup
[19475] dbg: prefork: child 19478: entering state 1
[19475] dbg: prefork: new lowest idle kid: 19478
[19475] dbg: prefork: child reports idle
[19475] info: prefork: child states: II
[19478] dbg: prefork: sysread(8) not ready, wait max 300 secs
[19475] info: spamd: server killed by SIGINT, shutting down




------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.

Reply via email to