Your message dated Mon, 08 Mar 2010 21:50:24 +0000
with message-id <[email protected]>
and subject line Bug#554172: fixed in crm114 20090807-5
has caused the Debian Bug report #554172,
regarding crm114: doesn't appear to classify some mails correctly after learning
to be marked as done.

This means that you claim that the problem has been dealt with.
If this is not the case it is now your responsibility to reopen the
Bug report if necessary, and/or fix the problem forthwith.

(NB: If you are a system administrator and have no idea what this
message is talking about, this may indicate a serious mail system
misconfiguration somewhere. Please contact [email protected]
immediately.)


-- 
554172: http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=554172
Debian Bug Tracking System
Contact [email protected] with problems
--- Begin Message ---
Package: crm114
Version: 20090807-1
Severity: normal

I updated to crm114, 20090807, BlameThorstenAndJenny today. Since cssutil 
complained:

 Minor Caution - this file has the learncount slot in use.
 This is not a problem for Markovian classification, but it will have some
 issues with an OSB classfier.

and I use a OSB classifier and also due to the debconf warning, I recreated the 
files from scratch.

It appears to learn:

mar...@shambhala:~/.crm114> ./stats.sh
Als SPAM markiert: 3

 Sparse spectra file spam.css statistics:

 Total available buckets          :      1048577
 Total buckets in use             :         1481
 Total in-use zero-count buckets  :            0
 Total buckets with value >= max  :            0
 Total hashed datums in file      :         1565
 Documents learned                :           40
 Features learned                 :         1566
 Average datums per bucket        :         1.06
 Maximum length of overflow chain :            2
 Average length of overflow chain :         1.00
 Average packing density          :         0.00

Als HAM markiert: 20

 Sparse spectra file nonspam.css statistics:

 Total available buckets          :      1048577
 Total buckets in use             :         8187
 Total in-use zero-count buckets  :            0
 Total buckets with value >= max  :            0
 Total hashed datums in file      :         8389
 Documents learned                :           41
 Features learned                 :         8390
 Average datums per bucket        :         1.02
 Maximum length of overflow chain :            2
 Average length of overflow chain :         1.01
 Average packing density          :         0.01


Script is:

mar...@shambhala:~/.crm114> cat ./stats.sh
#!/bin/sh

echo "Als SPAM markiert: $(find reaver_cache/known_spam/ | wc -l)"
cssutil -rb spam.css

echo "Als HAM markiert: $(find reaver_cache/known_good/ | wc -l)"
cssutil -rb nonspam.css


Yet CRM114 doesn't appear to recognize some mails correctly after learning.

I learn it:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 
mailreaver.crm --good | grep CRM
/bin/ln: Erzeuge harte Verknüpfung 
„reaver_cache/known_good/20091103_160046_476304_CDA7397D“: Die Datei existiert 
bereits
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Notice: Please train this message.
X-CRM114-Action: LEARNED AND CACHED GOOD

This is what CRM114 thinks about it afterwards:

mar...@shambhala:~/Zeit> cat Aufräumpolicy\ _tmp.mbox| crm -u ~/.crm114 
mailreaver.crm | grep CRM
X-CRM114-Version: 20090807-BlameThorstenAndJenny ( TRE 0.7.6 (BSD) ) MR-27CA1CFB
X-CRM114-CacheID: sfid-20091103_160046_476304_CDA7397D
X-CRM114-Status: UNSURE (   8.14  )
X-CRM114-Notice: Please train this message.


These are my differences to the original configuration:

mar...@shambhala:~/.crm114> diff -u /usr/share/crm114/mailfilter.cf 
mailfilter.cf
--- /usr/share/crm114/mailfilter.cf     2009-08-07 17:22:37.000000000 +0200     
 
+++ mailfilter.cf       2009-11-03 16:11:45.779852564 +0100                     
 
@@ -169,8 +169,8 @@                                                             
 
 #  ---------  will be inserted at the front of the subject if we think the     
 
 #  ---------  mail is spam.                                                    
 
 #                                                                              
 
-# :spam_flag_subject_string: //                                                
 
-:spam_flag_subject_string: /ADV:/                                              
 
+:spam_flag_subject_string: //                                                  
 
+#:spam_flag_subject_string: /ADV:/                                             
 
                                                                                
 
 #  ---------  Do we want to insert a "flagging" string on the subject line     
 
 #  ---------  for good email?  Usually we don't.... so we set this to the      
 
@@ -180,13 +180,13 @@                                                           
 
 #  ------------Similarly, do we want to insert a "flagging" string on          
 
 #  -------------the subject line of an "unsure" email?  This way we know       
 
 #  --------------we need to train it even if "headers" is turned off.          
 
-# :unsure_flag_subject_string: //                                              
 
-:unsure_flag_subject_string: /UNS:/
+:unsure_flag_subject_string: //
+# :unsure_flag_subject_string: /UNS:/

 # ------------- Do we want Training ConFirmation flags on the results of
 # ------------- a message to be learned?  Default is "TCF:".
-:confirm_flag_subject_string: /TCF:/
-#:confirm_flag_subject_string: //
+#:confirm_flag_subject_string: /TCF:/
+:confirm_flag_subject_string: //


 # ---------  Do we want to do any "rewrites" to increase generality and
@@ -194,16 +194,16 @@
 #    --------- NOTE: this option is somewhat slow.  If your mailserver is
 #      --------- maxed out on CPU, you might want to turn this off.
 #
-:rewrites_enabled: /yes/
-#:rewrites_enabled: /no/
+#:rewrites_enabled: /yes/
+:rewrites_enabled: /no/


 #  ---------  Do we copy incoming text into allmail.txt ?  default is yes, but
 #   ---------  experienced users will probably set this to 'no' after testing
 #    ---------  their configuration for functionality.
 #
-:log_to_allmail.txt:  /yes/
-# :log_to_allmail.txt: /no/
+# :log_to_allmail.txt:  /yes/
+:log_to_allmail.txt: /no/

 #   -------  Another logging option - log all mail to somewhere else
 #    -------  entirely.  Whatever pathname is given here will be prefixed

-- System Information:
Debian Release: squeeze/sid
  APT prefers testing
  APT policy: (450, 'testing'), (400, 'unstable'), (101, 'experimental')
Architecture: i386 (i686)

Kernel: Linux 2.6.31.5-tp42-toi-3.0.1-04850-g4eddd0d (PREEMPT)
Locale: LANG=de_DE.UTF-8, LC_CTYPE=de_DE.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/dash

Versions of packages crm114 depends on:
ii  debconf [debconf-2.0]         1.5.28     Debian configuration management sy
ii  libc6                         2.9-25     GNU C Library: Shared libraries
ii  libtre4                       0.7.6-2    regexp matching library with appro

Versions of packages crm114 recommends:
ii  metamail                      2.7-54     implementation of MIME

crm114 suggests no packages.

-- debconf information:
* crm114/cssupgrade: true



--- End Message ---
--- Begin Message ---
Source: crm114
Source-Version: 20090807-5

We believe that the bug you reported is fixed in the latest version of
crm114, which is due to be installed in the Debian FTP archive:

crm114_20090807-5.debian.tar.gz
  to main/c/crm114/crm114_20090807-5.debian.tar.gz
crm114_20090807-5.dsc
  to main/c/crm114/crm114_20090807-5.dsc
crm114_20090807-5_amd64.deb
  to main/c/crm114/crm114_20090807-5_amd64.deb



A summary of the changes between this version and the previous one is
attached.

Thank you for reporting the bug, which will now be closed.  If you
have further comments please address them to [email protected],
and the maintainer will reopen the bug report if appropriate.

Debian distribution maintenance software
pp.
Milan Zamazal <[email protected]> (supplier of updated crm114 package)

(This message was generated automatically at their request; if you
believe that there is a problem with it please contact the archive
administrators by mailing [email protected])


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Format: 1.8
Date: Mon, 08 Mar 2010 20:07:46 +0100
Source: crm114
Binary: crm114
Architecture: source amd64
Version: 20090807-5
Distribution: unstable
Urgency: low
Maintainer: Milan Zamazal <[email protected]>
Changed-By: Milan Zamazal <[email protected]>
Description: 
 crm114     - versatile classifier for e-mail and other data
Closes: 554172 570769 572772
Changes: 
 crm114 (20090807-5) unstable; urgency=low
 .
   * Change default good_threshold value from 10 to 5; closes: #554172.
   * German and Vietnamese debconf translation updates; closes: #570769,
     #572772.
Checksums-Sha1: 
 d3ad0135fc8672375d27bacfd72502ffc6e655a8 1055 crm114_20090807-5.dsc
 36618bc104176543b78ba9793d29c72cfa8312ba 34948 crm114_20090807-5.debian.tar.gz
 849ef2b4b6a7a6acbfc0b254c9a06dd52111eace 570908 crm114_20090807-5_amd64.deb
Checksums-Sha256: 
 1af905ec9aaed387abc446799a47ba4e0ce6da7709d3432c14964c5cb8026d03 1055 
crm114_20090807-5.dsc
 ad0bc00091a6bc3c7dc1a023e25c3b73477aafc0b70a7e2b0c00aff810f001f9 34948 
crm114_20090807-5.debian.tar.gz
 a2f6d4db4cbd1e978b6b564b8d0c650543b8e3056067f5aae076bc5d1645e452 570908 
crm114_20090807-5_amd64.deb
Files: 
 4202eb3b5de1fb597d2080f8ba9f1c0a 1055 mail optional crm114_20090807-5.dsc
 f2aa93130bd88816d072205250349b2b 34948 mail optional 
crm114_20090807-5.debian.tar.gz
 e4a647ed4a85d72d0ef28370eeed6469 570908 mail optional 
crm114_20090807-5_amd64.deb

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.9 (GNU/Linux)

iEYEARECAAYFAkuVU8cACgkQkSkk/j3Cm0EmXACfVomaiTk+BLB9Zuvm3FcmbymH
NQoAoJ5M3qJbiBrl7l3KPl4ypmPqcHU+
=qJM7
-----END PGP SIGNATURE-----



--- End Message ---

Reply via email to