[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-09-14 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

Thomas Arend  changed:

   What|Removed |Added

 CC||tho...@arend-rhb.de

--- Comment #8 from Thomas Arend  ---
The bug is still in 4.9.1

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-07-28 Thread Martin Koller
https://bugs.kde.org/show_bug.cgi?id=297198

Martin Koller  changed:

   What|Removed |Added

 Status|UNCONFIRMED |NEW
 CC||kol...@aon.at
 Ever confirmed|0   |1

--- Comment #7 from Martin Koller  ---
confirmed in comment #1

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-16 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

--- Comment #6 from Thomas Arend  ---
Comment from Mark Martinec 2012-05-16 13:42:01 UTC from
https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6703#c6

> When saving messages with kmail 1 the From Line has following format
> which is not iaw RFC 822:
>   From i...@ende-18-06.com Fri Jun 17 16:03:07 2011
> With kmail 2 the format is changed to the format which is iaw RFC 822
>   From tho...@arend-rhb.de Tue, 15 May 2012 22:01:41 +0200
> which is not parsed correctly by sa-learn. sa-learn --spam reports: [...]

Oh, no, not yet another incompatible mbox format!!!

> Proposed to change the behavior in a way that the old malformed From lines
> and the new correct ones are parsed.

It is the other way around, the new one differs from everybody else.

The format of the mbox file (along with its separator From_ lines)
is *not* governed by RFC 822 or its successors. There is no formal
standard for an mbox format, the RFC 4155 comes closest:
  http://tools.ietf.org/html/rfc4155
See also a Wikipedia article:
  http://en.wikipedia.org/wiki/Mbox


RFC 4155 says:

| a timestamp indicating the UTC date and time when the message
| was originally received, conformant with the syntax of the
| traditional UNIX 'ctime' output sans timezone (note that the
| use of UTC precludes the need for a timezone indicator);

This matches qmail docs:
  http://qmail.org/qmail-manual-html/man5/mbox.html
and matches Postfix and sendmail's local delivery agent.


To accommodate the new incompatible format it seems that the
two instances of a regexps in ArchiveIterator.pm need to be
extended, or just relaxed. Not sure if the date would still
be correctly parsed.

Best would be to persuade kmail folks to back off the change!

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-16 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

--- Comment #5 from Thomas Arend  ---
See bug report to SpamAssassin #6703
(https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6703)

See http://tools.ietf.org/html/rfc4155 which states that "A comprehensive
description of mbox database files on UNIX-like systems can be found at
http://qmail.org./man/man5/mbox.html, which should be treated as mostly
authoritative ..."

man 5 mbox defines that the date time stamp of the From_ line shopuld be in
ctime format. 

Therefore I propose to switch back to the old  date time stamp format of kmail
(Version 1.x).

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-15 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

--- Comment #4 from Thomas Arend  ---
There is a simple difference in the messages which causes the problem.

kmail1 used the following format of the "From" line:
  From tho...@arend-rhb.de Tue May 15 22:01:41 2012

kmail2 used the format:
  From tho...@arend-rhb.de Tue, 15 May 2012 22:01:41 +0200

If I change the "From"-line to the older format, sa-learn can learn the
messages!

kmail1 is not iaw RFC 822; which kmail2 is. Sticking to the standard the
problem is with spamassassin.

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-15 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

Thomas Arend  changed:

   What|Removed |Added

Version|4.7 |4.8.3

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-15 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

--- Comment #3 from Thomas Arend  ---
Created attachment 71118
  --> https://bugs.kde.org/attachment.cgi?id=71118&action=edit
Message saved from kmail Version 4.8.3

This message was received and stored with kmail2 Version 4.8.3

sa-learn --spam --mbox gives
Learned tokens from 0 message(s) (0 message(s) examined)

Which we do not expect. 

Both checks sa-learn were run on the same computer with spamassassin 3.3.1!

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-05-15 Thread Thomas Arend
https://bugs.kde.org/show_bug.cgi?id=297198

--- Comment #2 from Thomas Arend  ---
Created attachment 71117
  --> https://bugs.kde.org/attachment.cgi?id=71117&action=edit
Message saved from kmail Version 1.13.6

This message was received and saved from

KMail
Version 1.13.6
Unter KDE 4.6.00 (4.6.0) "release 6"

sa-learn --spam --mbox reports
Learned tokens from 1 message(s) (1 message(s) examined)

Which is what we expect!

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs


[Bug 297198] SPAMASSASSIN sa-learn can not learn mails exported to mbox format

2012-04-06 Thread Laurent Montel
https://bugs.kde.org/show_bug.cgi?id=297198

Laurent Montel  changed:

   What|Removed |Added

 CC||mon...@kde.org

--- Comment #1 from Laurent Montel  ---
I confirm it.

-- 
You are receiving this mail because:
You are the assignee for the bug.
___
Kdepim-bugs mailing list
Kdepim-bugs@kde.org
https://mail.kde.org/mailman/listinfo/kdepim-bugs