https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6703

--- Comment #23 from Kevin A. McGrail <[email protected]> ---
Created attachment 5070
  --> https://issues.apache.org/SpamAssassin/attachment.cgi?id=5070&action=edit
Patch to add options for defining the ArchiveIterator From regex as a Conf file
option

> How about:
> 
> /^From \S+  ?[[:upper:]][[:lower:]]{2}(?:, \d\d [[:upper:]][[:lower:]]{2}
> \d{4} [0-2]\d:\d\d:\d\d [+-]\d{4}| [[:upper:]][[:lower:]]{2} [ 1-3]\d [
> 0-2]\d:\d\d:\d\d \d{4})/
> 
> I'm assuming [:upper:] and [:lower:] will match accented characters
> properly. I haven't tested that assumption.

I don't know enough about foreign languages to know for sure the format is
always leading caps, etc. So I went ahead and wrote the patch to move this to a
configurable option.

It appears to work testing with the mbox with 3 ham messages attached
previously.  

"Learned tokens from 3 message(s) (3 message(s) examined)"

Thoughts?

KAM

-- 
You are receiving this mail because:
You are the assignee for the bug.

Reply via email to