https://issues.apache.org/SpamAssassin/show_bug.cgi?id=6703

--- Comment #24 from Thomas Arend <[email protected]> ---


(In reply to comment #22)
> (In reply to comment #20)
> > (In reply to comment #19)
> > > Upps there was a small copy error. There must be a ^ before "From "
> > > 
> > > /^From \S+  ?(Mon|Tue|Wed|Thu|Fri|Sat|Sun)(, \d\d
> > > (Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec) \d{4} [0-2]\d:\d\d:\d\d
> > > [+-]\d{4}| (Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec) [ 1-3]\d [
> > > 0-2]\d:\d\d:\d\d \d{4})/ )
> > 
> > This regex scares me because of the localization issue.  For example, Lun
> > for Monday in Spanish.
> 
> Agreed.
> 
> How about:
> 
> /^From \S+  ?[[:upper:]][[:lower:]]{2}(?:, \d\d [[:upper:]][[:lower:]]{2}
> \d{4} [0-2]\d:\d\d:\d\d [+-]\d{4}| [[:upper:]][[:lower:]]{2} [ 1-3]\d [
> 0-2]\d:\d\d:\d\d \d{4})/
> 
> I'm assuming [:upper:] and [:lower:] will match accented characters
> properly. I haven't tested that assumption.

Are you sure there is a localization issue in the header fields? I use the
German versions of Outlook Express, Thunderbird, KMail and Evolution, and none
of them localizes the timestamps in the headers.
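
For anyone who wants to check this quickly, here is a minimal sketch in Python
(not the Perl used in SpamAssassin) of the English-only pattern from comment
#19. It assumes the line-wrapped regex joins with single spaces, and the
address and the "Lun Ene" line are made up for illustration. Both the
asctime() format and the RFC 5322 date format use fixed English abbreviations,
which is why the two generated lines match:

```python
import re
import time
from email.utils import formatdate

# Day and month abbreviations as listed in the regex from comment #19.
DAY = r"(?:Mon|Tue|Wed|Thu|Fri|Sat|Sun)"
MONTH = r"(?:Jan|Feb|Mar|Apr|May|Jun|Jul|Aug|Sep|Oct|Nov|Dec)"

# Python transcription of the proposed pattern (assumption: wrapped lines
# are joined with one space, so "[ 0-2]" contains a literal space).
FROM_LINE = re.compile(
    r"^From \S+  ?" + DAY + r"(?:"
    r", \d\d " + MONTH + r" \d{4} [0-2]\d:\d\d:\d\d [+-]\d{4}"   # RFC-style date
    r"| " + MONTH + r" [ 1-3]\d [ 0-2]\d:\d\d:\d\d \d{4}"        # asctime-style date
    r")"
)


def is_mbox_from(line):
    """Return True if the line looks like an mbox 'From ' separator."""
    return FROM_LINE.match(line) is not None


sender = "user@example.com"  # hypothetical address

# time.asctime() does not use locale information; names stay English.
asctime_line = "From %s %s" % (sender, time.asctime(time.gmtime(0)))

# formatdate() produces an RFC-style date with English names.
rfc_line = "From %s  %s" % (sender, formatdate(0))

# Hypothetical localized line (Spanish abbreviations), as feared in comment #20.
localized_line = "From %s Lun Ene  2 10:15:00 2012" % sender

for line in (asctime_line, rfc_line, localized_line):
    print(is_mbox_from(line), repr(line))
```

The third line is rejected, so the localization concern only matters if some
software really writes localized names there, which I have not seen.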
