Re: foreign languages

2008-04-10 Thread Arvid Ephraim Picciani
Thanks, Matt and Matus. That helps.

-- 
best regards
Arvid Ephraim Picciani


Re: foreign languages

2008-04-10 Thread Matt Kettler

Arvid Ephraim Picciani wrote:

> greetings.
> any ideas for spam in Russian and Chinese? (some even with a broken charset)
> XBL and Bayes are very effective but not enough :/
> I'd like to have some kind of language matcher. We don't have people speaking
> Russian in the company so it would be nice to give 1 or 2 points on just the
> language.
  

Well, SpamAssassin has two tools to help here.

ok_locales will check character sets. By default it allows everything, 
but you can change it to only allow character sets that are appropriate 
for your locale.
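
As a rough illustration only, a local.cf fragment along these lines would restrict the accepted character sets. The choice of "en" is an assumption about the site, and the rule names in the score lines are the stock charset rules as I recall them, so check them against your installed ruleset before relying on them:

```
# local.cf (sketch, assuming only Western character sets are expected)
ok_locales en

# Assumed rule names from the stock ruleset; verify locally.
# The scores are examples, matching the "1 or 2 points" asked for.
score CHARSET_FARAWAY         2.0
score CHARSET_FARAWAY_HEADER  2.0
```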


Also, there's the TextCat plugin, which you'd have to un-comment in 
v310.pre. Once that's enabled, you can start using ok_languages, which 
tries to guess at the language of a message based on character combinations.
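
A minimal sketch of that setup follows. The language list "en de" is only an assumed example, and the score is illustrative; UNWANTED_LANGUAGE_BODY is the rule the plugin's stock configuration uses, as far as I know, so confirm the name in your installed rules:

```
# v310.pre: un-comment this line to enable the plugin
loadplugin Mail::SpamAssassin::Plugin::TextCat

# local.cf (sketch): languages you actually expect to receive (assumed here)
ok_languages en de

# Illustrative score for mail whose guessed language is not listed above
score UNWANTED_LANGUAGE_BODY 2.0
```

Language guessing on short messages is unreliable, so a modest score is probably safer than an outright block.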


Please read the docs closely, as there are a lot more languages than 
locales, so what's valid for one isn't valid for the other. (There are 
lots of languages that all use the same character sets.)


http://spamassassin.apache.org/full/3.2.x/doc/Mail_SpamAssassin_Conf.html#language_options

http://spamassassin.apache.org/full/3.2.x/doc/Mail_SpamAssassin_Plugin_TextCat.html






Re: foreign languages

2008-04-10 Thread Matus UHLAR - fantomas
On 10.04.08 12:38, Arvid Ephraim Picciani wrote:
> any ideas for spam in Russian and Chinese? (some even with a broken charset)
> XBL and Bayes are very effective but not enough :/
> I'd like to have some kind of language matcher. We don't have people speaking
> Russian in the company so it would be nice to give 1 or 2 points on just the
> language.

Look at the TextCat plugin and the ok_languages setting.

There's also the ok_locales setting, which matches on the character set and
does not require any plugin.
-- 
Matus UHLAR - fantomas, [EMAIL PROTECTED] ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
Honk if you love peace and quiet. 


foreign languages

2008-04-10 Thread Arvid Ephraim Picciani
greetings.
any ideas for spam in Russian and Chinese? (some even with a broken charset)
XBL and Bayes are very effective but not enough :/
I'd like to have some kind of language matcher. We don't have people speaking
Russian in the company so it would be nice to give 1 or 2 points on just the
language.
-- 
best regards
Arvid Ephraim Picciani


Re: Foreign Languages

2007-03-28 Thread John Thompson
On 2007-03-27, Nathan Brink <[EMAIL PROTECTED]> wrote:

> Does SpamAssassin score what we boarheaded Americans consider to be
> "foreign language" email messages the same as it would English?

Are you looking for something that would score mail differently 
depending on the language used? Some of the SARE rules are designed to 
handle specific languages differently:

  http://www.rulesemporium.com/rules.htm

-- 

John ([EMAIL PROTECTED])



Re: Foreign Languages

2007-03-27 Thread Chris St. Pierre

On Tue, 27 Mar 2007, Nathan Brink wrote:


> Does SpamAssassin score what we boarheaded Americans consider to be
> "foreign language" email messages the same as it would English?


SA uses a _lot_ of rules that score against English spam.  If you're
receiving spam in another language, then you'll need to translate the
rules, as it were.  SA does not have a language abstraction layer.

Spam detection is based heavily on content.  Content is mired in
language.
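
To illustrate what "translating" a rule might look like, here is a hypothetical local rule. The rule name, the German phrase, and the score are all made up for the example and do not come from the stock ruleset:

```
# local.cf (sketch): a hand-written body rule for a non-English phrase
body      LOCAL_DE_ORDER_NOW  /\bjetzt\s+bestellen\b/i
describe  LOCAL_DE_ORDER_NOW  Hypothetical rule: German "order now" phrase
score     LOCAL_DE_ORDER_NOW  1.0
```

Each such rule has to be written and tuned per language, which is why content-based scoring does not carry over automatically.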

Chris St. Pierre
Unix Systems Administrator
Nebraska Wesleyan University




Foreign Languages

2007-03-27 Thread Nathan Brink
Does SpamAssassin score what we boarheaded Americans consider to be
"foreign language" email messages the same as it would English?
 
Pardon my ignorance on the matter!
 
Thanks!!
Nate