The HMM implementation is still limited to BerkeleyDB. No additional settings 
are required. On Windows, simply install the packages from the distro and set 
useBerkeleyDB to on. Installing the BDB software from Oracle on Windows is 
recommended for repairing DBs, but it is not required.

Holding the HMM in memory requires up to 2 GB of RAM. I am also currently 
seeing that the dynamic loading of BDB for HMM leaks memory.
I plan to use additional tables in the main SQL database. This seems to be 
possible, and I hope the performance penalty for rebuildspamdb is not too bad. 
I also recommend using the word-stemming engine if HMM is enabled (and also 
for default Bayes usage).

So there is still some work to do before the HMM can go into full production.

What I currently need are some reports about the detection behavior of the HMM. 
I saw several mails that were classified differently by Bayes and HMM: Bayes 
said spam and HMM said ham, or vice versa. In all of those cases (100%) HMM was 
right and Bayes was wrong. This is to be expected, because HMM is much more 
complex and therefore much more exact.
Currently the HMM uses a very flat logic to keep memory usage and runtime as 
low as possible.
The number of tables required depends on the logic, so we first need to find 
the best logic before we can improve the rest.

Thomas


-----"GrayHat" <gray...@gmx.net> wrote: -----
To: "ASSP development mailing list" <assp-test@lists.sourceforge.net>
From: "GrayHat" <gray...@gmx.net>
Date: 24.10.2011 16:11
Subject: Re: [Assp-test] fixes, changes and news in assp 2.1.2 build 11295

 
> 'DoHMM','Hidden Markov Model Check 

> This needs a fully functional spamHMM and hamHMM BerkeleyDB
> database built by rebuildspamdb. For starters it is best practice to

Thomas, assuming one is running ASSP over a DB (say MySQL or
SQL Server or...), is there any need to create a new table or whatever
to allow ASSP to use the SQL DB for those tables too?


------------------------------------------------------------------------------
The demand for IT networking professionals continues to grow, and the
demand for specialized networking skills is growing even more rapidly.
Take a complimentary Learning@Cisco Self-Assessment and learn 
about Cisco certifications, training, and career opportunities. 
http://p.sf.net/sfu/cisco-dev2dev
_______________________________________________
Assp-test mailing list
Assp-test@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/assp-test


