You may also want to look at bayesian statistics, support vector machines, and machine learning algorithms.
Dennis kauu wrote: > this is exactly also what i wander > > On 12/5/06, chad savage <[EMAIL PROTECTED]> wrote: >> >> Hello All, >> >> I'm doing some research on how to classify documents into pre-defined >> categories. >> Some methods I have come across are Ontologies, topic maps, url/site >> based and simple keyword analysis. >> I'm leaning towards topic maps and Ontologies being the strongest and >> most documented in theory and in practice. >> Does the group have any recommendations on where to start? >> Software packages to help develop the owl/rdf files? Protoge? >> Any consultancies out there that handle this process? >> Downfalls to using these? >> And finally, integrating them into nutch/lucene. >> >> Thanks in advance, >> Chad >> >> > > ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
