Hello All, I'm doing some research on how to classify documents into pre-defined categories. Some methods I have come across are Ontologies, topic maps, url/site based and simple keyword analysis. I'm leaning towards topic maps and Ontologies being the strongest and most documented in theory and in practice. Does the group have any recommendations on where to start? Software packages to help develop the owl/rdf files? Protoge? Any consultancies out there that handle this process? Downfalls to using these? And finally, integrating them into nutch/lucene.
Thanks in advance, Chad ------------------------------------------------------------------------- Take Surveys. Earn Cash. Influence the Future of IT Join SourceForge.net's Techsay panel and you'll get the chance to share your opinions on IT & business topics through brief surveys - and earn cash http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
