Classification is more of a machine learning problem, not a linguistics problem. There are a few large, sophisticated (complex) implementations for text analysis: Weka, R, and Mahout.
On Mon, Apr 23, 2012 at 10:41 AM, Alex Kudlick <[email protected]> wrote: > Hi, > > I've just started using open nlp for a project to classify scientific > articles in to subjects. I have a few questions: > > 1. How do I configure logging for the model? I'm using sf4j-log4j for the > rest of my application, but the training output from the model just goes to > stdout. > > 2. Is there any support for classifying documents with multiple classes? > For instance, a given article may be classified as Computational Biology, > Cell Biology, and Molecular Biology. > > Thanks, > > Alex Kudlick -- Lance Norskog [email protected]
