Sure, but OpenNLP comes out of the box with a document classifier. Is that
no longer being supported?
On , Lance Norskog <[email protected]> wrote:
Classification is more of a machine learning problem, not a
linguistics problem. There are a few large, sophisticated (complex)
implementations for text analysis: Weka, R, and Mahout.
On Mon, Apr 23, 2012 at 10:41 AM, Alex Kudlick [email protected]> wrote:
> Hi,
>
> I've just started using open nlp for a project to classify scientific
> articles in to subjects. I have a few questions:
>
> 1. How do I configure logging for the model? I'm using sf4j-log4j for
the
> rest of my application, but the training output from the model just
goes to
> stdout.
>
> 2. Is there any support for classifying documents with multiple classes?
> For instance, a given article may be classified as Computational
Biology,
> Cell Biology, and Molecular Biology.
>
> Thanks,
>
> Alex Kudlick
--
Lance Norskog
[email protected]