[ANN] - annotator-clj gets built-in support for openNLP compatible annotations (NameFinder)

Jim - FooBar(); Fri, 11 Jan 2013 11:14:17 -0800

Hi all,

I finally got the chance to clean up and essentially revisit some bits of code that have helped me a lot over the past year. I put it all together in a project and open-sourced it, just in case anyone else finds it useful. The project is a high-performance, dictionary-based annotator which can be tuned for openNLP, stanfordNLP, or a custom NER engine. Features include:


 * openNLP or stanfordNLP or custom NER component compatibility
 * fully parallel annotations of separate documents (optional)
 * flexible API that can deal with multiple dictionaries per document
   (merges them into a set)
 * custom tags are supported and can be provided directly on the
   command-line
 * basic normalisation is applied to the dictionary entries
   (entries are un-capitalised unless they are written entirely in
   capitals)
 * options to merge all the annotations together in a single file or
   write them separately in a dedicated directory
 * fully functional command-line interface
 * fully usable from any JVM-based language
 * non-reflective source code
 * data-centric & immutable API
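
To make the dictionary-related features above more concrete, here is a minimal Python sketch of the general technique. It is not the project's Clojure API: the function names (normalise, merge_dictionaries, annotate, annotate_all) are hypothetical, "un-capitalisation" is assumed to mean lower-casing an entry, matching is assumed to be per whitespace-separated token, and the output is assumed to follow the openNLP NameFinder convention of wrapping names in <START:tag> ... <END> markers.

```python
from concurrent.futures import ThreadPoolExecutor


def normalise(entry: str) -> str:
    """Lower-case an entry unless it is written entirely in capitals.

    Assumption: this is one reading of "un-capitalisation"; the real
    project may normalise differently.
    """
    return entry if entry.isupper() else entry.lower()


def merge_dictionaries(*dictionaries):
    """Merge any number of dictionaries (iterables of strings) into one set
    of normalised entries, skipping blank lines."""
    return {
        normalise(entry.strip())
        for dictionary in dictionaries
        for entry in dictionary
        if entry.strip()
    }


def annotate(text: str, entries: set, tag: str = "term") -> str:
    """Wrap every token found in `entries` in <START:tag> ... <END> markers.

    Simplification: only single-token entries are matched, and tokens are
    split on whitespace only.
    """
    out = []
    for token in text.split():
        if normalise(token) in entries:
            out.append(f"<START:{tag}> {token} <END>")
        else:
            out.append(token)
    return " ".join(out)


def annotate_all(documents, entries, tag="term", parallel=False):
    """Annotate separate documents, optionally handing them to a thread pool.

    Results come back in the same order as the input documents.
    """
    if not parallel:
        return [annotate(doc, entries, tag) for doc in documents]
    with ThreadPoolExecutor() as pool:
        return list(pool.map(lambda doc: annotate(doc, entries, tag), documents))
```

For example, merging the two dictionaries ["Aspirin", "DNA"] and ["ibuprofen", "aspirin"] yields a single set with three entries, because "Aspirin" and "aspirin" normalise to the same string while "DNA" is left untouched. Documents are independent of each other, which is what makes the per-document work easy to distribute; in Python a thread pool only illustrates the shape of that, since CPU-bound threads do not run truly in parallel there.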

The project lives here: https://github.com/jimpil/annotator-clj

Feel free to try it out...:)

Jim
