Hello, Ben,

We have an example of an abbreviation dictionary in the tests:
https://github.com/apache/opennlp/blob/master/opennlp-tools/src/test/resources/opennlp/tools/sentdetect/abb.xml
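
As a rough illustration of the file format, here is a minimal sketch in Python. It assumes the abbreviation dictionary uses the usual OpenNLP dictionary XML layout: a <dictionary> root with a case_sensitive attribute, and one <entry> per abbreviation holding one or more <token> elements. The sample entries and the helper name read_abbreviations are made up for this example, so please check them against the abb.xml linked above.

```python
import xml.etree.ElementTree as ET

# Assumed layout (verify against the abb.xml in the OpenNLP tests):
# one <entry> per abbreviation, each containing <token> elements.
# The entries below are illustrative, not copied from the real file.
SAMPLE_ABB_XML = """<?xml version="1.0" encoding="UTF-8"?>
<dictionary case_sensitive="false">
  <entry>
    <token>Mr.</token>
  </entry>
  <entry>
    <token>Dr.</token>
  </entry>
  <entry>
    <token>e.g.</token>
  </entry>
</dictionary>
"""


def read_abbreviations(xml_text):
    """Hypothetical helper: return (case_sensitive, list of abbreviations)."""
    # Parse from bytes so the encoding declaration in the prolog is honored.
    root = ET.fromstring(xml_text.encode("utf-8"))
    # Treated as False when the attribute is absent (a choice made for this sketch).
    case_sensitive = root.get("case_sensitive") == "true"
    abbreviations = []
    for entry in root.findall("entry"):
        tokens = [t.text.strip() for t in entry.findall("token") if t.text]
        if tokens:
            # Multi-token entries are joined with a single space.
            abbreviations.append(" ".join(tokens))
    return case_sensitive, abbreviations


case_sensitive, abbreviations = read_abbreviations(SAMPLE_ABB_XML)
print(case_sensitive, abbreviations)
```

This only reads the file to show its structure. OpenNLP itself loads the dictionary when you pass the file to the trainer.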

Regarding ready-to-use models, we have many here:
http://opennlp.sourceforge.net/models-1.5/

If you need a tutorial, there are many online.

Our docs are here. You can find code snippets and information on how to use
the command-line interface.
https://opennlp.apache.org/documentation/1.7.2/manual/opennlp.html

Regards,
William

2017-04-12 18:29 GMT-03:00 Benedict Holland <[email protected]>:

> Hello All,
>
> I am getting into NLP for a project and this is the solution we are going
> to use. I noticed that in many places there is something called the abbdict
> flag but there is not a specification for it. I believe it is an xml
> document. Could someone please provide a sample xml file and a brief
> description of the file format?
>
> In addition, is there a quick guide on starting with text, going through
> the various learning steps, example files, and expected output? I don't
> mean the manual but more like a true beginners guide with all of the
> example files and each of the commands run in a particular order and the
> expected output? I noticed, for example, I cannot download a sentence
> learning text en-sent.train because (I think) it is not free or can't be
> distributed.
>
> It would be very helpful to provide .train files for each step of the
> process, even as a simple example.
>
> Thanks,
> ~Ben
>
