Hello, Ben, We have an example of an abbreviation dictionary in the tests: https://github.com/apache/opennlp/blob/master/opennlp- tools/src/test/resources/opennlp/tools/sentdetect/abb.xml
Regarding ready-to-use models, we have many here: http://opennlp. sourceforge.net/models-1.5/ If you need a tutorial, there are many online. Our docs are here. You can find code snippets and information how to use the command line interface. https://opennlp.apache.org/documentation/1.7.2/manual/opennlp.html Regards, William 2017-04-12 18:29 GMT-03:00 Benedict Holland <[email protected]>: > Hello All, > > I am getting into NLP for a project and this is the solution we are going > to use. I noticed that in many places there is something called the abbdict > flag but there is not a specification for it. I believe it is an xml > document. Could someone please provide a sample xml file and a brief > description of the file format? > > In addition, is there a quick guide on starting with text, going through > the various learning steps, example files, and expected output? I don't > mean the manual but more like a true beginners guide with all of the > example files and each of the commands run in a particular order and the > expected output? I noticed, for example, I cannot download a sentence > learning text en-sent.train because (I think) it is not free or can't be > distributed. > > It would be very helpful to provide .train files for each step of the > process, even as a simple example. > > Thanks, > ~Ben >
