Re: Training new models

swapnil marathe Thu, 27 Mar 2014 23:01:25 -0700

If you wish to make your own models you can makde them using
"Language modelling toolkits"  eg:
SRILM<http://www.speech.sri.com/projects/srilm/>
,MLITM <http://projects.csail.mit.edu/cgi-bin/wiki/view/SLS/MITLMTutorial>


Thanks,
Swapnil


On Fri, Mar 28, 2014 at 4:05 AM, Sanjeev Sharma <
[email protected]> wrote:

> Hi,
>
>
>
> I am new to OpenNLP.  I've been playing with chunking, tokenizing, POS
> tagging, and Name recognition for a few days.  I've been following the
> example code and using preexisting models from
> http://opennlp.sourceforge.net/models-1.5/.  I've been having some trouble
> with name recognition and organization recognition in that using the above
> mentioned models I can only identify common names or organizations like
> "Mike Smith" and "IBM".  In addition I need to be able to find date ranges
> and technical language like "Java", "C++", and "HTML" (I should mention
> that my input is going to be resumes).
>
>
>
> I figured I need to train my own models, especially since my training data
> should look more like my input to give a better context (i.e. resumes).
> I've been trying to find some information on how to do this in the
> documentation and also doing google searches.  I found a few simple
> examples, but not much more.  I did see the example in the documentation
> with the "<START:person> <END>" tags and the command line to process the
> training data into a .bin file, but nothing with organization names.  I
> tried to look at one or two of the annotation guides and that created more
> questions than answers (for example, the annotation guides not consistent
> with each other or the example in the documentation.  Are there pros and
> cons between the different formats?  Are the examples in the documentation
> in a native format?  Is there a conversion utility?  If so and I'm creating
> data from scratch, would it not be better to just put it in the native
> format?)
>
>
>
> I just lack understanding of OpenNLP and NLP in general and the OpenNLP
> Manual just hasn't worked for me.  Maybe I'm just misinterpreting the
> documentation or just not looking in the right place.  I would appreciate
> it greatly if someone could point me in the right direction in the way of
> real world examples of training a model, recommending a book I can read
> through, or maybe just some good examples of training data.  Beyond the
> specific task I'm trying to accomplish, I would like to get a deeper
> understanding of how OpenNLP works.
>
>
>
> Thanks for any help.
>

Re: Training new models

Reply via email to