Hi Joern, I have got the data from the following link which consist of corpus of new articles. http://trec.nist.gov/data/reuters/reuters.html
Following the steps given in the below link I have created training and test data but it is not working with the NameFinder of opennlp api. http://www.clips.uantwerpen.be/conll2003/ner/000README So can you please help me how to create training data out of that corpus and use it to create name entity detection models? With Regards Madhvi Gupta *(Senior Software Engineer)* On Mon, Feb 20, 2017 at 1:00 AM, Joern Kottmann <kottm...@gmail.com> wrote: > Hello, > > to train the name finder you need training data that contains the entities > you would like to decect. > Is that the case with the data you have? > > Take a look at our documentation: > https://opennlp.apache.org/documentation/1.7.2/manual/ > opennlp.html#tools.namefind.training > > At the beginning of that section you can see how the data has to be marked > up. > > Please note you that you need many sentences to train the name finder. > > HTH, > Jörn > > > On Sat, Feb 18, 2017 at 11:28 AM, Madhvi Gupta <mgmahi....@gmail.com> > wrote: > > > Hi All, > > > > I have got reuters data from NIST. Now I want to generate the training > data > > from that to create a model for detecting named entities. Can anyone tell > > me how the models can be generated from that. > > > > -- > > With Regards > > Madhvi Gupta > > *(Senior Software Engineer)* > > > --