On 07/04/2012 12:35 PM, Daniel wrote:
I see, so I can't retrain the existing OpenNLP model for detecting person
names, "es-ner-person.bin". I would need the .train file that OpenNLP
used to create this model, and concatenate that file with my new
training files, is that right?

OpenNLP used the conll2002 data to create "es-ner-person.bin", so I have
downloaded it from http://www.cnts.ua.ac.be/conll2002/ner.tgz, but
I'm not able to use "esp.train", because when I run

C:\>opennlp TokenNameFinderTrainer -lang es -data esp.train -model es_person.bin

I get this error:

java.lang.IllegalArgumentException: Model not compatible with name finder!


so I guess that I must convert this data file to OpenNLP format, but when I run:

C:\>opennlp TokenNameFinderConverter conll02 -data esp.train -lang es -types per > corpus_train.txt

I get this error:

IO error while reading training data or indexing data: Expected three
fields per line in training data!


The first command should be like this:
opennlp TokenNameFinderTrainer.conll02 -lang es -data esp.train -types per -model es_person.bin

Hmm, what does your esp.train look like? Can you post a sentence from it here?
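
If it helps, one quick way to check is to count the whitespace-separated fields on each line, since the converter error above complains about the number of fields per line. This is only a rough sketch in Python: the helper name field_counts and the sample lines are made up for illustration and are not taken from esp.train or from OpenNLP, and it only counts columns without knowing which ones the converter wants.

```python
from collections import Counter
import io


def field_counts(lines):
    """Count how many whitespace-separated fields each non-blank line has."""
    counts = Counter()
    for line in lines:
        stripped = line.strip()
        if not stripped:
            # Blank lines are assumed to separate sentences; skip them.
            continue
        counts[len(stripped.split())] += 1
    return counts


# Hypothetical sample in a two-column "token tag" layout (not real esp.train data).
sample = io.StringIO("Juan B-PER\nvive O\nen O\nMadrid B-LOC\n\nHola O\n")
print(field_counts(sample))

# To run it on the real file, pass an open file handle instead, e.g.
#   with open("esp.train", encoding="latin-1") as f:   # encoding is an assumption
#       print(field_counts(f))
```

If every line reports the same number of fields, the layout is at least consistent; a mix of counts would point to stray or damaged lines in the file.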

Jörn
