Re: Tokenizer issue - Quotation marks

Jörn Kottmann Thu, 24 Feb 2011 12:47:09 -0800

On 2/24/11 12:10 PM, Rohana Rajapakse wrote:

Thanks a lot.


I did create a name finder training data file using CONNEL sometimes ago. Will 
have a look at how I did it. I may be able to convert this training file to 
produce a tokenizer training file.

Thanks a lot. Will let you know how I get on with this. Would contribute 
anything that might be useful to others.

English detoeknizer rules will be helpful for many, would be nice if youcould contribute yours then,there is a general one you can use to start with insideopennlp-tools/src/test/resources/opennlp/tools/latin-detokenizer.xml

The name finder training data you have should be good enough to startwith, depending

on the way its tokenized.

Jörn

Re: Tokenizer issue - Quotation marks

Reply via email to