On 11/07/2012 05:22 PM, Charlie Greenbacker wrote:
Ivelina,
Trained models for the Apache OpenNLP tools (tokenizer, sentence
segmenter, NER, and POS tagging) based on the CONLL02 data can be
found here:
http://opennlp.sourceforge.net/models-1.5/
Thanks Charlie! I'm actually using the models.
Best,
Ivelina
- Charlie
On Wed, Nov 7, 2012 at 10:01 AM, Svetoslav Marinov
<[email protected]
<mailto:[email protected]>> wrote:
Use the free dataset from Conll 2002:
http://www.cnts.ua.ac.be/conll2002/ner/
Успех!
Светослав
On 2012-11-07 15:02, "Ivelina Nikolova" <[email protected]
<mailto:[email protected]>> wrote:
>Dear List Members,
>
>Could you please give me some references to corpora in Dutch with
named
>entity annotation (Person, Organization, Event, Location)?
>Which are the corpora used for training the Dutch Name finders for
>Person, Organization and Location?
>
>Thank you very much!
>Ivelina
>
>
>--
>Ivelina Nikolova
>PhD student in Computer Science
>Linguistic Modelling Department
>Institute of Information and Communication Technologies
>Bulgarian Academy of Sciences
>
>
--
*Charlie Greenbacker*
/Data Scientist/, Berico Technologies
[email protected] <mailto:[email protected]>
<http://www.bericotechnologies.com/>
--
Ivelina Nikolova
PhD student in Computer Science
Linguistic Modelling Department
Institute of Information and Communication Technologies
Bulgarian Academy of Sciences