scadams commented on PR #94:
URL: https://github.com/apache/opennlp-site/pull/94#issuecomment-3342755333

   @jzonthemtn @rzo1 Hopefully this answers your questions:
   
   The training data came from CoNLL 2006 and all of it can be downloaded here 
along with documentation and license information: [CoNLL-X Shared Task: 
Multi-lingual Dependency 
Parsing](https://web.archive.org/web/20070503133311/http://nextens.uvt.nl/~conll/free_data.html)
   
   These are/were the data sources for each language:
   
   - Danish: The Danish Dependency Treebank
   - Dutch: The Alpino Treebank
   - Portuguese: The Floresta Sintá(c)tica project
   - Swedish: Talbanken05 Swedish treebank
   
   This section of the wiki I mentioned above describes how the models were 
trained using this data: 
https://web.archive.org/web/20100917162145/http://sourceforge.net/apps/mediawiki/opennlp/index.php?title=Newlang#Language_Data
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to