Martin Wiesner created OPENNLP-1639:
---------------------------------------
Summary: Add basic ud-training.conf file and reference it in
ud-train.sh
Key: OPENNLP-1639
URL: https://issues.apache.org/jira/browse/OPENNLP-1639
Project: OpenNLP
Issue Type: Sub-task
Components: Models
Affects Versions: 2.4.0
Reporter: Martin Wiesner
Fix For: 2.5.1
We want to provide a file with default training parameters and reference it in
ud-train.sh. That file lets us customize parameters such as Threads and
Iterations, to speed up training runs on certain hardware.
Currently, training of model files appears to be single-threaded, which makes
Lemmatizer model training in particular slow.
Moreover, only 100 iterations are conducted by default. Some models could
perform better if a higher number, say 300 iterations, could be configured.
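As a rough illustration of the parameters discussed above, a basic
ud-training.conf could look like the sketch below. The key names are assumed
to follow the properties-style format of OpenNLP training parameter files. The
Algorithm and Cutoff entries and the thread count of 4 are placeholders chosen
for illustration and are not taken from this issue.

```properties
# Hypothetical ud-training.conf (illustrative values, not the final defaults)
# Trainer algorithm; assumed here, not specified in this issue
Algorithm=MAXENT
# Raised from the default of 100, as suggested above
Iterations=300
# Minimum feature frequency; placeholder value
Cutoff=5
# Number of training threads; adjust to the available cores
Threads=4
```

How ud-train.sh would pick up this file is left open here; presumably it
would pass the file to the trainer call via the -params option of the OpenNLP
command-line tools.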
--
This message was sent by Atlassian Jira
(v8.20.10#820010)