Hello
I have tried to train a Parser model from the data provided by
http://universaldependencies.org/ in the conllu format,
but I failed.
The documentation for the training process (with the Java API) is
lacking (how do you read from conllu files?),
and it looks like the command line utility expects its input to be in the
OpenNLP format (what is that? Is there some utility that can convert
from the conllu format to the OpenNLP one?).
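
For reference, the conllu format itself is plain text: one token per line
with 10 tab-separated columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD,
DEPREL, DEPS, MISC), comment lines starting with "#", and a blank line
between sentences. Below is a minimal sketch of reading it in plain Java.
ConlluReader and Token are made-up names for illustration, not OpenNLP
classes, and the sketch only gets as far as tokens with their head and
relation. It does not produce the Parse objects the trainer needs, which is
exactly the missing step.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.Reader;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper (not part of OpenNLP): reads conllu text into sentences.
public class ConlluReader {

    /** One token line; only the columns used in this sketch are kept. */
    public static final class Token {
        public final String id, form, upos, head, deprel;

        Token(String id, String form, String upos, String head, String deprel) {
            this.id = id;
            this.form = form;
            this.upos = upos;
            this.head = head;
            this.deprel = deprel;
        }
    }

    /** Returns one list of tokens per sentence. */
    public static List<List<Token>> read(Reader in) throws IOException {
        List<List<Token>> sentences = new ArrayList<>();
        List<Token> current = new ArrayList<>();
        BufferedReader reader = new BufferedReader(in);
        String line;
        while ((line = reader.readLine()) != null) {
            if (line.trim().isEmpty()) {
                // A blank line closes the current sentence.
                if (!current.isEmpty()) {
                    sentences.add(current);
                    current = new ArrayList<>();
                }
                continue;
            }
            if (line.startsWith("#")) {
                continue; // comment / metadata line
            }
            String[] cols = line.split("\t");
            if (cols.length != 10) {
                throw new IOException("Expected 10 tab-separated columns: " + line);
            }
            // Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
            if (cols[0].contains("-") || cols[0].contains(".")) {
                continue;
            }
            current.add(new Token(cols[0], cols[1], cols[3], cols[6], cols[7]));
        }
        // The last sentence may not be followed by a blank line.
        if (!current.isEmpty()) {
            sentences.add(current);
        }
        return sentences;
    }

    public static void main(String[] args) throws IOException {
        String sample = "# text = Dogs bark\n"
                + "1\tDogs\tdog\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
                + "2\tbark\tbark\tVERB\t_\t_\t0\troot\t_\t_\n\n";
        for (List<Token> sentence : read(new StringReader(sample))) {
            for (Token t : sentence) {
                System.out.println(t.id + " " + t.form + " " + t.upos
                        + " head=" + t.head + " " + t.deprel);
            }
        }
    }
}
```

To read a real treebank file, the same read method can be given a
java.io.FileReader (or an InputStreamReader with an explicit UTF-8 charset).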
It is really frustrating not to be able to train a model (I just spent
days trying), especially since using a pre-trained model is a breeze and
makes you believe that parsing is right at your fingertips (if
you manage to create an ObjectStream<Parse> from a conllu file, that
is... which I miserably failed to do).
Can someone please help by providing a working sample of training a
Parser from a conllu file,
or of creating an ObjectStream<Parse> from a conllu treebank (like
Universal Dependencies)?
(The sample provided in the OpenNLP documentation
(http://opennlp.apache.org/docs/1.8.4/manual/opennlp.html#tools.parser.parsing.api)
is somewhat lacking, by the way: it doesn't tell you how to create mlParameters,
etc.)
Best regards,
Olivier
- How to train a parser from conllu data Olivier Binda