Hi all,

> On 11.05.2017, at 18:37, Joern Kottmann <[email protected]> wrote:
> 
> The Apache OpenNLP PMC would like to call for a Vote on Apache OpenNLP
> 1.8.0 Release Candidate 2. 

Should OpenNLP 1.8.0 yield identical results as 1.7.2 when the same
models are used during classification?

E.g. the English parser model seems to create different POS tags now
for the sentence "We need a very complicated example sentence , 
which contains as many constituents and dependencies as possible .".
"a" is now wrongly tagged as "," whereas 1.7.2 tagged it correctly as "DT".

Should OpenNLP 1.8.0 yield identical results as 1.7.2 when the same
training data is used during training?

I have a test that trains a lemmatizer model on GUM 3.0.0. With 1.7.2,
this model reached an f-score of ~0.96. With 1.8.0, I only get ~0.84.

Cheers,

-- Richard


Reply via email to