Hi, I've integrated a custom dictionary and retrained some of the OpenNLP models, and I would like to evaluate the changes against a gold standard. I'd like to calculate precision, recall and F1 score so that I can compare the results before and after the changes.
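To make clear what I mean by these metrics, here is a minimal sketch in Python. It assumes exact matching on (begin offset, end offset, label) triples; the `Span` type, the `evaluate` function and the sample annotations are hypothetical illustrations, not part of the cTAKES or OpenNLP APIs.

```python
from typing import NamedTuple, Set, Tuple

# Hypothetical annotation representation: (begin offset, end offset, label).
Span = Tuple[int, int, str]


class Scores(NamedTuple):
    precision: float
    recall: float
    f1: float


def evaluate(gold: Set[Span], predicted: Set[Span]) -> Scores:
    """Exact-match precision, recall and F1 of predicted spans against gold spans."""
    # A true positive is a predicted span whose offsets and label both match gold.
    true_positives = len(gold & predicted)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    # F1 is the harmonic mean of precision and recall (0.0 when both are zero).
    if precision + recall == 0.0:
        f1 = 0.0
    else:
        f1 = 2 * precision * recall / (precision + recall)
    return Scores(precision, recall, f1)


# Made-up example data, for illustration only.
gold = {(0, 8, "Disease"), (15, 22, "Drug"), (30, 41, "Procedure")}
predicted = {
    (0, 8, "Disease"),
    (15, 22, "Disease"),   # right offsets, wrong label: counts as an error
    (30, 41, "Procedure"),
    (50, 55, "Drug"),      # spurious annotation
}

scores = evaluate(gold, predicted)
print(f"P={scores.precision:.3f} R={scores.recall:.3f} F1={scores.f1:.3f}")
```

Exact matching is the strictest choice; overlap-based matching would give partial credit for spans with slightly different boundaries, and I am open to either.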
My question is: does cTAKES ship with any evaluation or test scripts? What is the best strategy for doing this? Has anyone dealt with this topic before? I'm happy to share the results afterwards if there is interest.

Thanks,
Leander
