Hi all, I did a little testing with the German CONLL03 data. For person names, we only get a recall of around 38% and a precision of 82% on the development data.
I wonder what we are doing wrong here, since the numbers are so bad compared to other systems that participated back then, which get a similar precision but much higher recall. Is the lack of lemma and POS features causing this? Or could it be something else?

These guys have a much better recall and also use a maxent-based system: http://www.cnts.ua.ac.be/conll2003/pdf/18083kle.pdf

Any ideas what could be done to improve our name finder?

Jörn
