Re: Confusion matrix report for POS Tagger evaluators

Jörn Kottmann Sun, 26 Feb 2012 16:45:17 -0800

+1 also needed for doccat.

Maybe it can be created by a class which could also
be used for doccat.


Jörn

On 02/26/2012 03:13 AM, Jason Baldridge wrote:

+1 Fine-grained error analysis FTW!

On Sat, Feb 25, 2012 at 4:57 PM, [email protected]<
[email protected]>  wrote:

Hi,

I implemented a new EvaluationMonitor for the POS Tagger. It generates
a confusion
matrix<http://en.wikipedia.org/wiki/Confusion_matrix>  for each token that
was not tagged properly.

Example output (Portuguese):

...
Accuracy for [que]: 91,34%
1316 ocurrencies. Confusion matrix (line: reference; column: predicted):
           |    conj-s | pron-indp |       adv |  pron-det || % Accu ||
    conj-s |>      537<|       40  |        0  |        0  || 93,07% ||
  pron-indp |       59  |>      661<|        0  |        0  || 91,81% ||
       adv |        2  |       12  |>        4<|        0  || 22,22% ||
  pron-det |        0  |        1  |        0  |>        0<||     0% ||

Accuracy for [o]: 98,48%
3949 ocurrencies. Confusion matrix (line: reference; column: predicted):
           |       art |  pron-det | pron-pers |         , || % Accu ||
       art |>     3857<|        4  |        0  |        1  || 99,87% ||
  pron-det |       36  |>       24<|        0  |        0  ||    40% ||
  pron-pers |       19  |        0  |>        8<|        0  || 29,63% ||
         , |        0  |        0  |        0  |>        0<||     0% ||

Accuracy for [a]: 96%
4395 ocurrencies. Confusion matrix (line: reference; column: predicted):
           |       art |       prp | pron-pers |  pron-det || % Accu ||
       art |>     3291<|       54  |        0  |        0  || 98,39% ||
       prp |      107  |>      922<|        0  |        0  ||  89,6% ||
  pron-pers |        4  |        0  |>        4<|        0  ||    50% ||
  pron-det |       11  |        0  |        0  |>        2<|| 15,38% ||
...

Do you think it is interesting to make this report available?
I would add it to the CLI and it would be activated by an new argument that
pass in an output file for the report.

Thank you,
William

Re: Confusion matrix report for POS Tagger evaluators

Reply via email to