Using MulticlassClassificationEvaluator for NER evaluation

martin Mon, 25 Oct 2021 03:41:42 -0700

Hello,

I am using SparkNLP to do some NER. The result datastructure aftertraining and classification is a Dataset<Row>, with one column each forlabels and predictions. For evaluating the model, I would like to usethe Spark ML classorg.apache.spark.ml.evaluation.MulticlassClassificationEvaluator.However, this evaluator expects labels as double numbers. In the case ofan NER task, the results in my case are of typearray<struct<annotatorType:string,begin:int,end:int,result:string,metadata:map<string,string>,embeddings:array<float>>>.

It would be possible, of course, to convert this format to the requireddoubles. But is there a way to easily applyMulticlassClassificationEvaluator to the NER task or is there maybe abetter evaluator? I haven't found anything yet (neither in Spark ML norin SparkNLP).


Thanks a lot.

Cheers,

Martin

Using MulticlassClassificationEvaluator for NER evaluation

Reply via email to