Re: Using MulticlassClassificationEvaluator for NER evaluation

martin Wed, 10 Nov 2021 23:59:44 -0800

Hi Sean,

Apologies for the delayed reply. I've been away on vacation and then busy catching up afterwards.

Regarding the evaluation using MulticlassClassificationEvaluator: This is about a sequence labeling task to identify specific non-standard named entities. The training and evaluation data is in CoNLL format. The training works fine, using the categorical labels for the NEs. In order to use the MulticlassClassificationEvaluator, however, I need to convert these to floats. This is possible and also works fine; it is just inconvenient having to do the extra step. I would have expected the MulticlassClassificationEvaluator to be able to use the labels directly.
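
For illustration, the extra conversion step amounts to something like the following plain-Python sketch. It is independent of Spark and is not my actual pipeline code: the tag names and the helper functions build_label_index and encode are made up for the example. It assumes the tags have already been flattened to one label per token, and it orders the indices by descending tag frequency with alphabetical tie-breaking, which is how I understand the default ordering of Spark ML's StringIndexer.

```python
from collections import Counter


def build_label_index(labels):
    """Map each distinct tag to a float index, most frequent tag first."""
    counts = Counter(labels)
    # Sort by descending frequency, then alphabetically so ties are deterministic.
    ordered = sorted(counts, key=lambda tag: (-counts[tag], tag))
    return {tag: float(i) for i, tag in enumerate(ordered)}


def encode(labels, index):
    """Apply an existing mapping; a tag missing from the mapping raises KeyError."""
    return [index[tag] for tag in labels]


# Hypothetical token-level tags, already flattened from the CoNLL data.
gold = ["O", "B-PROD", "I-PROD", "O", "O", "B-PROD"]
pred = ["O", "B-PROD", "O", "O", "O", "B-PROD"]

# Build the mapping once (here from the gold tags) and reuse it for both columns,
# so that the same tag always gets the same number.
label_index = build_label_index(gold)
gold_ids = encode(gold, label_index)
pred_ids = encode(pred, label_index)
```

The important point is that one mapping is shared by the label and prediction columns; indexing each column separately could assign different numbers to the same tag and make the metrics meaningless.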

I will try to create and propose a code change in this regard, if or when I find the time.


Cheers,

Martin

On 2021-10-25 14:31, Sean Owen wrote:

I don't think the question is representation as double. The question is how this output represents a label. This looks like the result of an annotator. What are you classifying? You need, first, ground truth and prediction somewhere to use any utility to assess classification metrics.
On Mon, Oct 25, 2021 at 5:42 AM <mar...@wunderlich.com> wrote:
Hello,
I am using SparkNLP to do some NER. The resulting data structure after training and classification is a Dataset<Row>, with one column each for labels and predictions. For evaluating the model, I would like to use the Spark ML class org.apache.spark.ml.evaluation.MulticlassClassificationEvaluator. However, this evaluator expects labels as double numbers. For an NER task, the results in my case are of type array<struct<annotatorType:string,begin:int,end:int,result:string,metadata:map<string,string>,embeddings:array<float>>>.
It would be possible, of course, to convert this format to the required doubles. But is there a way to easily apply MulticlassClassificationEvaluator to the NER task, or is there maybe a better evaluator? I haven't found anything yet (neither in Spark ML nor in SparkNLP).
Thanks a lot.

Cheers,

Martin
