The classifier will probably make mistakes from time to time. I would
try to use a threshold that
indicates a choice is reasonably confident.
If all categories fall below that threshold, you can label that item as
unknown.
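A minimal sketch of that thresholding idea, in plain Java so it does not depend on any particular OpenNLP version. The class name ThresholdLabeler, the label method, and the "unknown" constant are made up for illustration. The scores array is assumed to hold whatever per-category probabilities your categorizer returns, in the same order as the category names.

```java
final class ThresholdLabeler {
    // Hypothetical label used when no category is confident enough.
    static final String UNKNOWN = "unknown";

    /**
     * Returns the best-scoring category, or UNKNOWN when even the best
     * score is below the threshold (so all categories fall below it).
     */
    static String label(String[] categories, double[] scores, double threshold) {
        if (categories.length != scores.length) {
            throw new IllegalArgumentException("categories and scores differ in length");
        }
        int best = -1;
        for (int i = 0; i < scores.length; i++) {
            if (best < 0 || scores[i] > scores[best]) {
                best = i;
            }
        }
        // No categories at all, or the top score is not confident enough.
        if (best < 0 || scores[best] < threshold) {
            return UNKNOWN;
        }
        return categories[best];
    }
}
```

With three categories and a cutoff of 0.6 (an arbitrary value, chosen only for the example), scores of {0.1, 0.8, 0.1} give the second category, while three equal scores of 1/3 give "unknown". The right cutoff depends on your data and is probably worth tuning on held-out examples.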
Jörn
On 10/28/2014 07:19 PM, [email protected] wrote:
Yeah, it looks like they all get the same 1/n score. It seems kind of hacky to
depend on that, but so far it does appear to work. I think I need a confidence
score in [0, 1] per category, rather than scores normalized so that the sum of
all weights is 1.0.
Is there any option to do this? If not, where should I dig to make such changes?
Patrick Baggett
Online Engineer - Search Team
-----Original Message-----
From: Mark G [mailto:[email protected]]
Sent: Tuesday, October 28, 2014 9:55 AM
To: [email protected]
Subject: Re: Getting started with OpenNLP
I think if nothing matches the model at all, each category will get the same
score.
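If that holds, the equal-score case can be checked explicitly. A rough sketch, assuming the scores are normalized to sum to 1.0, so that "nothing matched" shows up as every category scoring about 1/n. UniformScores, looksUniform, and the tolerance parameter are hypothetical names and not part of OpenNLP.

```java
final class UniformScores {
    /**
     * Returns true when every score is within `tolerance` of 1/n,
     * where n is the number of categories. An empty array is treated
     * as carrying no signal, so it also returns true.
     */
    static boolean looksUniform(double[] scores, double tolerance) {
        if (scores.length == 0) {
            return true;
        }
        double expected = 1.0 / scores.length;
        for (double s : scores) {
            if (Math.abs(s - expected) > tolerance) {
                return false;
            }
        }
        return true;
    }
}
```

A small tolerance such as 1e-6 allows for floating-point noise; whether the model really produces exactly equal scores for unmatched input is something to confirm against your own model.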