Rupert Westenthaler created STANBOL-631:
-------------------------------------------

             Summary: Add "confidence-level" to the Stanbol Enhancement 
Structure
                 Key: STANBOL-631
                 URL: https://issues.apache.org/jira/browse/STANBOL-631
             Project: Stanbol
          Issue Type: New Feature
            Reporter: Rupert Westenthaler
             Fix For: 0.10.0-incubating


The "confidence-level" is intended to make it easier for clients to
decide how to process Enhancements. It would not use a numerical range
but four distinct values:

* confident: Meaning that a match is very likely - indicating that
those annotations typically can be accepted automatically (e.g. If the
EntityLinking engine finds a single Entity that exactly matches the
text selected by an text annotation)
* ambiguous: Meaning that there are several possibilities but is is
still likely that one of them match (e.g. Paris, Paris (Texas))
* suggestion: Meaning that the match is not completely certain, but
there are not several options (e.g. Germans -> Germany)
* uncertain: Meaning that Entities do match, but the probability of a
match is rather speculative (e.g. John -> Elton John)

IMHO using this classification would fit a lot of engines much better
as the numeric "fise:confidence" property as it does not rise the
expectation in users that confidence values are on a rational scale
(e.g. a Enhancement with a confidence of "0.8" is not two times as
likely as one with "0.4").

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to