Rupert Westenthaler created STANBOL-1303:
--------------------------------------------
Summary: Geonames LocationEnhancementEngine confidence values are
not in the range [0..1]
Key: STANBOL-1303
URL: https://issues.apache.org/jira/browse/STANBOL-1303
Project: Stanbol
Issue Type: Bug
Affects Versions: 0.12.0, 1.0.0
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Fix For: 1.0.0, 0.12.1
The Geonames.org service changed the value range of the scores it provides
from [0..100] to [0..inf]. As a result, the engine no longer reports
fise:confidence values in the range [0..1].
Looking at the reported numbers, one can assume that they represent a
relative confidence (similar to Solr scores).
To normalize the values to [0..1], one could
1. normalize relative to the result with the highest score
2. use the Levenshtein distance between the mention in the text and the
best-matching label.
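
The two options above could be sketched roughly as follows. This is only an
illustration, not the engine's actual code: the class ConfidenceNormalizer and
its method names are hypothetical, the scores are assumed to arrive as a plain
double array, and the similarity formula (1 - distance / longer length) is one
possible way to map an edit distance to [0..1].

```java
import java.util.Locale;

// Hypothetical helper; not part of the Stanbol code base.
public class ConfidenceNormalizer {

    /** Option 1: divide every score by the highest score in the result list. */
    public static double[] normalizeByMax(double[] scores) {
        double max = 0.0;
        for (double s : scores) {
            max = Math.max(max, s);
        }
        double[] normalized = new double[scores.length];
        if (max <= 0.0) {
            return normalized; // no positive score: report 0.0 for every result
        }
        for (int i = 0; i < scores.length; i++) {
            // negative scores are clamped to 0 (assumption: they should not occur)
            normalized[i] = Math.max(0.0, scores[i]) / max;
        }
        return normalized;
    }

    /** Classic two-row dynamic-programming Levenshtein distance. */
    public static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    /** Option 2: case-insensitive similarity of mention and label in [0..1]. */
    public static double similarity(String mention, String label) {
        String m = mention.toLowerCase(Locale.ROOT);
        String l = label.toLowerCase(Locale.ROOT);
        int maxLen = Math.max(m.length(), l.length());
        if (maxLen == 0) {
            return 1.0; // two empty strings are treated as identical
        }
        return 1.0 - (double) levenshtein(m, l) / maxLen;
    }
}
```

Note that option 1 always assigns 1.0 to the top result, even if it is a poor
match, while option 2 ignores the service's ranking altogether. Which of the
two (or a combination) fits the engine better is left open here.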
Until this gets fixed the unit tests for the engine will be deactivated.