[
https://issues.apache.org/jira/browse/STANBOL-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Rupert Westenthaler resolved STANBOL-1303.
------------------------------------------
Resolution: Fixed
implemented a simple alg. that normalizes based on the suggestion with the max.
score
{code}
score = Math.log1p(score) / Math.log1p(maxScore)
{code}
applied fix with http://svn.apache.org/r1597760 in 0.12 and
http://svn.apache.org/r1597762 in trunk
> Geonames LocationEnhancementEngine confidence values are not in the range
> [0..1]]
> ---------------------------------------------------------------------------------
>
> Key: STANBOL-1303
> URL: https://issues.apache.org/jira/browse/STANBOL-1303
> Project: Stanbol
> Issue Type: Bug
> Affects Versions: 0.12.0, 1.0.0
> Reporter: Rupert Westenthaler
> Assignee: Rupert Westenthaler
> Fix For: 1.0.0, 0.12.1
>
>
> The Geonames.org service changed the value range of provided scores from
> [0..100] to [0..inv]. Because of that the engine does no longer report
> fise:confidence values in the range of [0..1].
> Looking at the reported numbers one can assume that they do represent the
> relative confidence (similar as Solr scores).
> For the normalization to [0..1] one could
> 1. normalize relative to the result with the highest score
> 2. use the levenshtein distance between the mention in the text with the best
> matching label.
> Until this gets fixed the unit tests for the engine will be deactivated.
--
This message was sent by Atlassian JIRA
(v6.2#6252)