[ 
https://issues.apache.org/jira/browse/STANBOL-1303?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rupert Westenthaler resolved STANBOL-1303.
------------------------------------------

    Resolution: Fixed

Implemented a simple algorithm that normalizes each score relative to the 
suggestion with the maximum score:
{code}
    score = Math.log1p(score) / Math.log1p(maxScore)
{code}
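
A minimal, standalone sketch of the normalization above, for illustration only. The class name ScoreNormalizer, the method signature, and the handling of empty, zero, or negative inputs are assumptions and are not taken from the committed engine code. Since log1p is monotonic and log1p(0) = 0, the best suggestion maps to 1.0, a score of 0 maps to 0.0, and the ranking of suggestions is preserved.

```java
/** Hypothetical helper; not the actual Stanbol engine code. */
final class ScoreNormalizer {

    private ScoreNormalizer() {}

    /**
     * Returns a new array in which each score is scaled to [0..1]
     * relative to the highest score in the input.
     */
    static double[] normalize(double[] scores) {
        // Find the score of the best suggestion.
        double maxScore = 0.0;
        for (double s : scores) {
            maxScore = Math.max(maxScore, s);
        }

        double[] normalized = new double[scores.length];
        // Assumption: with no positive score there is nothing to scale
        // against, so every confidence stays 0.0 (also avoids 0/0).
        if (maxScore <= 0.0) {
            return normalized;
        }

        double denominator = Math.log1p(maxScore);
        for (int i = 0; i < scores.length; i++) {
            // Assumption: negative scores are treated as 0.
            double score = Math.max(0.0, scores[i]);
            normalized[i] = Math.log1p(score) / denominator;
        }
        return normalized;
    }
}
```

For example, calling ScoreNormalizer.normalize(new double[] {250.0, 40.0, 0.0}) yields 1.0 for the first entry, a value strictly between 0 and 1 for the second, and 0.0 for the third.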

Applied the fix with http://svn.apache.org/r1597760 in 0.12 and 
http://svn.apache.org/r1597762 in trunk.

> Geonames LocationEnhancementEngine confidence values are not in the range 
> [0..1]
> ---------------------------------------------------------------------------------
>
>                 Key: STANBOL-1303
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1303
>             Project: Stanbol
>          Issue Type: Bug
>    Affects Versions: 0.12.0, 1.0.0
>            Reporter: Rupert Westenthaler
>            Assignee: Rupert Westenthaler
>             Fix For: 1.0.0, 0.12.1
>
>
> The Geonames.org service changed the value range of the scores it provides 
> from [0..100] to [0..inf]. Because of that, the engine no longer reports 
> fise:confidence values in the range [0..1].
> Looking at the reported numbers, one can assume that they represent a 
> relative confidence (similar to Solr scores).
> To normalize to [0..1], one could 
> 1. normalize relative to the result with the highest score
> 2. use the Levenshtein distance between the mention in the text and the 
> best-matching label.
> Until this is fixed, the unit tests for the engine will be deactivated.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
