Rupert Westenthaler created STANBOL-1417:
--------------------------------------------

             Summary: Create Language Annotation for parsed "Content-Language" 
header
                 Key: STANBOL-1417
                 URL: https://issues.apache.org/jira/browse/STANBOL-1417
             Project: Stanbol
          Issue Type: Improvement
          Components: Enhancement Engines
    Affects Versions: 0.12.0
            Reporter: Rupert Westenthaler
            Assignee: Rupert Westenthaler
            Priority: Minor
             Fix For: 1.0.0, 0.12.1


Since STANBOL-660, Stanbol lets the user specify the language of the content 
via the "Content-Language" request header. Currently, however, this value is 
only written to the `dc:language` property of the ContentItem.

According to the specification in STANBOL-613, the `dc:language` value is only 
used as a fallback if no language annotation is present in the ContentItem. So 
as soon as any Language Identification Engine is part of the Chain, the 
"Content-Language" value passed by the user is ignored. This is not what a user 
who explicitly specifies the language intends.

To force Stanbol to use the language passed by the user, a Language Annotation 
with a confidence of 1.0 needs to be added to the metadata of the ContentItem 
instead.
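
The effect of the proposed change can be illustrated with a small, 
self-contained sketch. It is a simplified model and does not use the Stanbol 
API: the class `LanguageSelectionSketch`, the type `LanguageAnnotation` and the 
method `effectiveLanguage` are hypothetical names, and the rule "highest 
confidence wins, `dc:language` only as fallback" is an assumption based on the 
description above, not a copy of the STANBOL-613 specification.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Optional;

// Simplified, hypothetical model of the language selection; not the Stanbol API.
class LanguageSelectionSketch {

    /** Hypothetical stand-in for a language annotation in the ContentItem metadata. */
    static final class LanguageAnnotation {
        final String language;
        final double confidence;

        LanguageAnnotation(String language, double confidence) {
            this.language = language;
            this.confidence = confidence;
        }
    }

    /**
     * Assumed selection rule: the annotation with the highest confidence wins;
     * the dc:language value is consulted only if there is no annotation at all.
     */
    static Optional<String> effectiveLanguage(List<LanguageAnnotation> annotations,
                                              String dcLanguage) {
        Optional<String> best = annotations.stream()
                .max(Comparator.comparingDouble((LanguageAnnotation a) -> a.confidence))
                .map(a -> a.language);
        return best.isPresent() ? best : Optional.ofNullable(dcLanguage);
    }

    public static void main(String[] args) {
        String contentLanguage = "de"; // value the user sent as "Content-Language"

        // Current behaviour: the header only sets dc:language,
        // while a language identification engine annotates "en".
        List<LanguageAnnotation> annotations = new ArrayList<>();
        annotations.add(new LanguageAnnotation("en", 0.85));
        System.out.println(effectiveLanguage(annotations, contentLanguage).orElse("unknown"));

        // Proposed behaviour: the header additionally creates an annotation
        // with confidence 1.0, which outranks the detected language.
        annotations.add(new LanguageAnnotation(contentLanguage, 1.0));
        System.out.println(effectiveLanguage(annotations, contentLanguage).orElse("unknown"));
    }
}
```

Under these assumptions the first call yields "en" (the user's "de" is 
ignored) and the second yields "de", because the annotation derived from the 
header carries the highest possible confidence.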


