[ 
https://issues.apache.org/jira/browse/STANBOL-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rafa Haro updated STANBOL-1156:
-------------------------------

    Description: 
Since STANBOL-1014, it is possible to generate an EntityHub site for the 
Freebase Knowledge Base. As part of Google Summer of Code call for 2013, there 
has been a proposal for Freebase Entity Disambiguation. Proposal details can be 
found in the following link: 
http://www.google-melange.com/gsoc/project/google/gsoc2013/adperezmorales/10001.
 The disambiguation process for Freebase should also follow the workflow and 
architecture stablished at STANBOL-1037.

The project development has been divided in three global tasks:

1. Integration of resources for local disambiguation. Wikilinks 
(http://www.iesl.cs.umass.edu/data/wiki-links) is a dataset that provides URLs 
of webpages, along with the anchor of the links, and the Wikipedia and Freebase 
pages they link to. As provided, this dataset can be used to get all the 
surface strings that refer to a Wikipedia page, but further, it can be used to 
download the webpages and extract the context around the webpages. This 
contexts can be used for local disambiguation against Content Items mention 
contexts.

2. Integration of resources for global disambiguation: Freebase is an enormous 
graphs of related entities and concepts. The structure of this graph can be 
used to compute groups of entities that are semantically related in a document. 
For example, we can use the relationship between Michael Jordan and NBA to 
disambiguate Michael Jordan in a text. The goal of this task is to store the 
Freebase graph structure in a Neo4j database and provide an API to use it for 
disambiguation purposes.

3. Disambiguation algorithm: finally, it is necessary to write an algorithm 
that take into account the local and global disambiguations score in order to 
refine the confidence values of the EntityAnnotations in the Enhancement 
Structure

  was:
Since STANBOL-1014, it is possible to generate an EntityHub site for the 
Freebase Knowledge Base. As part of Google Summer of Code call for 2013, there 
has been a proposal for Freebase Entity Disambiguation. Proposal details can be 
found in the following link: 
http://www.google-melange.com/gsoc/project/google/gsoc2013/adperezmorales/10001.
 The disambiguation process for Freebase should also follow the workflow and 
architecture stablished at STANBOL-1037.

The project development has been divided in three global tasks:

1. Integration of resources for local disambiguation. Wikilinks (

    
> Freebase Entity Disambiguation
> ------------------------------
>
>                 Key: STANBOL-1156
>                 URL: https://issues.apache.org/jira/browse/STANBOL-1156
>             Project: Stanbol
>          Issue Type: Story
>          Components: Data, Enhancement Engines, Enhancer, Entityhub
>            Reporter: Rafa Haro
>              Labels: disambiguation, freebase, gsoc2013, mentoring
>             Fix For: 0.12.0
>
>
> Since STANBOL-1014, it is possible to generate an EntityHub site for the 
> Freebase Knowledge Base. As part of Google Summer of Code call for 2013, 
> there has been a proposal for Freebase Entity Disambiguation. Proposal 
> details can be found in the following link: 
> http://www.google-melange.com/gsoc/project/google/gsoc2013/adperezmorales/10001.
>  The disambiguation process for Freebase should also follow the workflow and 
> architecture stablished at STANBOL-1037.
> The project development has been divided in three global tasks:
> 1. Integration of resources for local disambiguation. Wikilinks 
> (http://www.iesl.cs.umass.edu/data/wiki-links) is a dataset that provides 
> URLs of webpages, along with the anchor of the links, and the Wikipedia and 
> Freebase pages they link to. As provided, this dataset can be used to get all 
> the surface strings that refer to a Wikipedia page, but further, it can be 
> used to download the webpages and extract the context around the webpages. 
> This contexts can be used for local disambiguation against Content Items 
> mention contexts.
> 2. Integration of resources for global disambiguation: Freebase is an 
> enormous graphs of related entities and concepts. The structure of this graph 
> can be used to compute groups of entities that are semantically related in a 
> document. For example, we can use the relationship between Michael Jordan and 
> NBA to disambiguate Michael Jordan in a text. The goal of this task is to 
> store the Freebase graph structure in a Neo4j database and provide an API to 
> use it for disambiguation purposes.
> 3. Disambiguation algorithm: finally, it is necessary to write an algorithm 
> that take into account the local and global disambiguations score in order to 
> refine the confidence values of the EntityAnnotations in the Enhancement 
> Structure

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to