Rupert Westenthaler created STANBOL-1054:
--------------------------------------------
Summary: Add support for MLT queries over URIs
Key: STANBOL-1054
URL: https://issues.apache.org/jira/browse/STANBOL-1054
Project: Stanbol
Issue Type: Sub-task
Components: Enhancement Engines, Entityhub
Reporter: Rupert Westenthaler
Assignee: Rupert Westenthaler
Currently MLT queries do expect a textual context. However when disambiguation
concepts of a knowledge base a similarity search over a field with referenced
(related) concepts would be preferable computed over the list of concept URIs
instead of their Labels.
For this to work
* the Similarity Constraint needs to be extended to provide information about
the DataType of the context
* the MLT implementation of the SolrYard needs to be extended to correctly
process the type so that it does search in the correct Solr fields
* the Solr Field type used by the default Solr schema.xml of the Entityhub
needs to be changed so that a whitespace tokenizer is used for the URI field at
query time (to split multiple URIs parsed in the context of MLT queries)
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira