[ 
https://issues.apache.org/jira/browse/STANBOL-478?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13210137#comment-13210137
 ] 

Rupert Westenthaler edited comment on STANBOL-478 at 2/17/12 9:02 AM:
----------------------------------------------------------------------

The - on demand - inclusion of this could be implemented by using STANBOL-488 - 
EnhancementProperties - on a per-request base. 

But this would bypass the now preferred multi-part content item API. With this 
functionality the intended way for get the plain text content is use the 
multi-part content item RESTful API extension to request specific content parts 
of ContentItems. I am currently working on a detailed documentation on that 
with a lot of examples for typical use cases (including this one). The Web UI 
of the Enhancer already includes a description of the RESTful services.

In general I do not understand the mention of SPARQL, because the Stanbol 
Enhancer does not store enhancement results and therefore can not provide a 
sparql endpoint. If this refers to the SPARQL endpoint of the Contenthub, than 
you might want to have a look at STANBOL-471. This would allow to have a SPARQL 
endpoint on top of (S1) - the store that contains enhanced content items.

As part of the work an STANBOL-471 I will also provide an LDPath wrapper for 
ContentItems. This will also have access to the "contents" of the contentItem. 
A simple LDPath command like

nie:plainTextContent = fn:content(".","text/plain");

would than take the content stored in the current contentItem with the mime 
type "text/plain" and index/store the value with the property 
nie:plainTextContent

best
Rupert
                
      was (Author: rwesten):
    This could be implemented by using 
https://issues.apache.org/jira/browse/STANBOL-488 on a per-request bases.
                  
> Change Metaxa Engine to create PlainText version as ContentPart and change 
> other Engines to retrieve PlainText version from ContentPart
> ---------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: STANBOL-478
>                 URL: https://issues.apache.org/jira/browse/STANBOL-478
>             Project: Stanbol
>          Issue Type: Improvement
>          Components: Enhancer
>            Reporter: Rupert Westenthaler
>            Assignee: Walter Kasper
>
> Instead of adding/reading the "text/plain" version of an ContentItem to/from 
> the metadata of the ContentItem the new ContentPart API should be used for 
> that.
> This will require the Metaxa Engine to store literal values of all Triples 
> with the ContentItem.getUri() as subject and
>     
>     http://www.semanticdesktop.org/ontologies/2007/01/19/nie#plainTextContent
> as property to a Blob and add this as ContentPart to the ContentItem.
> Other EnhancementEngines need than to search for a Blob with the MimeType 
> "text/plain" instead of retrieving the plain text from the metadata.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to