Author: buildbot
Date: Thu Jun 30 09:18:42 2011
New Revision: 791903
Log:
Staging update by buildbot
Added:
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contentenhancement.html
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/engines.html
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/enhancer.html
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/features.html
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/index.html
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/services.html
Added:
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contentenhancement.html
==============================================================================
---
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contentenhancement.html
(added)
+++
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contentenhancement.html
Thu Jun 30 09:18:42 2011
@@ -0,0 +1,252 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <p><em><a href="index.html">Back to index</a></em></p>
+<h1 id="using_apache_stanbol_for_enhancing_textual_content">Using Apache
Stanbol for enhancing textual content</h1>
+<hr />
+<p>For enhancing content you simply post plain text content to the Enhancement
Engines and you will get back enhancement data. The enhancement process is
stateless, so neither your content item, nor the enhancements will be stored.
</p>
+<p>You can test this via the [web interface of the engines][stan-engines] or
from console via</p>
+<div class="codehilite"><pre><span class="n">curl</span> <span
class="o">-</span><span class="n">X</span> <span class="n">POST</span> <span
class="o">-</span><span class="n">H</span> <span class="s">"Accept:
text/turtle"</span> <span class="o">-</span><span class="n">H</span> <span
class="s">"Content-type: text/plain"</span> <span class="o">\</span>
+<span class="o">--</span><span class="n">data</span> <span
class="s">"John Smith was born in London."</span> <span
class="n">http:</span><span class="sr">//</span><span
class="n">localhost:8080</span><span class="o">/</span><span
class="n">engines</span>
+</pre></div>
+
+
+<p>or by using the text examples delivered with Stanbol.</p>
+<div class="codehilite"><pre><span class="k">for</span> <span
class="n">file</span> <span class="n">in</span> <span
class="n">enhancer</span><span class="sr">/data/</span><span
class="n">text</span><span class="o">-</span><span
class="n">examples</span><span class="o">/*.</span><span
class="n">txt</span><span class="p">;</span>
+<span class="k">do</span>
+<span class="n">curl</span> <span class="o">-</span><span class="n">X</span>
<span class="n">POST</span> <span class="o">-</span><span class="n">H</span>
<span class="s">"Accept: text/turtle"</span> <span
class="o">-</span><span class="n">H</span> <span class="s">"Content-type:
text/plain"</span> <span class="o">-</span><span class="n">T</span> <span
class="nv">$file</span> <span class="n">http:</span><span
class="sr">//</span><span class="n">localhost:8080</span><span
class="o">/</span><span class="n">engines</span><span class="p">;</span>
+<span class="n">done</span>
+</pre></div>
+
+
+<p>Content items in formats other than plain text can be tested via the [web
interface of contenthub][stan-contenthub] or via the console by attaching
files. (The Metaxa Engine needs to be activated).</p>
+<h2 id="using_the_enhancement_engines">Using the enhancement engines</h2>
+<p>Apache Stanbol starts with a number of active enhancement engines by
default. You can activate or deactivate engines as well as configure them to
your needs via the [OSGI administration console][stan-admin].</p>
+<p>For the enhancement engines, a workflow for the enhancement process is
defined as pre-processing, content-extraction, extraction-enhancement, default
and post-processing. </p>
+<p>The following pre-processing engines are available:</p>
+<ul>
+<li>
+<p>The <strong>Language Identification Engine</strong> detects several
European languages of the content items you want to process.</p>
+</li>
+<li>
+<p>The <strong>Metaxa Engine</strong> extracts embedded metadata and textual
content from a large variety of document types and formats.</p>
+</li>
+</ul>
+<p>For content extraction / natural language processing one engine is
available:</p>
+<ul>
+<li>The <strong>Named Entity Extraction Enhancement Engine</strong> leverages
the sentence detector and name finder tools of the OpenNLP project bundled with
statistical models trained to detect occurrences of names of persons, places
and organizations.</li>
+</ul>
+<p>The extracted items will then be enhanced by a dedicated engine:</p>
+<ul>
+<li>The <strong>Named Entity Tagging Engine</strong> provides according
suggestions from dbpedia (default) and other references sites for entities
extracted by the NER engine .</li>
+</ul>
+<p>Specific additional enhancement engines are: </p>
+<ul>
+<li>
+<p>The <strong>Location Enhancement Engine</strong> takes its suggestions from
geonames.org only.</p>
+</li>
+<li>
+<p>The <strong>OpenCalais Enhancement Engine</strong> uses services from Open
Calais. (Note: You need to provide a key in order to use this engine)</p>
+</li>
+<li>
+<p>The <strong>Zemanta Enhancement Engine</strong> uses the Zemanta services.
(Note: You need to provide a key in order to use this engine)</p>
+</li>
+</ul>
+<p>For post-processing the results of the enhancement engines</p>
+<ul>
+<li>The <strong>CachingDereferencerEngine</strong> is used for the Web UI and
fetches files such as images for locations from external sites and is used to
present the enhancement results. </li>
+</ul>
+<h2 id="using_an_index_of_linked_open_data_locally">Using an index of linked
open data locally</h2>
+<p>To use the pre-configured indexes you can download them from
[here][stan-download]. You will get two files for each index:</p>
+<ul>
+<li>org.apache.stanbol.data.site.{name}-{version}.jar </li>
+<li>{name}.solrindex.zip</li>
+</ul>
+<p>By copying the zip archive into the "/sling/datafiles" folder before
installing the bundle, the data will used during the installation of the bundle
automatically. If you provide the file after installing the bundle, you will
need to restart the SolrYard installed by the bundle.</p>
+<p>The jar can be installed at any OSGI environment running the Apache Stanbol
Entityhub. When started it will create and configure:</p>
+<ul>
+<li>a "ReferencedSite" accessible at
"http://{host}/{root}/entityhub/site/{name}"</li>
+<li>a "Cache" used to connect the ReferencedSite with your Data and</li>
+<li>a "SolrYard" that manages the data indexed by this utility.</li>
+</ul>
+<p>This bundle does not contain the indexed data but only the configuration
for the Solr Index.</p>
+<p>If one has not copied the archive beforehand, the ZIP archive will be
requested by the Apache Stanbol Data File Provider after installing the Bundle.
To install the data you need copy this file to the "/sling/datafiles" folder
within the working directory of your Stanbol Server.</p>
+<p><em>Note: {name} denotes to the value you configured for the "name"
property within the "indexing.properties" file.</em></p>
+<h2 id="enhancement_example">Enhancement Example</h2>
+<p>The text "John Smith was born in London." with a minimal configuration of
enhancement engines (<em>Named Entity Extraction Enhancement Engine</em> and
<em>Named Entity Tagging Engine</em>) together with a local index of dbpedia
entities will result in the following output graph of several Entity
Annotations and Text Annotations (in Turtle-Syntax):</p>
+<div class="codehilite"><pre><span
class="sr"><urn:enhancement-37e084aa-abbd-5f15-d5e7-a59feea4deb2></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/EntityAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"46.396988"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-label></span>
+ <span class="s">"London, Ontario"</span><span
class="nv">@en</span> <span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-reference></span>
+ <span
class="sr"><http://dbpedia.org/resource/London,_Ontario></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-type></span>
+ <span class="sr"><http://www.w3.org/2002/07/owl#Thing></span>
<span class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/City></span> <span
class="p">,</span> <span
class="sr"><http://www.opengis.net/gml/_Feature></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Place></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Settlement></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/PopulatedPlace></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.295+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span> <span
class="s">"org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/relation></span>
+ <span
class="sr"><urn:enhancement-7af2e1a3-93fc-008f-fc6a-12accf05e58f></span>
<span class="o">.</span>
+
+<span
class="sr"><urn:enhancement-6fbf0996-b518-7130-92f8-00b4f7572031></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/EntityAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"183.53368"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-label></span>
+ <span class="s">"London"</span><span class="nv">@en</span>
<span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-reference></span>
+ <span class="sr"><http://dbpedia.org/resource/London></span>
<span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-type></span>
+ <span class="sr"><http://www.w3.org/2002/07/owl#Thing></span>
<span class="p">,</span> <span
class="sr"><http://www.opengis.net/gml/_Feature></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Settlement></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Place></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/PopulatedPlace></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.295+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span> <span
class="s">"org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/relation></span>
+ <span
class="sr"><urn:enhancement-7af2e1a3-93fc-008f-fc6a-12accf05e58f></span>
<span class="o">.</span>
+
+<span
class="sr"><urn:enhancement-7af2e1a3-93fc-008f-fc6a-12accf05e58f></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/TextAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"0.9973729691652052"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span class="sr"><http://fise.iks-project.eu/ontology/end></span>
+ <span class="s">"29"</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#int></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/selected-text></span>
+ <span class="s">"London"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/selection-context></span>
+ <span class="s">"John Smith was born in
London."</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://fise.iks-project.eu/ontology/start></span>
+ <span class="s">"23"</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#int></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.291+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span>
<span
class="s">"org.apache.stanbol.enhancer.engines.opennlp.impl.NEREngineCore"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/type></span>
+ <span class="sr"><http://dbpedia.org/ontology/Place></span>
<span class="o">.</span>
+
+<span
class="sr"><urn:enhancement-4629f59b-7a78-4cb2-e7d6-99ffca61a3a7></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/EntityAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"33.187008"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-label></span>
+ <span class="s">"City of London"</span><span
class="nv">@en</span> <span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-reference></span>
+ <span
class="sr"><http://dbpedia.org/resource/City_of_London></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-type></span>
+ <span class="sr"><http://www.w3.org/2002/07/owl#Thing></span>
<span class="p">,</span> <span
class="sr"><http://www.opengis.net/gml/_Feature></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Place></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Settlement></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/PopulatedPlace></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.296+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span> <span
class="s">"org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/relation></span>
+ <span
class="sr"><urn:enhancement-7af2e1a3-93fc-008f-fc6a-12accf05e58f></span>
<span class="o">.</span>
+
+<span
class="sr"><urn:enhancement-37173dd0-240a-068c-91c9-60d91a8677e5></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/TextAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"0.8398552823821589"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span class="sr"><http://fise.iks-project.eu/ontology/end></span>
+ <span class="s">"10"</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#int></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/selected-text></span>
+ <span class="s">"John Smith"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/selection-context></span>
+ <span class="s">"John Smith was born in
London."</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://fise.iks-project.eu/ontology/start></span>
+ <span class="s">"0"</span><span class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#int></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.289+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span> <span
class="s">"org.apache.stanbol.enhancer.engines.opennlp.impl.NEREngineCore"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/type></span>
+ <span class="sr"><http://dbpedia.org/ontology/Person></span>
<span class="o">.</span>
+
+<span
class="sr"><urn:enhancement-de0e2c42-f3f6-1fdb-ef30-2650dc398108></span>
+ <span class="n">a</span> <span
class="sr"><http://fise.iks-project.eu/ontology/EntityAnnotation></span>
<span class="p">,</span> <span
class="sr"><http://fise.iks-project.eu/ontology/Enhancement></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/confidence></span>
+ <span class="s">"10.437969"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#double></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-label></span>
+ <span class="s">"John Maynard Smith"</span><span
class="nv">@en</span> <span class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-reference></span>
+ <span
class="sr"><http://dbpedia.org/resource/John_Maynard_Smith></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/entity-type></span>
+ <span class="sr"><http://xmlns.com/foaf/0.1/Person></span>
<span class="p">,</span> <span
class="sr"><http://www.w3.org/2002/07/owl#Thing></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Scientist></span> <span
class="p">,</span> <span
class="sr"><http://dbpedia.org/ontology/Person></span> <span
class="p">;</span>
+ <span
class="sr"><http://fise.iks-project.eu/ontology/extracted-from></span>
+ <span
class="sr"><urn:content-item-sha1-88a2b5f6520df87e4567c06b48e742b7d1c71e9c></span>
<span class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/created></span>
+ <span
class="s">"2011-06-29T14:43:24.304+02:00"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#dateTime></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/creator></span>
<span
class="s">"org.apache.stanbol.enhancer.engines.entitytagging.impl.NamedEntityTaggingEngine"</span><span
class="o">^^</span><span
class="sr"><http://www.w3.org/2001/XMLSchema#string></span> <span
class="p">;</span>
+ <span class="sr"><http://purl.org/dc/terms/relation></span>
+ <span
class="sr"><urn:enhancement-37173dd0-240a-068c-91c9-60d91a8677e5></span>
+</pre></div>
+
+
+<hr />
+<p><em><a href="index.html">Back to index</a></em></p>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>
Added: websites/staging/stanbol/trunk/content/stanbol/docs/trunk/engines.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/engines.html
(added)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/engines.html Thu
Jun 30 09:18:42 2011
@@ -0,0 +1,125 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <p><em><a href="index.html">Back to index</a></em></p>
+<h1 id="enhancement_engines_and_their_main_features">Enhancement Engines and
their main features</h1>
+<h2 id="preprocessing">Preprocessing</h2>
+<ul>
+<li><strong>Language Identification Engine</strong><ul>
+<li>langage dedection for textual content</li>
+</ul>
+</li>
+<li>
+<p><strong>Metaxa Engine</strong></p>
+<ul>
+<li>text extraction from various documents</li>
+<li>extraction of metadata from documents</li>
+</ul>
+</li>
+</ul>
+<h2 id="natural_language_processing">Natural Language Processing</h2>
+<ul>
+<li><strong>Named Entity Extraction Enhancement Engine</strong> <ul>
+<li>NLP processing</li>
+<li>dedect occurences of persons, places and organizations, mandatory for
every enhancement.</li>
+</ul>
+</li>
+</ul>
+<h2 id="linking_suggestions">Linking Suggestions</h2>
+<ul>
+<li><strong>Named Entity Tagging Engine</strong><ul>
+<li>suggest links to several Linked Data Sources (e.g. dbpedia)</li>
+</ul>
+</li>
+<li>
+<p><strong>Location Enhancement Engine</strong> </p>
+<ul>
+<li>suggests links to geonames.org</li>
+</ul>
+</li>
+<li>
+<p><strong>OpenCalais Enhancement Engine</strong></p>
+<ul>
+<li>integrates service from Open Calais. (Note: You need to provide a key in
order to use this engine)</li>
+</ul>
+</li>
+<li>
+<p><strong>Zemanta Enhancement Engine</strong></p>
+<ul>
+<li>integrates the Zemanta services. (Note: You need to provide a key in order
to use this engine)</li>
+</ul>
+</li>
+</ul>
+<h2 id="postprocessing__other">Postprocessing / Other</h2>
+<ul>
+<li><strong>CachingDereferencerEngine</strong> <ul>
+<li>retrieves additional content for presenting the enhancement results.</li>
+</ul>
+</li>
+<li>
+<p><strong>Refactor Engine</strong>
+ - transforms enhancements according to a target ontology, requires
KRES launcher.</p>
+</li>
+</ul>
+<hr />
+<p><em><a href="index.html">Back to index</a></em></p>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>
Added: websites/staging/stanbol/trunk/content/stanbol/docs/trunk/enhancer.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/enhancer.html
(added)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/enhancer.html Thu
Jun 30 09:18:42 2011
@@ -0,0 +1,118 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <p><em><a href="index.html">Back to index</a></em></p>
+<h1 id="enhancer">Enhancer</h1>
+<p>This stateless interface allows the caller to submit content to the Apache
Stanbol enhancer engines and get the resulting enhancements formatted as RDF at
once without storing anything on the server-side.</p>
+<p>The content to analyze should be sent in a POST request with the mimetype
specified in the Content-type header. The response will hold the RDF
enhancement serialized in the format specified in the Accept header:</p>
+<div class="codehilite"><pre><span class="n">curl</span> <span
class="o">-</span><span class="n">X</span> <span class="n">POST</span> <span
class="o">-</span><span class="n">H</span> <span class="s">"Accept:
text/turtle"</span> <span class="o">-</span><span class="n">H</span> <span
class="s">"Content-type: text/plain"</span> <span class="o">\</span>
+<span class="o">--</span><span class="n">data</span> <span
class="s">"John Smith was born in London."</span> <span
class="n">http:</span><span class="sr">//</span><span
class="n">localhost:8080</span><span class="o">/</span><span
class="n">engines</span>
+</pre></div>
+
+
+<p>The list of mimetypes accepted as inputs depends on the deployed engines.
By default only text/plain content will be analyzed. By using the Metaxa En</p>
+<h2 id="main_interfaces_and_utilities">Main Interfaces and Utilities</h2>
+<p>A <strong>Content Item</strong> is the unit of content that Stanbol
Enhancer can deal with. It gives access to the binary content that was
registered, and the graph that represents its metadata (provided by client
and/or generated). The <strong>Enhancement Engine</strong> provides the
interface to internal or external semantic enhancement engines. There will
usually be several of those, that the EnhancementJobManager uses to enhance
content items. The <strong>Enhancement Job Manager</strong> accepts requests
for enhancing ContentItems, and processes them either synchronously or
asynchronously (as decided by the enhancement engines or by configuration). The
<strong>Enhancement Engine Helper</strong> provides the classes for the
resulting enhancement structure according to the defined <strong>Enhancement
Structure</strong>.</p>
+<h2 id="enhancement_structure">Enhancement Structure</h2>
+<p>The enhancement structure for Apache Stanbol is been described in full
[here]
+The enhancement structure defines the types and properties used for the
resulting metadata graph of Apache Stanbol. Every <strong>Enhancement</strong>
type is a description which contains the following mandatory properties:</p>
+<ul>
+<li>creator: the specific enhancement engine creating this enhancement</li>
+<li>creation time: the local system time, when the annotation was created</li>
+<li>extracted-from: the content item for the enhancement. This links to the ID
of the content item as assigned by Stanbol.</li>
+<li>type: the type of the enhancement (e.g. Location, Person, Location,
Concept ...).</li>
+<li>confidence: The level of confidence in the range from 0 to 1 </li>
+</ul>
+<p>A <strong>Text Annotation</strong> type provides metadata for the selected
text. This is intended to be used in addition to the enhancement type if an
enhancement is based on a part of the content.</p>
+<ul>
+<li>start: the character position of the start of the selection. If start is
not defined it is assumed, that the selection starts at the beginning of the
document</li>
+<li>end: the character position of the end of the selection. If end is not
defined it is assumed, that the selection ends at the end of the document.</li>
+<li>selected-text: The text selected by the enhancement. (optional).</li>
+<li>selection-context: The context of the selected text. This adds the
possibility to specify the context used to extract entities such as persons,
organizations, locations ... from natural language documents.</li>
+</ul>
+<p>The <strong>Entity Annotation</strong> refer to named entities which have
been recognized within the content. This type is intended to be used together
with the FISE enhancement type.</p>
+<ul>
+<li>entity-reference: This refers to the URI identifying the Entity</li>
+<li>entity-label: The label(s) of the referred entity</li>
+<li>entity-type: This property can be used to specify the type of the entity
(optional) </li>
+</ul>
+<p>The occurrences of the entity within the content (the exact positions
within the text where this entity is referred) are determined by outgoing
dc:relation links.</p>
+<h2 id="response_in_rdf">Response in RDF</h2>
+<p>Apache Stanbol Enhancer is able to serialize the response in the following
RDF formats:</p>
+<div class="codehilite"><pre><span class="n">application</span><span
class="o">/</span><span class="n">json</span> <span class="p">(</span><span
class="n">JSON</span><span class="o">-</span><span class="n">LD</span><span
class="p">)</span>
+<span class="n">application</span><span class="sr">/rdf+xml (RDF/</span><span
class="n">XML</span><span class="p">)</span>
+<span class="n">application</span><span class="sr">/rdf+json (RDF/</span><span
class="n">JSON</span><span class="p">)</span>
+<span class="n">text</span><span class="o">/</span><span
class="n">turtle</span> <span class="p">(</span><span
class="n">Turtle</span><span class="p">)</span>
+<span class="n">text</span><span class="o">/</span><span
class="n">rdf</span><span class="o">+</span><span class="n">nt</span> <span
class="p">(</span><span class="n">N</span><span class="o">-</span><span
class="n">TRIPLES</span><span class="p">)</span>
+</pre></div>
+
+
+<p>By default the URI of the content item being enhanced is a local, non
de-referencable URI automatically built out of a hash digest of the binary
content. Sometimes it might be helpful to provide the URI of the content-item
to be used in the enhancements RDF graph. This can be achieved by passing a URI
request parameter as follows:</p>
+<div class="codehilite"><pre><span class="n">curl</span> <span
class="o">-</span><span class="n">X</span> <span class="n">POST</span> <span
class="o">-</span><span class="n">H</span> <span class="s">"Accept:
text/turtle"</span> <span class="o">-</span><span class="n">H</span> <span
class="s">"Content-type: text/plain"</span> <span class="o">\</span>
+<span class="o">--</span><span class="n">data</span> <span
class="s">"John Smith was born in London."</span> <span
class="o">\</span>
+<span
class="s">"http://localhost:8080/engines?uri=urn:fise-example-content-item"</span>
+</pre></div>
+
+
+<hr />
+<p><em><a href="index.html">Back to index</a></em></p>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>
Added: websites/staging/stanbol/trunk/content/stanbol/docs/trunk/features.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/features.html
(added)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/features.html Thu
Jun 30 09:18:42 2011
@@ -0,0 +1,81 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <p><em><a href="index.html">Back to index</a></em></p>
+<h1 id="apache_stanbol_features">Apache Stanbol features</h1>
+<ul>
+<li>NLP and Named Entity Recognition via <a
href="http://incubator.apache.org/opennlp/">Apache OpenNLP</a></li>
+<li>Enhancement engines linking to open data such as dbpedia, geonames,
dplb.</li>
+<li>Enhancement engines integrating external services such as zemanta and
opencalais.</li>
+<li>Local Indexes of Linked open Data</li>
+<li>RDF management via <a href="http://incubator.apache.org/clerezza/">Apache
Clerezza</a></li>
+<li>Various enhancement output serialisations (RDF/XML, JSON-LD, RDF/XML,
RDF/JSON, Turtle, N-TRIPLES)</li>
+<li>Language Detection for European Languages</li>
+<li>Extracting embedded metadata and textual content via <a
href="http://aperture.sourceforge.net/">Aperture</a></li>
+</ul>
+<h2 id="upcoming_features_whishlist">Upcoming features whishlist</h2>
+<ul>
+<li>Multilingual support</li>
+</ul>
+<hr />
+<p><em><a href="index.html">Back to index</a></em></p>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>
Added: websites/staging/stanbol/trunk/content/stanbol/docs/trunk/index.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/index.html (added)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/index.html Thu
Jun 30 09:18:42 2011
@@ -0,0 +1,156 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <h1 id="apache_stanbol_-_services_for_semantic_content_management">Apache
Stanbol - services for semantic content management</h1>
+<hr />
+<p><a href="http://incubator.apache.org/stanbol/">Apache Stanbol</a> is an
open source modular software stack and reusable set of components for semantic
content management.</p>
+<p>Apache Stanbol components are meant to be accessed over RESTful interfaces
to provide semantic services for content management. The current code is
written in Java and based on the OSGi modularization framework.</p>
+<p>Applications include extending existing content management systems with
(internal or external) semantic services, and creating new types of content
management systems with semantics at their core. Its main features will
include:</p>
+<ul>
+<li>Persistence: services that store (or cache) semantic information and make
it searchable</li>
+<li>Lifting/Enhancement: services that add semantic information to
ânon-semanticâ pieces of content</li>
+<li>Knowledge models and reasoning: services that enhance the semantic
information</li>
+<li>Interaction: intelligent user interface management and generation </li>
+</ul>
+<p>Apache Stanbol project was initiated and is part-funded by the European
R&D project <a href="http://www.iks-project.eu/">IKS - Interactive
Knowledge Stack for small to medium CMS providers</a>. </p>
+<p>Online demos of Apache Stanbol are available at </p>
+<ul>
+<li><a
href="http://dev.iks-project.eu:8080/">http://dev.iks-project.eu:8080/</a></li>
+<li><a
href="http://stanbol.demo.nuxeo.com/">http://stanbol.demo.nuxeo.com/</a></li>
+</ul>
+<h2 id="build_install_and_run_apache_stanbol">Build, install and run Apache
Stanbol</h2>
+<p>As there is no official release of <a
href="http://incubator.apache.org/stanbol/">Apache Stanbol</a> so far, one
needs to build the software from source.</p>
+<h3 id="installation">Installation</h3>
+<ol>
+<li>To build Stanbol from source you need Java 6 and maven 2.2.1 + (version as
defined in the pom). You probably need also:</li>
+</ol>
+<p><code>$ export MAVEN_OPTS="-Xmx512M -XX:MaxPermSize=128M </code></p>
+<ol>
+<li>
+<p>Fetch the sources from the Apache code repository</p>
+<p><code>$ svn co https://svn.apache.org/repos/asf/incubator/stanbol/
stanbol</code> </p>
+</li>
+<li>
+<p>From the source directory run:</p>
+<p><code>$ mvn clean install</code></p>
+</li>
+</ol>
+<p><em>Updating sources</em>: From your Stanbol root directory call <code>$
mvn clean</code>, then <code>% svn update</code> and build a new version with
<code>$ mvn clean install</code>.</p>
+<h3 id="launch_the_server">Launch the Server</h3>
+<p>Several launchers are available under the <code>launchers/</code>
directory:</p>
+<ul>
+<li>The <strong>"stable launcher"</strong> provides you with the most stable
components only,</li>
+<li>the <strong>"full launcher"</strong> contains all major components
available, </li>
+<li>the <strong>"stateless launcher"</strong> lets you work with Apache
Stanbol components in offline mode and </li>
+<li>the <strong>"kres launcher"</strong> activates the rules machinery only.
</li>
+</ul>
+<p>You can launch the Server via e.g:</p>
+<p><code>$ java -Xmx1g -jar
full/target/org.apache.stanbol.launchers.full-0.9-SNAPSHOT.jar</code></p>
+<p>Your instance is then available on <a
href="http://localhost:8080">http://localhost:8080</a>. You may change the
default port number by passing a <code>-p 9090</code> options to the
commandline launcher. Upon first startup, a folder named <code>sling/</code> is
created in the current folder. This folder will hold the files for any database
used by Stanbol, deployment configuration and logs.</p>
+<h3 id="http_entry_points">HTTP entry points</h3>
+<p>The web interface of your Apache Stanbol installation provides information
for the main HTTP entry points. Each resource comes with a web view that
documents the matching RESTful API for applications. Depending on which
launcher you use, you will get a different set of <a
href="services.html">services</a>. </p>
+<h2 id="usage_scenarios_for_apache_stanbol">Usage Scenarios for Apache
Stanbol</h2>
+<ul>
+<li><a href="contentenhancement.html">Content Enhancement</a></li>
+</ul>
+<p>Analyze textual content, enhance with with named entities (person, place,
organization), suggest links to open data sources.</p>
+<ul>
+<li>Working with "local" Entities</li>
+</ul>
+<p>Use locally defined entities (e.g. thesaurus concepts) from an
organization's context.<br />
+</p>
+<ul>
+<li>Semantic Search in Portals</li>
+</ul>
+<p>Store/index enhancements and content items. For a portal this would
facilitate semantic search applications.</p>
+<ul>
+<li>Refactoring Enhancements for SEO</li>
+</ul>
+<p>Refactor the enhancement result, its property names and ontology types
according your target ontology.</p>
+<ul>
+<li>Transforming CMS repository structures into ontologies
+ Provide repository structures as thesaurus or domain ontology, e.g.
categories.</li>
+</ul>
+<h2 id="technical_documentation">Technical Documentation</h2>
+<p>A detailed technical documentation of its components can be found here:</p>
+<ul>
+<li><a href="enhancer.html">Enhancer</a></li>
+<li><a href="engines.html">Enhancement Engines</a></li>
+<li>Entityhub</li>
+<li>Contenthub</li>
+<li>CMS Adapter </li>
+<li>Ontology Manager</li>
+<li>Reasoners</li>
+<li>Reengineer</li>
+<li>Rules </li>
+<li>Benchmark</li>
+</ul>
+<h2 id="follow_and_support_apache_stanbol">Follow and support Apache
Stanbol</h2>
+<ul>
+<li>
+<p>Subscribe to the Apache Stanbol Developers mailinglist by sending an email
to <code>stanbol-dev-subscribe AT incubator.apache.org</code>.</p>
+</li>
+<li>
+<p>Please report bugs on the <a
href="https://issues.apache.org/jira/browse/STANBOL">Apache issue
tracker</a></p>
+</li>
+</ul>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>
Added: websites/staging/stanbol/trunk/content/stanbol/docs/trunk/services.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/services.html
(added)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/services.html Thu
Jun 30 09:18:42 2011
@@ -0,0 +1,96 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"
"http://www.w3.org/TR/html4/loose.dtd">
+<html>
+<head>
+<!--
+
+ Licensed to the Apache Software Foundation (ASF) under one or more
+ contributor license agreements. See the NOTICE file distributed with
+ this work for additional information regarding copyright ownership.
+ The ASF licenses this file to You under the Apache License, Version 2.0
+ (the "License"); you may not use this file except in compliance with
+ the License. You may obtain a copy of the License at
+
+ http://www.apache.org/licenses/LICENSE- 2.0
+
+ Unless required by applicable law or agreed to in writing, software
+ distributed under the License is distributed on an "AS IS" BASIS,
+ WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ See the License for the specific language governing permissions and
+ limitations under the License.
+-->
+
+ <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+ <title>Apache Stanbol - </title>
+ <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+ <link rel="icon" type="image/png"
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+ <div id="navigation">
+ <img alt="Apache Stanbol" width="220" height="101"
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/>
+ <h1 id="stanbol_links">Stanbol links</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+</ul>
+<h1 id="asf_links">ASF links</h1>
+<ul>
+<li><a href="http://www.apache.org">Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0">License</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html">Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html">Become a
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/">Security</a></li>
+</ul>
+ </div>
+
+ <div id="content">
+ <h1 class="title"></h1>
+ <h1 id="http_entry_points_for_apache_stanbol_services">HTTP entry points
for Apache Stanbol services</h1>
+<p>The web interface of your Apache Stanbol installation provides information
for the main HTTP entry points. Each resource comes with a web view that
documents the matching RESTful API for applications. Depending on which
launcher you use, you will get a different set of services.</p>
+<ul>
+<li>[/engines][stan-engines] (stable)</li>
+</ul>
+<p>This is a stateless interface to allow clients to submit content to be
analyzed by the Enhancement Engines and get the resulting RDF enhancements at
once without storing anything on the server-side.</p>
+<ul>
+<li><a href="http://localhost:8080/entityhub">/entityhub</a> (stable)</li>
+</ul>
+<p>The Entityhub provides two main services: The entity network manages
(external) referenced sites and lets one consume entity information, the
Entityhub supports managing and using local entities.</p>
+<ul>
+<li>[/system/console][stan-admin] (stable)</li>
+</ul>
+<p>This is the OSGi administration console (for administrators and
developers). The initial username / password is set to admin / admin. Use the
console to add new bundles and activate, de-activate and configure components.
The console can also be used to perform hot-(re)deployment of any OSGi bundles.
For instance to re-deploy a new version of this web interface, go to the
<code>$STANBOL_HOME/enhancer/jersey</code> source folder and run the following
command:</p>
+<p><code>$ mvn install -o -DskipTests -PinstallBundle
+ -Dsling.url=http://localhost:8080/system/console</code></p>
+<ul>
+<li><a href="http://localhost:8080/sparql">/sparql</a> (full)</li>
+</ul>
+<p>This is the SPARQL endpoint for the Stanbol store. SPARQL is the standard
query language the most commonly used to provide interactive access to semantic
knowledge bases.</p>
+<ul>
+<li><a href="http://localhost:8080/contenthub">/contenthub</a> (full)</li>
+</ul>
+<p>This is a stateful interface to submit content to analyze and store the
results on the server. It is then possible to browse the resulting enhanced
content items. The longer-term goal of this endpoint is to implement faceted
semantic search of the enhanced content items. (Note: Up to now, this endpoint
is a proof of concept / experimental feature. It does not actually store the
content on the disk, just in memory.)</p>
+<ul>
+<li>
+<p><a href="http://dev.iks-project.eu:8080/ontonet">/ontonet</a> (full)</p>
+</li>
+<li>
+<p><a href="http://dev.iks-project.eu:8080/factstore">/factstore</a> (full)</p>
+</li>
+<li />
+</ul>
+ </div>
+
+ <div id="footer">
+ <div class="copyright">
+ <p>
+ Copyright © 2010 The Apache Software Foundation, Licensed under
+ the <a href="http://www.apache.org/licenses/LICENSE-2.0">Apache
License, Version 2.0</a>.
+ <br />
+ Apache, Stanbol and the Apache feather and Stanbol logos are
trademarks of The Apache Software Foundation.
+ </p>
+ </div>
+ </div>
+
+</body>
+</html>