Author: buildbot
Date: Tue Feb  7 08:44:03 2012
New Revision: 804080

Log:
Staging update by buildbot for stanbol

Added:
    websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contenthub/
    
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contenthub/index.html
Modified:
    websites/staging/stanbol/trunk/content/stanbol/docs/trunk/components.html

Modified: 
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/components.html
==============================================================================
--- websites/staging/stanbol/trunk/content/stanbol/docs/trunk/components.html 
(original)
+++ websites/staging/stanbol/trunk/content/stanbol/docs/trunk/components.html 
Tue Feb  7 08:44:03 2012
@@ -90,7 +90,7 @@
 <p>The <a href="entityhub.html">Entityhub</a> is the component, which lets you 
cache and manage local indexes of repositories such as dbpedia but also custom 
data (e.g. product descriptions, contact data, specialized topic thesauri).</p>
 </li>
 <li>
-<p>The <a href="contenthub.html">Contenthub</a> provides you with a combined 
index of your content items together with enhancements to facilitate semantic 
search on your entire knowledge base.</p>
+<p>The <a href="contenthub/">Contenthub</a> provides you with a combined index 
of your content items together with enhancements to facilitate semantic search 
on your entire knowledge base.</p>
 </li>
 <li>
 <p>The <a href="factstore/">FactStore</a> is a component that let's use store 
relations between entities identified by their URIs. This relation between two 
entities is called a <em>fact</em>.</p>

Added: 
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contenthub/index.html
==============================================================================
--- 
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contenthub/index.html 
(added)
+++ 
websites/staging/stanbol/trunk/content/stanbol/docs/trunk/contenthub/index.html 
Tue Feb  7 08:44:03 2012
@@ -0,0 +1,117 @@
+<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" 
"http://www.w3.org/TR/html4/loose.dtd";>
+<html>
+<head>
+<!--
+
+    Licensed to the Apache Software Foundation (ASF) under one or more
+    contributor license agreements.  See the NOTICE file distributed with
+    this work for additional information regarding copyright ownership.
+    The ASF licenses this file to You under the Apache License, Version 2.0
+    (the "License"); you may not use this file except in compliance with
+    the License.  You may obtain a copy of the License at
+
+       http://www.apache.org/licenses/LICENSE- 2.0
+
+    Unless required by applicable law or agreed to in writing, software
+    distributed under the License is distributed on an "AS IS" BASIS,
+    WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+    See the License for the specific language governing permissions and
+    limitations under the License.
+-->
+
+  <link href="/stanbol/css/stanbol.css" rel="stylesheet" type="text/css">
+  <title>Apache Stanbol - ContentHub</title>
+  <meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
+  <link rel="icon" type="image/png" 
href="/stanbol/images/stanbol-logo/stanbol-favicon.png"/>
+</head>
+
+<body>
+  <div id="navigation"> 
+  <a href="/stanbol/index.html"><img alt="Apache Stanbol" width="220" 
height="101" border="0" 
src="/stanbol/images/stanbol-logo/stanbol-2010-12-14.png"/></a>
+  <h1 id="stanbol">Stanbol</h1>
+<ul>
+<li><a href="/stanbol/index.html">Home</a></li>
+<li><a href="/stanbol/docs/trunk/tutorial.html">Tutorial</a></li>
+<li><a href="/stanbol/docs/trunk/">Documentation</a></li>
+<li><a href="/stanbol/docs/trunk/building.html">Building</a></li>
+</ul>
+<h1 id="project">Project</h1>
+<ul>
+<li><a href="/stanbol/docs/trunk/mailinglists.html">Mailing Lists</a></li>
+<li><a href="https://issues.apache.org/jira/browse/STANBOL";>Issue 
Tracker</a></li>
+<li><a href="/stanbol/team.html">Project Team</a></li>
+<li><a href="http://www.apache.org/licenses/LICENSE-2.0";>License</a></li>
+</ul>
+<h1 id="downloads">Downloads</h1>
+<ul>
+<li><a href="/stanbol/docs/trunk/downloads.html">Overview</a></li>
+</ul>
+<h1 id="the_asf">The ASF</h1>
+<ul>
+<li><a href="http://www.apache.org";>Apache Software Foundation</a></li>
+<li><a href="http://www.apache.org/foundation/thanks.html";>Thanks</a></li>
+<li><a href="http://www.apache.org/foundation/sponsorship.html";>Become a 
Sponsor</a></li>
+<li><a href="http://www.apache.org/security/";>Security</a></li>
+</ul>
+  </div>
+  
+  <div id="content">
+    <h1 class="title">ContentHub</h1>
+    <p>The Apache Stanbol Contenthub is a persistent document store which 
enables text based document submission 
+and semantic search together with faceted search capability on submitted 
documents.</p>
+<h2 id="technical_description_of_its_components">Technical Description of its 
components</h2>
+<h3 id="contenthub_store">ContentHub Store</h3>
+<p>It is the subcomponent that actually stores the document and its metadata 
persistently. In current implementation only text/plain documents are 
allowed.</p>
+<p>The storage part of the Contenthub provide basic methods such as create, 
put, get and delete. When a document is submitted, it delegates the textual 
content to Stanbol Enhancer to get its enhancements. While submitting the 
document, it is also possible to specify external metadata as field value pairs 
to the document. </p>
+<p>The document itself and all specified external metadata are indexed through 
an embedded Apache Solr core which is created specifically for Contenthub. 
+Since documents are given unique IDs while indexing, using its unique ID, a 
document can be retrieved or deleted from Contenthub. 
+ContentHub provides an HTML interface for its functionalities under the 
following endpoint, which is available after running the full launcher of 
Apache Stanbol:</p>
+<div class="codehilite"><pre><span class="n">http:</span><span 
class="sr">//</span><span class="n">localhost:8080</span><span 
class="o">/</span><span class="n">contenthub</span>
+</pre></div>
+
+
+<h3 id="contenthub_search">ContentHub Search</h3>
+<p>ContentHub has a semantic search subcomponent that allows search over the 
submitted documents. A HTML interface for search functionality can be reached 
under:</p>
+<div class="codehilite"><pre><span class="n">http:</span><span 
class="sr">//</span><span class="n">localhost:8080</span><span 
class="sr">/contenthub/s</span><span class="n">earch</span>
+</pre></div>
+
+
+<p>To start a search, one enters a keyword and choose the search engines that 
will execute the query. After having the first search results, all facets and 
values of these facets will also arrive. Later on, when a facet constraint is 
chosen, documents and facets will be dynamically updated according to chosen 
constraint(s).</p>
+<p>The Contenthub Search API also provides a means of specifying an ontology 
which carries semantic information to make the search more semantic. How this 
external ontology is exploited is explained within the search engine 
documentation below. Furthermore, Search API enables specifying constraints for 
the search operation. The aim is to provide faceted search functionality 
through Java interface based on the specified constraints.<br />
+</p>
+<p>The search part of this component is formed by several search engines that 
work sequentially and contribute to the search results. Each search engine 
works with a given search context. The initialization of the search context is 
performed before the execution of any search engine. Each search engine makes 
use of the information embedded in the search context and populates the context 
with new results, such as resulting documents, related ontological resources, 
new keywords etc ...</p>
+<p>Currently, three search engines are active in search subcomponent:</p>
+<h4 id="ontology_resource_search_engine">Ontology Resource Search Engine</h4>
+<p>This engine works when an additional ontology is specified at the beginning 
of the search. A SPARQL query based on a LARQ index is executed on the 
specified ontology to find individuals and classes related with the keyword. 
When a class is found, it is added to search the context as a related class 
resource and then, subclasses, superclasses and instances of all these classes 
are found and added to the search context.</p>
+<p>When an individual about keyword is found it is added as a related 
individual resource to search context and it's classes are found. These classes 
are added to the search context using the same methodology explained in the 
previous paragraph.</p>
+<h4 id="enhancement_search_engine">Enhancement Search Engine</h4>
+<p>This engine designed to work on enhancement graph which contains all 
enhancements of content items submitted to the Contenthub. </p>
+<p>When a document is submitted to ContentHub, its content is enhanced 
automatically by Enhancer component. 
+In a single Clerezza graph, all the enhancements are kept together and this 
graph is indexed with LARQ. The LARQ index is automatically updated when a new 
enhancement is added.</p>
+<p>Enhancement Search Engine, executes a SPARQL query on enhancement graph to 
find enhancements about the given keyword.
+When an enhancement is found, the document from which the enhancement was 
obtained is added to search context as a related document resource.</p>
+<h4 id="solr_search_engine">SOLr Search Engine</h4>
+<p>The <a href="">SOLr</a> Search Engine is the engine that gives full-text 
and faceted search capabilities to the Contenthub.</p>
+<p>Since every document is indexed to SOLr (to the core created for 
Contenthub), it is possible to do full-text
+search over documents' content and metadata. After the first search, all the 
facet constraints of resulting documents will be available for faceted search. 
When a facet constraint is chosen, resulting documents and facet constraints 
are updated dynamically. </p>
+<p>Later on, related class and individual resources about the keyword, which 
are found by Ontology Resource Search Engine, are searched over SOLr using 
their resource name. </p>
+<p>After all, document resources founded by Enhancement Search Engine is 
examined. If there is a document whose field values does not match with facet 
constraints, these document resources are removed from the search results.</p>
+<h2 id="building_and_launching_contenthub">Building and Launching 
ContentHub</h2>
+<p>Since ContentHub is built with Apache Stanbol it can be launched under 
"Full Launcher". For detailed instructions to build and launch Apache Stanbol 
see the README file through the following link:</p>
+<div class="codehilite"><pre><span class="n">http:</span><span 
class="sr">//s</span><span class="n">vn</span><span class="o">.</span><span 
class="n">apache</span><span class="o">.</span><span class="n">org</span><span 
class="sr">/repos/</span><span class="n">asf</span><span 
class="sr">/incubator/s</span><span class="n">tanbol</span><span 
class="sr">/trunk/</span><span class="n">README</span><span 
class="o">.</span><span class="n">md</span>
+</pre></div>
+  </div>
+  
+  <div id="footer">
+    <div class="copyright">
+      <p>
+        Copyright &copy; 2010 The Apache Software Foundation, Licensed under 
+        the <a href="http://www.apache.org/licenses/LICENSE-2.0";>Apache 
License, Version 2.0</a>.
+        <br />
+        Apache, Stanbol and the Apache feather and Stanbol logos are 
trademarks of The Apache Software Foundation.
+      </p>
+    </div>
+  </div>
+  
+</body>
+</html>


Reply via email to