Re: Apache Stanbol

srecko joksimovic Mon, 15 Aug 2011 02:10:27 -0700

Hello Mr Westenthaler,

I have configured Apache Stanbol. I can access services using
http://localhost:8080. I tried to do steps that you described, and I did not
have a problem with step 2, but I am not sure that I did step 1 correctly.


When I issued command like this one:

curl -X POST -H "Content-Type: application/rdf+xml" --data
"@acm_ccs-proton_rdfxml.owl" http://localhost:8080/entityhub/entity

I got response:
*curl: (52) Empty reply from server*

This file which I am trying to upload is almost 1MB big. It should not be a
problem with memory.

Do you have any idea what could be a problem? How I can check that I did the
job correctly?

Best,
Srecko Joksimovic

On Tue, Aug 9, 2011 at 4:45 PM, Rupert Westenthaler <
[email protected]> wrote:

> Hallo Mr Srecko Joksimovic
>
> Two initial Notes:
>
> * I am sending this also to the stanbol-dev list, because I am on
> vacation until end of August and will only read/answer mails from time
> to time until than. So maybe others of the Stanbol community will be
> able to answer questions more quickly then myself.
> * I am sending this reply via my gmail account, because somehow I am
> not able to connect to the SMTP server of salzburgresearch.at.
>
> - - -
>
> Actually this is possible by using the TaxonomyLinkingEngine that is
> currently in development.
>
> Basically there are two steps:
>
> (1) upload your Ontology to the Entityhub
> (2) configure an instance of the TaxonomyLinkingEngine to use your
> ontology (as stored by the Entityhub) to enhance your documents
>
>
> for (1) there are two possibilities
>
> (1a) upload your ontology directly to the "/entityhub"
>
> If you have a Stanbol instance running at "http://localhost:8080/"; the
> followng curl command can be used to upload an RDF graph (such as your
> ontology)
>
> curl -X POST -H "Content-Type: application/rdf+xml" --data
> "@{rdfXmlFile}" http://localhost:8080/entityhub/entity
>
> This assumes that your Ontology is encoded as "application/rdf+xml".
> {rdfXmlFile} denotes to the path to the file on the local file system.
> Please also have a look at [1] for a more detailed description on how
> to upload RDF data to the "/entityhub" endpoint.
>
> (1b) manage the RDF data as an own ReferencedSite
>
> The Entityhub also supports the configuration of so called
> ReferencedSites. This allows to manage different RDF datasets (e.g.
> dbpedia.org, geonames.org, IPTC thesaurus [2], your ontology, ...).
> Such sites are read-only and accessible under
> http://localhost:8080/entityhub/site/{siteId}
>
> Stanbol also includes an indexing tool that helps you in creating such
> "referencedSites" for local datasets (such as your Ontology). A
> detailed description of this process can be found at [3]. [2] is a
> specific configuration of [3] for the IPTC thesaurus.
>
> In general for testing I would suggest to use (1a) because it is much
> easer to start with. However (1a) will require to load your ontology
> in memory therefore it will not work for big datasets. In addition
> (1b) allows you to optimize your ontology (by defining mappings)
> during the indexing process and it gives you the possibility to use
> different Ontologies for enhancing your content. Therefore for more
> complex usage scenarios option (1b) is typically the better solution.
>
> (2) Configure the TaxonomylinkingEngine
>
> This Engine is by default included in the Full launcher of Stanbol. If
> you prefer the stable launcher you will need to manually install it
> (e.g. by using the Apache Felix Wenconsole accessible under
> http://localhost:8080/system/console default user: admin pwd: admin).
> The bundle to install can be found at
>
> "{stanbol-trunk}/enhancer/engines/taxonomylinking/target/org.apache.stanbol.enhancer.engine.taxonomy-0.9.0-incubating-SNAPSHOT.jar"
>
> Assuming a running Stanbol Instance that includes the
> TaxonomyLinkingEngine the following steps are required for the
> configuration:
> 1. go the "configuration tab" (
> http://localhost:8080/system/console/configMgr)
> 2. search for "Apache Stanbol Enhancement Engine for Taxonomy linking"
> 3. pres on the [+] button on the end of this line (this will open the
> dialog to configure a new instance of this engine)
> 4. configure the source. If you used (1a) put "entityhub" in case of
> (1b) you must enter the siteId of the referenced site.
> 5. configure the property used to search for labels of the concepts in
> your Ontology. The default is rdfs:label (typically used for labeling
> concepts within ontologies, but you might also want to use a different
> one based on your ontology
>
> I would not recommend to change any other properties because this
> engine is currently under development and changes to this values might
> not be implemented or even worse break this engine.
>
>
>
> best
> Rupert Westenthaler
>
> [1] http://markmail.org/message/plertstj6fx4xutj
> [2] http://markmail.org/message/rgwug74s3u6olrby
> [3]
> http://svn.apache.org/repos/asf/incubator/stanbol/trunk/entityhub/indexing/genericrdf/README.md
>
> Am 08.08.2011, 22:53 Uhr, schrieb srecko joksimovic
> <[email protected]>:
> >
> > Hello Mr Westenthaler,
> >
> > Mr Pereira gave me your contact, in order to ask you a few questions
> about
> > Apache Stanbol. I suppose that you have received Mr Pereira's email which
> he
> > sent to me, and you maybe already know what the problem is.
> >
> > Mr Pereira suggested me to use Apache Stanbol. The idea is to load my
> > ontology, and than to call method which is going to annotate provided
> text,
> > based on loaded ontology. Could you please explain to me how to implement
> > scenario which I described using Apache Stanbol? If you could provide me
> a
> > code example, I would be very grateful.
> >
> > Thank you very much.
> >
> > Best,
> > Srecko Joksimovic
>
> --
> | Rupert Westenthaler             [email protected]
> | Bodenlehenstraße 11                             ++43-699-11108907
> | A-5500 Bischofshofen
>

Re: Apache Stanbol

Reply via email to