Sorry, the correct URL is http://{stanbol}/system/console/configMgr

On 15/07/13 18:15, Sergio Fernández wrote:
Have you checked the

1) go to http://{stanbol}/config/system/console/configMgr

2) find your EntityHub Linking engine

3) and then "Link ProperNouns only"

The documentation in that configuration is quite useful I think:

"If activated only ProperNouns will be matched against the Vocabulary.
If deactivated any Noun will be matched. NOTE that this parameter
requires a tag of the POS TagSet to be mapped against 'olia:ProperNoun'.
Otherwise mapping will not work as expected.
(enhancer.engines.linking.properNounsState)"

Hope this helps. Keep in mind that issues of this kind are not easy to
solve by email.

Cheers,

On 15/07/13 16:31, Sawhney, Tarandeep Singh wrote:
Thanks, Sergio, for your response.

As I understand it, I should enable the option *"Link ProperNouns only"* in
the EntityHub linking engine and also use the "opennlp-pos" engine in my
weighted chain.

I made these changes but am still unable to extract "University of Salzburg".

Please find the RDF/XML output from the enhancer below.

Please let me know if I have misunderstood your suggestions.

One more thing: our ontology (yet to be built) will contain entities
other than people, places and organisations, for example belts, bags, etc.

best regards
tarandeep

<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:j.0="http://purl.org/dc/terms/"
    xmlns:j.1="http://fise.iks-project.eu/ontology/">
  <rdf:Description rdf:about="urn:enhancement-197792bf-f1e8-47bf-626a-3cdfbdb863b3">
    <j.0:type rdf:resource="http://purl.org/dc/terms/LinguisticSystem"/>
    <j.1:extracted-from rdf:resource="urn:content-item-sha1-3b2998e66582544035454850d2dd81755b747849"/>
    <j.1:confidence rdf:datatype="http://www.w3.org/2001/XMLSchema#double">0.9999964817340454</j.1:confidence>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/Enhancement"/>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/TextAnnotation"/>
    <j.0:language>en</j.0:language>
    <j.0:created rdf:datatype="http://www.w3.org/2001/XMLSchema#dateTime">2013-07-15T14:25:43.829Z</j.0:created>
    <j.0:creator rdf:datatype="http://www.w3.org/2001/XMLSchema#string">org.apache.stanbol.enhancer.engines.langdetect.LanguageDetectionEnhancementEngine</j.0:creator>
  </rdf:Description>
</rdf:RDF>
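
The output above contains a single enhancement, created by the language detection engine, and no entity annotations. A minimal Python sketch for checking this on any enhancer response follows. It assumes the flat rdf:Description serialization quoted in this thread and uses only the standard library. The function name `summarize_enhancements` is illustrative and not part of Stanbol.

```python
import xml.etree.ElementTree as ET
from collections import Counter

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DCTERMS = "http://purl.org/dc/terms/"
FISE = "http://fise.iks-project.eu/ontology/"


def summarize_enhancements(rdf_xml: str) -> dict:
    """Count fise: enhancement types and dcterms:creator values.

    Assumption: every enhancement is a top-level rdf:Description element,
    as in the response quoted in this thread.
    """
    root = ET.fromstring(rdf_xml)
    types = Counter()
    creators = Counter()
    for desc in root.findall(f"{{{RDF}}}Description"):
        for type_el in desc.findall(f"{{{RDF}}}type"):
            uri = type_el.get(f"{{{RDF}}}resource", "")
            if uri.startswith(FISE):
                # Keep only the local name, e.g. "TextAnnotation".
                types[uri[len(FISE):]] += 1
        for creator_el in desc.findall(f"{{{DCTERMS}}}creator"):
            creators[(creator_el.text or "").strip()] += 1
    return {"types": dict(types), "creators": dict(creators)}


# Trimmed copy of the response quoted above.
SAMPLE = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:j.0="http://purl.org/dc/terms/"
    xmlns:j.1="http://fise.iks-project.eu/ontology/">
  <rdf:Description rdf:about="urn:enhancement-197792bf-f1e8-47bf-626a-3cdfbdb863b3">
    <j.0:type rdf:resource="http://purl.org/dc/terms/LinguisticSystem"/>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/Enhancement"/>
    <rdf:type rdf:resource="http://fise.iks-project.eu/ontology/TextAnnotation"/>
    <j.0:language>en</j.0:language>
    <j.0:creator>org.apache.stanbol.enhancer.engines.langdetect.LanguageDetectionEnhancementEngine</j.0:creator>
  </rdf:Description>
</rdf:RDF>"""

if __name__ == "__main__":
    summary = summarize_enhancements(SAMPLE)
    print(summary)
    if "EntityAnnotation" not in summary["types"]:
        print("No fise:EntityAnnotation found: no entity was linked.")
```

If the only creator listed is the language detection engine, the linking engine produced no suggestions for the text, which matches the problem described in this message.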



On Mon, Jul 15, 2013 at 7:32 PM, Sergio Fernández <sergio.fernan...@salzburgresearch.at> wrote:

As I said: have you checked the proper noun detection and POS tagging in
your chain?

For instance, enhancing the text "I studied at the University of Salzburg,
which is based in Austria" works on the demo server:

http://dev.iks-project.eu:8081/enhancer/chain/dbpedia-proper-noun
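
The same test can be repeated against a local chain by posting the text to the chain's endpoint. The sketch below is a minimal example under assumptions: the base URL `http://localhost:8080` and the helper names are hypothetical, and only the `/enhancer/chain/<name>` path is taken from the URL above. It builds a plain-text POST that asks for RDF/XML.

```python
import urllib.parse
import urllib.request


def build_enhance_request(base_url: str, chain: str, text: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for an enhancement chain."""
    url = f"{base_url.rstrip('/')}/enhancer/chain/{urllib.parse.quote(chain)}"
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=UTF-8",
            "Accept": "application/rdf+xml",
        },
        method="POST",
    )


def enhance(base_url: str, chain: str, text: str) -> str:
    """Send the request and return the response body as text."""
    request = build_enhance_request(base_url, chain, text)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


# Example call (requires a running instance; base URL is an assumption):
# print(enhance("http://localhost:8080", "customChain",
#               "I studied at the University of Salzburg, which is based in Austria"))
```

Running the same sentence through both the custom chain and a chain known to work makes it easier to see whether the difference lies in the chain configuration or in the vocabulary.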


Here are the details:

http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entitylinking#proper-noun-linking-wzxhzdk14enhancerengineslinkingpropernounsstatewzxhzdk15


Cheers,



On 15/07/13 15:27, Sawhney, Tarandeep Singh wrote:

Just to add to my previous email

If I add another individual, "MyUniversity", to my ontology under the
class University:



      <!-- http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#MyUniversity -->

      <owl:NamedIndividual rdf:about="http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#MyUniversity">
          <rdf:type rdf:resource="http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#University"/>
          <rdfs:label>MyUniversity</rdfs:label>
      </owl:NamedIndividual>
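
Since the problem only shows up for labels that contain spaces, it can help to list the labels the ontology actually exposes. The following is a minimal sketch, assuming an RDF/XML ontology in the style of the snippet above. The helper names and the second individual's URI are hypothetical, since that individual is not quoted in the thread.

```python
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
RDFS = "http://www.w3.org/2000/01/rdf-schema#"
OWL = "http://www.w3.org/2002/07/owl#"


def individual_labels(owl_xml: str) -> dict:
    """Map each owl:NamedIndividual URI to its list of rdfs:label values."""
    root = ET.fromstring(owl_xml)
    labels = {}
    for individual in root.iter(f"{{{OWL}}}NamedIndividual"):
        uri = individual.get(f"{{{RDF}}}about", "").strip()
        labels[uri] = [
            (label.text or "").strip()
            for label in individual.findall(f"{{{RDFS}}}label")
        ]
    return labels


def multi_word_labels(labels: dict) -> list:
    """Return every label that contains at least one space."""
    return [text for values in labels.values() for text in values if " " in text]


SAMPLE = """<rdf:RDF
    xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
    xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
    xmlns:owl="http://www.w3.org/2002/07/owl#">
  <owl:NamedIndividual rdf:about="http://www.semanticweb.org/vi5/ontologies/2013/6/untitled-ontology-13#MyUniversity">
    <rdfs:label>MyUniversity</rdfs:label>
  </owl:NamedIndividual>
  <!-- Hypothetical individual: its real URI is not shown in the thread. -->
  <owl:NamedIndividual rdf:about="http://example.org/ontology#UniversityOfSalzburg">
    <rdfs:label>University of Salzburg</rdfs:label>
  </owl:NamedIndividual>
</rdf:RDF>"""

if __name__ == "__main__":
    print(multi_word_labels(individual_labels(SAMPLE)))
```

The labels it reports are the ones worth testing first against the enhancement chain.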


With all the configurations described in the Word document (in the Google
Drive folder), when I pass text containing "MyUniversity", my enhancement
chain extracts "MyUniversity" and links it to the "University" type.

But the same set of configurations doesn't work for the individual
"University of Salzburg".

If any of you could point out what we are missing in order to extract
custom entities whose labels contain spaces, it would be a great help for
us to proceed further on our journey of using and contributing to Stanbol.

with best regards,
tarandeep



On Mon, Jul 15, 2013 at 5:57 PM, Sawhney, Tarandeep Singh <tsawh...@innodata.com> wrote:

Thanks, Sergio and Dileepa, for your responses.

We haven't been able to resolve the issue. We therefore decided to keep
just one class and one instance, "University of Salzburg", in our custom
ontology and to try to extract and link this entity, but we could not get
this working. I am sure we are missing some configuration.

I am sharing a Google Drive folder at the link below:

https://drive.google.com/folderview?id=0B-vX9idwHlRtRFFOR000ZnBBOWM&usp=sharing


This folder has 3 files:

1) A Word document with Felix console screenshots of all the configuration
we did for the Yard, the Yard site, the entity linking engine and the
weighted chain
2) Our custom ontology
3) The result of a SPARQL query against our graph URI using the SPARQL endpoint

May I ask you all to please look at these files and let us know if we
are missing something in the configuration?

We referred to the web pages below to configure Stanbol to use our
custom ontology for entity extraction and linking:

http://stanbol.apache.org/docs/trunk/customvocabulary.html

http://stanbol.apache.org/docs/trunk/components/entityhub/managedsite

http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entityhublinking

http://stanbol.apache.org/docs/trunk/components/enhancer/chains/weightedchain.html


Thanks in advance for your valuable help.

Best regards
tarandeep



On Sat, Jul 13, 2013 at 5:57 PM, Sergio Fernández <sergio.fernan...@salzburgresearch.at> wrote:

  Hi,

I'm not an expert on entity linking, but in my experience such
behaviour could be caused by the proper noun detection. Further details
at:

http://stanbol.apache.org/docs/trunk/components/enhancer/engines/entitylinking

In addition, I'd suggest you take a look at the netiquette of mailing
lists. This is an open source community; therefore messages starting with
"URGENT" are not very polite, especially when sent on a Friday afternoon,
when people may already be out for the weekend, or even on vacation.

Best,
Sergio



On 12/07/13 15:54, Sethi, Keval Krishna wrote:

  Hi,

I am using Stanbol to extract entities by plugging in a custom vocabulary
as per
http://stanbol.apache.org/docs/trunk/customvocabulary.html

These are the steps I followed:

    Configured a Clerezza Yard.
    Configured a Managed Yard site.
    Updated the site by plugging in the ontology (containing the custom entities).
    Configured an EntityHub linking engine (*customLinkingEngine*) with the managed site.
    Configured a *customChain* which uses the following engines:

      - *langdetect*
      - *opennlp-sentence*
      - *opennlp-token*
      - *opennlp-pos*
      - *opennlp-chunker*
      - *customLinkingEngine*

Now I am able to extract entities like Adidas using *customChain*.

However, I am facing an issue extracting entities whose names contain a
space, for example "Tommy Hilfiger".

A chain like *dbpedia-disambiguation* (which comes bundled with the
Stanbol instance) correctly extracts entities like "Tommy Hilfiger".

I tried configuring *customLinkingEngine* the same way as
*dbpedia-disamb-linking* (configured in *dbpedia-disambiguation*), but it
still did not extract the above entity.

I have invested more than a week in this and am running out of options.

Please help me resolve this issue.


  --
Sergio Fernández
Salzburg Research
+43 662 2288 318
Jakob-Haringer Strasse 5/II
A-5020 Salzburg (Austria)
http://www.salzburgresearch.at









