Matthias Samwald wrote:
Hi David,

Thanks for the hint. I remember trying LUPedia a few months ago -- now it has a defined API, which is a good addition. Unfortunately, the quality of results could be improved quite a bit.

Here is a scientific statement that I would like to see annotated:
"Albizia julibrissin has anxiolytic-like effects that are mediated by the changes of the serotonergic nervous system, especially 5-HT1A receptors."
Since MarkMail hasn't indexed this page, could you make an HTML page somewhere, with the excerpt above in a paragraph, then reply with the doc URL, so I can quickly test to see how close we can get to what you seek via our Sponger Middleware.


Kingsley

LUPedia is unable to identify any entities in this string, although DBpedia would contain them.
http://dbpedia.org/resource/Albizia_julibrissin
http://dbpedia.org/resource/Anxiolytic
http://dbpedia.org/page/5-HT1A_receptor

et cetera.

It seems to recognize person names, as for the string "Michael Jackson", the following URIs are returned:

# http://dbpedia.org/resource/Parademon
# http://dbpedia.org/resource/Michael_Jackson

The first result is a bit puzzling (DBpedia tells me that 'In the DC Universe, Parademons are monstrous shock troops of Apokolips used by Darkseid to maintain the order of Apokolips.').

LUPedia does not seem to do any kind of stemming either, as submitting the string "Michael Jacksons" reduces the list of extracted URIs to:

# http://dbpedia.org/resource/Parademon

LUPedia in its current form will not perform too well in practical settings.

Cheers,
Matthias Samwald

--------------------------------------------------
From: "Davide Palmisano" <dav...@asemantics.com>
Sent: Tuesday, February 02, 2010 2:27 PM
To: "Matthias Samwald" <samw...@gmx.at>
Cc: <public-lod@w3.org>
Subject: Re: DBpedia-based entity recognition service / tool?

Hi Matthias,

have you ever tried this http://lupedia.ontotext.com/ ? Perhaps it may help.

cheers,

Davide

On Tue, Feb 2, 2010 at 1:26 PM, Matthias Samwald <samw...@gmx.at> wrote:
Dear LOD community,

I would be glad to hear your advice on how to best accomplish a simple task: extracting DBpedia entities (identified with DBpedia URIs) from a string of
text. With good accuracy and recall, possibly with some options to
constraint the recognized entities to some subset of DBpedia, based on
categories. The tool or service should be performant enough to process large
numbers of strings in a reasonable amount of time.
Given the prolific creation of tiny tools and services in this community I am puzzled about my inability to find anything that accomplishes this task.
Could you point me to something like that? Are there tools/services for
Wikipedia that I could use?
Zemanta seems to be too much geared towards 'enhanced blogging', while
OpenCalais does not return Wikipedia/DBpedia identifiers. Please correct me
if I am wrong.

Cheers,
Matthias






--

Regards,

Kingsley Idehen President & CEO OpenLink Software Web: http://www.openlinksw.com
Weblog: http://www.openlinksw.com/blog/~kidehen
Twitter: kidehen





Reply via email to