Dear Pablo and Rupert,

I'm sorry for sending an incomplete email just now. Please ignore the
previous email.

Thanks a lot for your guidance! Based on your comments, let me
summarise how the current services map to the four stages:
(1) Spotting, (2) Candidate Selection, (3) Disambiguation, (4)
Filtering
/annotate: (1), (2), (3) first candidate, (4)
/candidate: (1), (2), (3) all candidates
/disambiguate: (3)
/feedback: not implemented
Please let me know if this summary is incorrect.

However, in Apache Stanbol each Enhancement Engine in an Enhancement
Chain handles a single task (Rupert, is that right?), and the
functions of the Enhancement Engines are not supposed to overlap.
We therefore need to adjust the DBpedia Spotlight services as follows:
/spot: (1), to be implemented in this project, for
  DBpediaSpotlightSpotEngine
/candidate: (2), to be refactored from current status, for
  DBpediaSpotlightCandidateEngine
/disambiguate: (3), to be refactored from current status, for
  DBpediaSpotlightDisambiguateEngine
/filter: (4), to be implemented in this project, for
  DBpediaSpotlightFilterEngine
As for /annotate, I think it is a composite service that does not fit
Apache Stanbol's "single task for each Enhancement Engine"
requirement. But we can retain it in DBpedia Spotlight for other
users (i.e. those not using Apache Stanbol).
The /feedback API sounds interesting, and I'd like to try to
implement it, though more details should be discussed beforehand.
However, I'm not sure there is enough time to complete it within the
two summer months, so I think we can mark it as optional for this
GSoC project. If the project is going well before the mid-term
evaluation in July and there is enough time, I'll try my best to
implement it and integrate it into Apache Stanbol.
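
To make the proposed split of services concrete, here is a rough
sketch of how I picture the four single-task stages and the retained
/annotate as their composition. This is only an illustration of the
idea: the class and method names (SpotlightChainSketch, Stage,
Annotations, runChain and so on) are made up by me and are neither the
Stanbol nor the DBpedia Spotlight API, the disambiguation and
filtering rules are toy placeholders, and the lexicon is invented
sample data.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

public class SpotlightChainSketch {

    /** Shared state passed from stage to stage (hypothetical; not a Stanbol or Spotlight class). */
    public static final class Annotations {
        public final String text;
        public final List<String> surfaceForms = new ArrayList<>();                // filled by (1)
        public final Map<String, List<String>> candidates = new LinkedHashMap<>(); // filled by (2)
        public final Map<String, String> chosen = new LinkedHashMap<>();           // filled by (3), pruned by (4)

        public Annotations(String text) { this.text = text; }
    }

    /** One task per stage, mirroring "single task for each Enhancement Engine". */
    public interface Stage {
        void process(Annotations a);
    }

    /** (1) /spot: toy spotter that marks every lexicon key occurring in the text. */
    public static Stage spotter(Map<String, List<String>> lexicon) {
        return a -> {
            for (String surfaceForm : lexicon.keySet()) {
                if (a.text.contains(surfaceForm)) {
                    a.surfaceForms.add(surfaceForm);
                }
            }
        };
    }

    /** (2) /candidate: looks up all candidate resources for each spotted surface form. */
    public static Stage candidateSelector(Map<String, List<String>> lexicon) {
        return a -> {
            for (String surfaceForm : a.surfaceForms) {
                a.candidates.put(surfaceForm, lexicon.getOrDefault(surfaceForm, List.of()));
            }
        };
    }

    /** (3) /disambiguate: toy rule that simply keeps the first candidate. */
    public static Stage disambiguator() {
        return a -> a.candidates.forEach((surfaceForm, resources) -> {
            if (!resources.isEmpty()) {
                a.chosen.put(surfaceForm, resources.get(0));
            }
        });
    }

    /** (4) /filter: toy rule that drops any chosen resource found in the blocked set. */
    public static Stage filter(Set<String> blocked) {
        return a -> a.chosen.values().removeIf(blocked::contains);
    }

    /** Runs the stages in order over one shared Annotations object. */
    public static Annotations runChain(String text, List<Stage> chain) {
        Annotations a = new Annotations(text);
        for (Stage stage : chain) {
            stage.process(a);
        }
        return a;
    }

    /** /annotate kept as a convenience: the four single-task stages composed. */
    public static Annotations annotate(String text, Map<String, List<String>> lexicon, Set<String> blocked) {
        return runChain(text, List.of(
                spotter(lexicon), candidateSelector(lexicon), disambiguator(), filter(blocked)));
    }

    public static void main(String[] args) {
        Map<String, List<String>> lexicon = new LinkedHashMap<>(); // made-up toy data
        lexicon.put("Berlin", List.of("dbpedia:Berlin", "dbpedia:Berlin_(band)"));
        lexicon.put("Paris", List.of("dbpedia:Paris"));
        Annotations result = annotate("Berlin and Paris", lexicon, Set.of("dbpedia:Paris"));
        System.out.println(result.chosen);
    }
}
```

The point of the sketch is only that each stage reads what the
previous one wrote and adds its own piece, so any subset of stages
could be exposed as its own service or engine. How Stanbol actually
passes data between engines is exactly what my second question below
is about.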

If the project scope discussed above is generally OK, I'd like to
start working on the project plan and come up with a draft project
proposal.

By the way, I have two small questions about DBpedia Spotlight
spotting and Enhancement Chains:
1. For Pablo: [3] mentions three implementations of spotting: Ling
Pipe Spotter, Trie Spotter and Ling Pipe Chunk Spotter. How does
/annotate determine which implementation is best for a given service
request? Can the user choose among them manually by sending different
parameter(s)?
2. For Rupert: could you please show me some examples of an
Enhancement Chain? I've studied some Enhancement Engines here [1], and
I understand how an individual Enhancement Engine works and how to
implement a new one. After studying [2], however, I still find
Enhancement Chains a little confusing. Could you please point me to
the source code of a concrete Enhancement Chain implementation? I want
to understand the data I/O interface between Enhancement Engines. In
other words, how does the output of one Enhancement Engine become the
input of another?

Best regards,
Siwei Yu

[1] http://incubator.apache.org/stanbol/docs/trunk/enhancer/engines/list.html
[2] http://incubator.apache.org/stanbol/docs/trunk/enhancer/chains/
[3] http://wiki.dbpedia.org/spotlight/technicaldocumentation?v=3qy

> On Wed, Mar 21, 2012 at 4:27 PM, Rupert Westenthaler
> <[email protected]> wrote:
>>
>> Hi Siwei Yu, Pablo
>>
>> see my comments inline. To make it more readable I also removed the
>> parts of the mail that are not relevant to my comments.
>>
>> On Wed, Mar 21, 2012 at 12:01 AM, Pablo Mendes <[email protected]> wrote:
>> > On Tue, Mar 20, 2012 at 4:24 PM, Siwei Yu <[email protected]> wrote:
>> >> 2. Should I develop one Enhancement Engine containing three services,
>> >> or three engines (i.e. each service as an engine)? It's maybe related
>> >> to the service function granularity. What's your opinion?
>> >
>> >
>> > We could have one engine for each task separately, and an enhancement chain
>> > should connect them together. We should also introduce a REST API /spot for
>> > (1). We could perhaps make /candidates implement only (2) and make /annotate
>> > accept a &verbose=on to act like the current /candidates does.
>> >
>> > Besides all of this reorganization that has to happen, Rupert is the guy
>> > from Stanbol that can help you position your application in that regard.
>> >
>>
>> I fully agree with that.
>>
>> Having separate EnhancementEngines for spotting, candidates selection
>> and disambiguation would provide a lot of additional flexibility to
>> experienced Stanbol users as they could even use parts of the DBpedia
>> Spotlight functionalities within their existing enhancement engines.
>>
>> The definition of a DBpedia Spotlight EnhancementChain ensures that
>> typical users can use Spotlight without the need to know the inner
>> workings. Users would just need to send enhancement requests to
>> "http://{host}:{port}/enhancer/chain/dbpedia", assuming that the DBpedia
>> Spotlight chain is called "dbpedia". There would even be the
>> possibility to make the DBpedia Spotlight EnhancementChain the default
>> enhancement chain so that requests to "/enhancer" would be processed
>> by it.
>>
>> >>
>> >> By the way, my name is Siwei Yu. I have good knowledge of semantic
>> >> technologies, such as RDF, OWL, SPARQL. I'm also familiar with the
>> >> mainstream Java based RDF/OWL processing tools like owlapi, Jena,
>> >> Sesame, AllegroGraph. I have strong Java coding skills and good
>> >> knowledge of software design patterns. My research background
>> >> meets the requirements very well. I believe it'll be a wonderful
>> >> summer working with the DBpedia Spotlight community.
>> >
>> >
>> > It would be good if you leveraged some of your Semantic Web background in
>> > your application. The idea of a /feedback API, which receives corrections
>> > made by the users could fit well in this regard.
>> >
>>
>> A feedback API is also something that would be interesting for the
>> Stanbol Enhancer.
>>
>> best
>> Rupert Westenthaler
>>
>> --
>> | Rupert Westenthaler             [email protected]
>> | Bodenlehenstraße 11                             ++43-699-11108907
>> | A-5500 Bischofshofen
