Dear Sir/Madam,

I'm a Chinese PhD student major in Semantic Web. I'm quite interested
in contributing to DBpedia Spotlight in GSoC 2012. Among the project
ideas, I would like to work on integrating DBpedia Spotlight as
Enhancement Engine within Apache Stanbol. I have studied the User
Manual [1] and the Technical Documentation [2] of DBpedia Spotlight,
as well as the Engine Enhancement development guide from Apache
Stanbol [3]. In the source code, SpotlightInterface [4] provides a
good demonstration of the programmatic usage DBpedia Spotlight. After
the document studying work, I can totally understand the goals and the
approaches of this project, and I think I can make it this summer.
Here're some questions:

1. The general procedure of DBpedia Spotlight can be broken down in
four stages: (1) Spotting, (2) Candidate Selection, (3)
Disambiguation, (4) Filtering. There are three web services for users.
What stage(s) does each service work on? Am I right for the following
mappings?
- http://spotlight.dbpedia.org/rest/annotate : (1) (2) (3)
- http://spotlight.dbpedia.org/rest/candidates : (1) (2)
- http://spotlight.dbpedia.org/rest/disambiguate : (3)
We can perform the additional stage (4) in each service for data
filtering. Is that true?

2. Should I develop one Enhancement Engine containing three services,
or three engines (i.e. each service as an engine)? It's maybe related
to the service function granularity. What's your opinion?

3. In [1], you mention that "core/SpotlightFactory" and
"rest/Candidates.java" would help understanding the programmatic usage
of DBpedia Spotlight. But I can not find them in svn. It's greatly
appreciated if you point out their package paths. Or any other
important source codes?

By the way, my name is Siwei Yu. I have good knowledge of semantic
technologies, such as RDF, OWL, SPARQL. I'm also familiar with the
mainstream Java based RDF/OWL processing tools like owlapi, Jena,
Sesame, AllegroGraph. I have strong Java coding skills with of good
knowledge of the software design patterns. My research background
meets the requirements very well. I believe it'll be a wonderful
summer working with the DBpedia Spotlight community.

Looking forward to hearing from you!

[1] http://wiki.dbpedia.org/spotlight/usersmanual?v=i0m
[2] http://wiki.dbpedia.org/spotlight/technicaldocumentation?v=3qy
[3] http://incubator.apache.org/stanbol/docs/trunk/enhancer/engines/
[4] 
http://dbp-spotlight.svn.sourceforge.net/viewvc/dbp-spotlight/trunk/rest/src/main/java/org/dbpedia/spotlight/web/rest/SpotlightInterface.java?revision=363&view=markup

Yours,
Siwei Yu

------------------------------------------------------------------------------
This SF email is sponsosred by:
Try Windows Azure free for 90 days Click Here 
http://p.sf.net/sfu/sfd2d-msazure
_______________________________________________
Dbp-spotlight-users mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users

Reply via email to