Dear Sir/Madam, I'm a Chinese PhD student major in Semantic Web. I'm quite interested in contributing to DBpedia Spotlight in GSoC 2012. Among the project ideas, I would like to work on integrating DBpedia Spotlight as Enhancement Engine within Apache Stanbol. I have studied the User Manual [1] and the Technical Documentation [2] of DBpedia Spotlight, as well as the Engine Enhancement development guide from Apache Stanbol [3]. In the source code, SpotlightInterface [4] provides a good demonstration of the programmatic usage DBpedia Spotlight. After the document studying work, I can totally understand the goals and the approaches of this project, and I think I can make it this summer. Here're some questions:
1. The general procedure of DBpedia Spotlight can be broken down in four stages: (1) Spotting, (2) Candidate Selection, (3) Disambiguation, (4) Filtering. There are three web services for users. What stage(s) does each service work on? Am I right for the following mappings? - http://spotlight.dbpedia.org/rest/annotate : (1) (2) (3) - http://spotlight.dbpedia.org/rest/candidates : (1) (2) - http://spotlight.dbpedia.org/rest/disambiguate : (3) We can perform the additional stage (4) in each service for data filtering. Is that true? 2. Should I develop one Enhancement Engine containing three services, or three engines (i.e. each service as an engine)? It's maybe related to the service function granularity. What's your opinion? 3. In [1], you mention that "core/SpotlightFactory" and "rest/Candidates.java" would help understanding the programmatic usage of DBpedia Spotlight. But I can not find them in svn. It's greatly appreciated if you point out their package paths. Or any other important source codes? By the way, my name is Siwei Yu. I have good knowledge of semantic technologies, such as RDF, OWL, SPARQL. I'm also familiar with the mainstream Java based RDF/OWL processing tools like owlapi, Jena, Sesame, AllegroGraph. I have strong Java coding skills with of good knowledge of the software design patterns. My research background meets the requirements very well. I believe it'll be a wonderful summer working with the DBpedia Spotlight community. Looking forward to hearing from you! [1] http://wiki.dbpedia.org/spotlight/usersmanual?v=i0m [2] http://wiki.dbpedia.org/spotlight/technicaldocumentation?v=3qy [3] http://incubator.apache.org/stanbol/docs/trunk/enhancer/engines/ [4] http://dbp-spotlight.svn.sourceforge.net/viewvc/dbp-spotlight/trunk/rest/src/main/java/org/dbpedia/spotlight/web/rest/SpotlightInterface.java?revision=363&view=markup Yours, Siwei Yu ------------------------------------------------------------------------------ This SF email is sponsosred by: Try Windows Azure free for 90 days Click Here http://p.sf.net/sfu/sfd2d-msazure _______________________________________________ Dbp-spotlight-users mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/dbp-spotlight-users
