Hi Shashank,

It looks alright.
I think you can skip the Spark part, as you are not interested in the
project concerning the model building.

As for the specific project you selected I think best would be to:

- Understand how a spotlight model is divided (Surface form store, Context
Store, Candidate Store). Probably this blog [1] entry can help you  as well
as playing with [2]

- Also reading the main paper on which spotlight is based on (I previously
mentioned it but it is also mentioned in the literature at github)

[1]
http://engineering.idioplatform.com/2015/02/23/spotlight-model-editor.html
[2] https://github.com/idio/spotlight-model-editor

On Thu, Mar 12, 2015 at 1:35 PM, shashank juyal <sjuyal...@gmail.com> wrote:

> Hi David,
>
> Please find attached the warm up tasks I have done.
> I am still involved in some of the issues and documentation. I have also
> mentioned those in the pdf.
> Please let me know if any other warm up task has to be done.
>
> Thanks and Regards,
> Shashank Juyal
>
>
>
> On Sun, Mar 8, 2015 at 12:36 AM, David Przybilla <dav.alejan...@gmail.com>
> wrote:
>
>> Hi Shashank,
>>
>> On DBpedia Spotlight – Better Context Vectors:
>>
>> Here are the DBPedia Spotlight warm tasks:
>> https://github.com/dbpedia-spotlight/dbpedia-spotlight/wiki/Warm-up-tasks
>>
>> if you take a look at the github issue page you should find some of the
>> problems we are dealing with. One of the ideas could be experimenting with
>> word2vec.
>>
>> Have a nice weekend :)
>>
>> On Sat, Mar 7, 2015 at 11:46 AM, shashank juyal <sjuyal...@gmail.com>
>> wrote:
>>
>>> Hi,
>>>
>>> I am a Masters student in International Institute of Information
>>> technology, Hyderabad (IIIT-H). I am interested in taking part in this
>>> year's GSOC. Many of the projects in DBPedia sounds very familiar and
>>> interesting to me as I have worked closely with many of the concepts and
>>> technologies used in the project.
>>>
>>> I have worked previously with Wikipedia data and built a small search
>>> over it based on tf-idf score and my own parser. Also currently I am
>>> working in a project "Question Answer techniques using NLP" which uses
>>> concepts like wordtovec, CBOW, NL Processing and translation to query
>>> language, which are mentioned in some of the projects in DBPedia-Spotlight.
>>>
>>> Based on this, I would like to work on the following projects:
>>>
>>> 1) Fact Extraction from Wikipedia Text
>>> 2) Keyword Search on DBpedia
>>> 3) Deploying a DBpedia Question Answering Engine
>>> 4) DBpedia Spotlight – Better Context Vectors
>>>
>>> Please let me know the warm-up tasks in the above projects.
>>>
>>> Linked Profile: in.linkedin.com/in/shajuyal
>>> Github Profile: https://github.com/sjuyal
>>>
>>> Thanks and Regards,
>>> Shashank Juyal
>>>
>>>
>>> ------------------------------------------------------------------------------
>>> Dive into the World of Parallel Programming The Go Parallel Website,
>>> sponsored
>>> by Intel and developed in partnership with Slashdot Media, is your hub
>>> for all
>>> things parallel software development, from weekly thought leadership
>>> blogs to
>>> news, videos, case studies, tutorials and more. Take a look and join the
>>> conversation now. http://goparallel.sourceforge.net/
>>> _______________________________________________
>>> Dbpedia-gsoc mailing list
>>> Dbpedia-gsoc@lists.sourceforge.net
>>> https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
>>>
>>>
>>
>
------------------------------------------------------------------------------
Dive into the World of Parallel Programming The Go Parallel Website, sponsored
by Intel and developed in partnership with Slashdot Media, is your hub for all
things parallel software development, from weekly thought leadership blogs to
news, videos, case studies, tutorials and more. Take a look and join the 
conversation now. http://goparallel.sourceforge.net/
_______________________________________________
Dbpedia-gsoc mailing list
Dbpedia-gsoc@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc

Reply via email to