Hi Nishant & welcome to the DBpedia community.

The idea you chose is very interesting and needed in order to move DBpedia
forward.
One thing that you should know is that you will have to work with Scala
code (and maybe a little Java) so one of your main tasks is to get familiar
with Scala. [1]
Before you do you might also want to try out the extraction framework.
Download and extract a few languages and get familiar with the options and
the output data [2]
Try to create a few more tests for the existing tests we have in the code
(e.g. datetime extraction in your language [3]) and submit them with a pull
request [4]
IntelliJ [5] might help in this regard.

Documentation sometimes might not be perfect [6] ;) so if you get stuck
somewhere, please ask us. And if you find something missing in the docs,
feel free to add it.

Best,
Dimitris


[1] http://twitter.github.io/scala_school/
[2]
https://github.com/dbpedia/extraction-framework/wiki/Extraction-Instructions
[3]
https://github.com/dbpedia/extraction-framework/blob/master/core/src/test/scala/org/dbpedia/extraction/dataparser/DateTimeParserTest.scala
[4] https://github.com/dbpedia/extraction-framework/wiki/Contributing
[5]
https://github.com/dbpedia/extraction-framework/wiki/Setting-up-IntelliJ-IDEA
[6]
http://yannesposito.com/Scratch/img/blog/Yesod-tutorial-for-newbies/owl_draw.png



On Thu, Feb 27, 2014 at 7:12 PM, Nishant Prateek <npratee...@gmail.com>wrote:

> Hi,
>
> I am Nishant. I am a sophomore undergraduate student at International
> Institute of Information Technology, Hyderabad. It is one of the leading
> research institutes in India. I am pursuing my B.Tech. in Computer Science
> with MS by research in Computational Linguistics (five-year dual degree).
>
>
> My academic interest lies in the field of Natural Language Processing and
> Information Retrieval. I was going through the list of organizations in
> GSoC'14 and found the work going on in DBPedia very interesting.
>
> For my GSoC summer project, I would like to work on Testing Crowdsourcing.
> Can you please suggest me some quick tasks or some good tutorials to
> familiarize myself with the work.
>
> I am good at programming and am fairly comfortable with programming in
> C/C++ and Python.
>
> Thank you.
>
>
> ------------------------------------------------------------------------------
> Flow-based real-time traffic analytics software. Cisco certified tool.
> Monitor traffic, SLAs, QoS, Medianet, WAAS etc. with NetFlow Analyzer
> Customize your own dashboards, set traffic alerts and generate reports.
> Network behavioral analysis & security monitoring. All-in-one tool.
>
> http://pubads.g.doubleclick.net/gampad/clk?id=126839071&iu=/4140/ostg.clktrk
> _______________________________________________
> Dbpedia-gsoc mailing list
> Dbpedia-gsoc@lists.sourceforge.net
> https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc
>
>


-- 
Kontokostas Dimitris
------------------------------------------------------------------------------
Flow-based real-time traffic analytics software. Cisco certified tool.
Monitor traffic, SLAs, QoS, Medianet, WAAS etc. with NetFlow Analyzer
Customize your own dashboards, set traffic alerts and generate reports.
Network behavioral analysis & security monitoring. All-in-one tool.
http://pubads.g.doubleclick.net/gampad/clk?id=126839071&iu=/4140/ostg.clktrk
_______________________________________________
Dbpedia-gsoc mailing list
Dbpedia-gsoc@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc

Reply via email to