[Dbpedia-gsoc] Accepted student - ready to incorporate and work ; )

2015-04-28 Thread Pablo Estrada
Hello everyone,
my proposal ("Get up and walk! - Adding live-ness to the Triple Pattern
Fragments server") has been accepted for this year's GSoC. I am very happy
and ready to get on with it.
I am in the middle of the semester, so I'll be a bit busy for a few weeks
still; but I'll start 'bonding' ; )
Any advice is welcome, and see you around!
Btw, should I go and subscribe to the dev list of dbpedia as well?

Thanks for the chance!
Pablo
--
One dashboard for servers and applications across Physical-Virtual-Cloud 
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___
Dbpedia-gsoc mailing list
Dbpedia-gsoc@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc


Re: [Dbpedia-gsoc] Fwd: Contribute to DbPedia

2015-04-28 Thread David Przybilla
Hi Abhishek,

You are free to contribute :) I will try to keep on reviewing PRs
if that is alright.



On Tue, Apr 28, 2015 at 7:47 AM, Abhishek Gupta  wrote:

> Hi all,
>
> My proposal has not been selected for GSoC. But I am still want to
> continue with my project. So can someone provide me any guidelines (if I
> can continue)?
>
> Thanks,
> Abhishek
>
> On Thu, Apr 9, 2015 at 11:53 PM, Abhishek Gupta  wrote:
>
>> Hi Thiago,
>>
>> Thanks for your reply and assurance.
>> Moreover I replied your question for the extraction framework and I have
>> also created an issue regarding using bold instances as the probable
>> surface forms here
>> .
>>
>> Thanks,
>> Abhishek
>>
>> On Thu, Apr 9, 2015 at 1:19 AM, Thiago Galery  wrote:
>>
>>> Hi Abhishek,
>>> sorry for taking so long to write to you. Things at work have been
>>> really busy. About the issue you raised about the originality of your
>>> proposal, rest assured that no one sent a proposal similar to yours.
>>>
>>> I'm happy that you send a PR for the extraction framework. It seems that
>>> Dimitris is already taking a look at it.
>>> As for your suggestions in Spotlight, just removing the stopword filter
>>> is something that I don't advise that much, cause I remember getting a lot
>>> of crap once. Maybe it should be modified somehow. If you have a good idea
>>> and want to send a PR, it would be very welcome. I think discussing things
>>> on github would be better.
>>>
>>> All the best,
>>> Thiago
>>>
>>> On Mon, Apr 6, 2015 at 6:15 AM, Abhishek Gupta 
>>> wrote:
>>>
 Hi all,

 Recently I was checking out the indexing process of dbpedia-spotlight
 and I observe a certain things:

 1) There is a missing constructor definition in wikiPage object
 
  for
 instance defined in function wikiPageCopy here
 .
 For this I have created an PR
 https://github.com/dbpedia/extraction-framework/pull/377

 2) For stopwords filter defined here
 ,
 I did an analysis over the conceptURI's extraction with stopwords list
 here
 .
 From the analysis it came out that we are neglecting around 25481 entities
 in which almost all of them are from important category like music, film,
 band etc. E.g. Am_(musician)
 , Home_(2015_film)
 , The_Who
  etc. And if we do case
 sensitive checking (checking if entity contains more than one capital
 alphabets as one is default) even then we will reject some entities which
 has only one word like Am, Home etc. Moreover the garbage (can't etc.) we
 will incur after removing this filter won't be much. So i suggest if we can
 remove this filter.

 3) I would like to suggest a surface form extraction. If we can extract
 bold text in the first line of the wikipedia then we can use that as
 probable Surface Form for that entity. E.g. Stanford_University
 , Aon_(company)
 , Radio_Warwick
 , Phi_Gamma_Delta
  etc. These are the best
 Surface Forms for the respective Entity.

 Thanks,
 Abhishek

 On Fri, Mar 27, 2015 at 11:56 AM, Abhishek Gupta 
 wrote:

> Hi all,
>
> I would also like to inform that in one of the recent mails my
> proposal has been gone public when Thiago accidentally sent a mail to me
> and dbpedia-gsoc mailing list. Details of the mails are below. The Google
> docs link was there in the quotes and the doc can be seen and even edited
> by anyone with that link, but nobody have changed the content of the doc.
> And I believe there might be chances that someone will copy my ideas. So
> I request you to take care of this issue. And I hope this might not
> affect my application.
> As of now I have changed the sharing settings, so please inform me if
> there will be any access problem.
>
> *Mail details:*
> from:Thiago Galery to:Abhishek Gupta <
> a.gu...@gmail.com>,
> dbpedia-gsoc 
> date:Tue, Mar 24, 2015 at 3:47 AMsubject:Re: [Dbpedia-gsoc] Fwd:
> Contribute to DbPedia
>
> I have also modified my

[Dbpedia-gsoc] Thanks!

2015-04-28 Thread Philipp Dowling
Hello guys,

First of all, thanks a lot for accepting my proposal! I'm really looking
forward to working with you over the summer.

The community bonding period is running until May 25th - is there something
specific I should be doing during this time? I'm only subscribed to this
list currently, are there other lists I should pay attention to, or IRC
channels I should visit?

Cheers,
Philipp
--
One dashboard for servers and applications across Physical-Virtual-Cloud 
Widest out-of-the-box monitoring support with 50+ applications
Performance metrics, stats and reports that give you Actionable Insights
Deep dive visibility with transaction tracing using APM Insight.
http://ad.doubleclick.net/ddm/clk/290420510;117567292;y___
Dbpedia-gsoc mailing list
Dbpedia-gsoc@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/dbpedia-gsoc