Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining (John Erling Blad)

karthik prasad Sat, 31 Mar 2012 06:29:33 -0700

Thank you very much for your feedback Jeblad.

I will immediately look into how this can be best implemented by extending
the Mediawiki API.
Do kindly let me know about my other ideas so that I can shape my proposal
well.


The mentor for ideas I am interested in is Oren Bochman. But I couldn't
track him on the irc.
I would love to interact with him or any other mentor and discuss my ideas
in detail.

I am recahable at
Email      : karthikprasad...@gmail.com
SkypeID  : prasadkarthik
Facebook: facebook.com/prasadkarthik
Google+  : gplus.to/karthikprasad
twitter      : twitter.com/_karthikprasad


Date: Sat, 31 Mar 2012 12:05:00 +0200
> From: John Erling Blad <jeb...@gmail.com>
> To: Wikimedia developers <wikitech-l@lists.wikimedia.org>
> Subject: Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining
> Message-ID:
>        <CAJcMX2=pm-fcm4dg33uwfcmyhy1rj4hte-gpd2mjbzugzcd...@mail.gmail.com
> >
> Content-Type: text/plain; charset=windows-1252
>
> Your point (a) "Implementing a wikiSumarizer widget which will give the
> summary of the page being read by the user" could be extremely usefull for
> a hover/ helpbubbles functionality where bubbles with a small explanations
> are created within external articles. Such functionality imply creating an
> extension to the Mediawiki API.
>
> Jeblad
>
> On Sat, Mar 31, 2012 at 11:09 AM, karthik prasad <
> karthikprasad...@gmail.com
> > wrote:
>
> > Hello,
> > I am Karthik from India - currently pursuing 3rd year Bachelors in
> Computer
> > Science and Engineering in PESIT, Bangalore.
> >
> > I am interested in some of the projects proposed for Google SOC 2012 and
> > would love to work and contribute the same to the open-source world.
> >
> > I am very attracted towards Text Processing and Data Mining. I have
> > undertaken course in Natural Language Processing. I am currently working
> on
> > a project "Automatic Essay Grader" - A system that automatically grades
> > English essays based on Spelling, Grammar and Structure, Coherence,
> > Frequent phrases and Vocabulary as weighted parameters. Realized by
> > implementing a self-designed algorithm ? studying the ?relation graph? of
> > words of the essay.
> >
> > I had also worked on "Sentiment Analysis on Web" - Extraction of reviews
> > about a gadget from tech-review forums, analysis of the Sentiments of the
> > reviews thus predicting the sentiment/opinion associated with that gadget
> > and then generation of appropriate Rating on the scale of 10.
> >
> > The following projects mentioned on the mediawiki's ideas page caught my
> > eye:
> > 1) Wikipedia Corpus Tools
> > 2) Lucene Lemma Analyzers based on Morphology Extraction from Wikipedia
> > Text
> > 3) Lucene Automatic Query Expansion from Wikipedia Text
> > 4) Translation spellchecking
> >
> > Apart from the above projects, I also had the following ideas which i
> feel
> > will be of great help if implemented.
> > a) Implementing a wikiSumarizer widget which will give the summary of the
> > page being read by the user.
> > b) An automatic coherence analyser which would make it easy to find out
> if
> > the article on a given page talks about the same topic
> > c) Details Aggregator for page.
> >
> > I would be grateful if you could kindly let me know about the specific
> > requirements of the projects and about your thoughts on my ideas so that
> I
> > can suitably write a proposal.
> >
> > Eagerly waiting for your response.
> >
> > Thanking you.
> >
> > Best Regards,
> > Karthik.
> > _______________________________________________
> > Wikitech-l mailing list
> > Wikitech-l@lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
>
>
_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining (John Erling Blad)

Reply via email to