Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-12 Thread Gautham Shankar
Robert Stojnic  gmail.com> writes:

> 
> 
> Hello,
> 
> Yep, generating the wodnet itself is a challenging and interesting 
> project. I was simply commenting on the Lucene part, i.e. on possible 
> application.
> 
> Currently the lucene backend works by employing some very general rules 
> (e.g. titles get highest score, then first sentence in articled, then 
> first paragraph, then words occurring in clusters e.g. within ~20 words, 
> etc..). However, in many cases they fail.
> 
> I found it helpful to run a number of queries and then see when/why the 
> search fails to identify the most relevant article. When wordnet is 
> mentioned, two examples come in mind which are both currently unsolved. 
> One is a query of type "mao last name" where an article "mao (surname)". 
> If we are lucky, the article will have words "last name" somewhere in 
> the article and the search won't totally fail, however, it would be nice 
> if the algorithm knew that "last name" == "surname". Another is when the 
> query is of type "population of africa" and the article "African 
> population". That is, it would be helpful if the backend knew of 
> language constructs like "x of y" == "x-an y". I wonder if Wordnet type 
> of approach can find those cases as well.
> 
> Cheers, Robert
> 
> On 06/04/12 17:54, Oren Bochman wrote:
> > Hi Robert Stojnic and Gautham Shankar
> >
> > I wanted to let Gautham that he has written a great proposal and thank you
> > for the feedback as well.
> >
> > I wanted to point out that in my point of view the main goal of this
> > multilingual wordnet isn't queary expansion, but rather means for ever
> > greater cross language capabilites in search and content analytics. A
> > wordnet seme can be  further disambiguated using a topic map algorithm run
> > which would consider all the contexts like you suggest. But this is planned
> > latter and so the wordnet would be a milestone.
> > To further clarify Gautham's integration will place a XrossLanguage-seme
> > Word Net tokens during indexing for words it recognises - allow the ranking
> > algorithm to use knowldege drawn from all the wikipedia articles.
> > (For example one part of the ranking would peek into featured article in
> > German on "A" rank it >> then "B" featured in Hungarian and use them as
> > oracles to rank A >> B >> ... in English where the picture might now be X
> > >> Y >> Z >> ... B >> A ...)
> > I mention in passing that I have began to develop dataset for use with open
> > relavance to sytematicly review and evaluate dramatic changes to relevance
> > due to changes in the search engine. I will post on this in due course as
> > it matures - since I am working on a number of smaller projects i'd like to
> > demo at WikiMania.)
> >

Hello,

Thank you, Oren, for your feedback. I would love to work on the wordnet
creation if given the opportunity.

Regarding Robert's mail: yes, I believe that using a wordnet would solve the
problem in both of the examples he pointed out.

In the first case, during query expansion the phrase "last name" would yield
its synonyms, one of them being "surname". Thus, when the query is run, there
will be a hit for the article "mao (surname)".

In the second example, the word "Africa" would be expanded to derived words
such as "African". In other cases the root word would be found and searched
for as well; here "Africa" is already a root word. Hopefully these
expansions will solve the language-construct problems.
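
The two expansions described above can be sketched roughly as follows. This is only an illustration: the tiny SYNONYMS and DERIVED tables are made-up stand-ins for a real wordnet, and expand_query is a hypothetical helper, not part of lucene-search.

```python
# Toy lexicon standing in for a wordnet; every entry is an illustrative assumption.
SYNONYMS = {"last name": ["surname", "family name"]}
DERIVED = {"africa": ["african"]}


def expand_query(query):
    """Return the original query terms followed by synonym and derived-form expansions."""
    text = query.lower().strip()
    terms = text.split()
    expansions = []
    # Phrase-level synonyms: match multi-word keys on word boundaries.
    padded = f" {' '.join(terms)} "
    for phrase, synonyms in SYNONYMS.items():
        if f" {phrase} " in padded:
            expansions.extend(synonyms)
    # Word-level derived forms (e.g. noun -> adjective).
    for term in terms:
        expansions.extend(DERIVED.get(term, []))
    # Keep the original terms first and avoid repeating them.
    return terms + [e for e in expansions if e not in terms]


print(expand_query("mao last name"))
print(expand_query("population of africa"))
```

With these toy tables, "mao last name" gains "surname" and "population of africa" gains "african", which is what would let the two problem articles match.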

Again, the key is to filter out the noise that could come from adding unwanted
expansion words. For this we will have to measure the relevance of the
expansion words with respect to the given search query and the existing
documents. Maybe the TSN concept that I pointed out in the earlier mail would
help in doing so.

Regards,
Gautham Shankar



___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-08 Thread Robert Stojnic


Hello,

Yep, generating the wordnet itself is a challenging and interesting 
project. I was simply commenting on the Lucene part, i.e. on a possible 
application.


Currently the Lucene backend works by employing some very general rules 
(e.g. titles get the highest score, then the first sentence in the article, 
then the first paragraph, then words occurring in clusters, e.g. within ~20 
words, etc.). However, in many cases they fail.
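
As a rough illustration of rules of this kind, here is a minimal sketch. The weights, the field names and the cluster bonus are all assumptions made up for the example; the thread gives only the ordering of the rules, not the actual numbers or the lucene-search implementation.

```python
import re

# Hypothetical weights: only their ordering (title > first sentence >
# first paragraph > body) comes from the description above.
WEIGHTS = {"title": 8.0, "first_sentence": 4.0, "first_paragraph": 2.0, "body": 1.0}
CLUSTER_WINDOW = 20   # "within ~20 words"
CLUSTER_BONUS = 1.5   # assumed value


def tokens(text):
    """Lowercase word tokens, ignoring punctuation."""
    return re.findall(r"\w+", text.lower())


def score(query, doc):
    """Score a document (dict of field name -> text) against a query."""
    terms = set(tokens(query))
    total = 0.0
    # Field rules: each query term found in a field adds that field's weight.
    for field, weight in WEIGHTS.items():
        total += weight * len(terms & set(tokens(doc.get(field, ""))))
    # Cluster rule: bonus if all query terms occur within one window of the body.
    body = tokens(doc.get("body", ""))
    if len(terms) > 1 and any(
        terms <= set(body[i:i + CLUSTER_WINDOW]) for i in range(len(body))
    ):
        total += CLUSTER_BONUS
    return total


surname = {"title": "Mao (surname)", "first_sentence": "Mao is a Chinese surname."}
person = {"title": "Mao Zedong", "first_sentence": "Mao Zedong was a Chinese leader."}
print(score("mao surname", surname) > score("mao surname", person))
```

Note that with the query "mao last name" neither title matches "last" or "name", so rules like these cannot tell the two articles apart; that is the failure case discussed next.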


I found it helpful to run a number of queries and then see when/why the 
search fails to identify the most relevant article. When wordnet is 
mentioned, two examples come to mind, both of which are currently unsolved. 
One is a query of the type "mao last name", where the relevant article is 
"mao (surname)". If we are lucky, the article will have the words "last 
name" somewhere in it and the search won't totally fail; however, it would 
be nice if the algorithm knew that "last name" == "surname". Another is when 
the query is of the type "population of africa" and the article is "African 
population". That is, it would be helpful if the backend knew of 
language constructs like "x of y" == "y-an x". I wonder if a Wordnet type 
of approach can find those cases as well.
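
For the "population of africa" / "African population" case, one possible sketch is a pattern rewrite backed by a noun-to-adjective table. The table ADJECTIVE_OF and the function name are hypothetical; a real system would presumably take the adjectival forms from the wordnet's derivational links.

```python
import re

# Hypothetical noun -> adjective table (an assumption for illustration only).
ADJECTIVE_OF = {"africa": "african", "europe": "european"}

# Matches two-part queries of the form "<head> of <noun>".
OF_PATTERN = re.compile(r"^(\w+) of (\w+)$")


def rewrite_of_construct(query):
    """Return alternative phrasings for queries of the form 'x of y'."""
    match = OF_PATTERN.match(query.lower().strip())
    if not match:
        return []
    head, noun = match.groups()
    adjective = ADJECTIVE_OF.get(noun)
    # Only rewrite when an adjectival form is known for the noun.
    return [f"{adjective} {head}"] if adjective else []


print(rewrite_of_construct("population of africa"))
```

Queries that do not fit the pattern, or whose noun has no known adjective, come back with no rewrite, so the original query is left untouched.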


Cheers, Robert


On 06/04/12 17:54, Oren Bochman wrote:

Hi Robert Stojnic and Gautham Shankar

I wanted to let Gautham that he has written a great proposal and thank you
for the feedback as well.

I wanted to point out that in my point of view the main goal of this
multilingual wordnet isn't queary expansion, but rather means for ever
greater cross language capabilites in search and content analytics. A
wordnet seme can be  further disambiguated using a topic map algorithm run
which would consider all the contexts like you suggest. But this is planned
latter and so the wordnet would be a milestone.
To further clarify Gautham's integration will place a XrossLanguage-seme
Word Net tokens during indexing for words it recognises - allow the ranking
algorithm to use knowldege drawn from all the wikipedia articles.
(For example one part of the ranking would peek into featured article in
German on "A" rank it >> then "B" featured in Hungarian and use them as
oracles to rank A >> B >> ... in English where the picture might now be X
>> Y >> Z >> ... B >> A ...)

I mention in passing that I have began to develop dataset for use with open
relavance to sytematicly review and evaluate dramatic changes to relevance
due to changes in the search engine. I will post on this in due course as
it matures - since I am working on a number of smaller projects i'd like to
demo at WikiMania.)

On Fri, Apr 6, 2012 at 6:01 PM, Gautham Shankar<
gautham.shan...@hiveusers.com>  wrote:


Robert Stojnic  gmail.com>  writes:



Hi Gautham,

I think mining wiktionary is an interesting project. However, about the
more practical Lucene part: at some point I tried using wordnet to
expand queries however I found that it introduces too many false
positives. The most challenging part I think it *context-based*
expansion. I.e. a simple synonym-based expansion is of no use because it
introduces too many meanings that the user didn't quite have in mind.
However, if we could somehow use the words in the query to find a
meaning from a set of possible meanings that could be really helpful.

You can look into existing lucene-search source to see how I used
wordnet. I think in the end I ended up using it only for very obvious
stuff (e.g. 11 = eleven, UK = United Kingdom, etc..).

Cheers, r.

On 06/04/12 01:58, Gautham Shankar wrote:

Hello,

Based on the feedback i received i have updated my proposal page.

https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

There is about 20 Hrs for the deadline and any final feedback would be
useful.
I have also submitted the proposal at the GSOC page.

Regards,
Gautham Shankar


Hi Robert,

Thank you for your feedback.
Like you pointed out, query expansion using the wordnet data directly,
reduces
the quality of the search.

I found this research paper very interesting.
www.sftw.umac.mo/~fstzgg/dexa2005.pdf
They have built a TSN (Term Semantic Network) for the given query based on
the
usage of words in the documents. The expansion words obtained from the
wordnet
are then filtered out based on the TSN data.

I did not add this detail to my proposal since i thought it deals more
with the
creation of the wordnet. I would love to implement the TSN concept once the
wordnet is complete.

Regards,
Gautham Shankar





Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-06 Thread Oren Bochman
Hi Robert Stojnic and Gautham Shankar

I wanted to let Gautham know that he has written a great proposal, and to
thank you for the feedback as well.

I wanted to point out that, from my point of view, the main goal of this
multilingual wordnet isn't query expansion, but rather a means to ever
greater cross-language capabilities in search and content analytics. A
wordnet seme can be further disambiguated using a topic map algorithm,
which would consider all the contexts, as you suggest. But this is planned
for later, and so the wordnet would be a milestone.
To further clarify: Gautham's integration will place XrossLanguage-seme
Word Net tokens during indexing for the words it recognises, allowing the
ranking algorithm to use knowledge drawn from all the Wikipedia articles.
(For example, one part of the ranking would peek into the featured article in
German on "A" and rank it >> then "B", featured in Hungarian, and use them as
oracles to rank A >> B >> ... in English, where the picture might now be X
>> Y >> Z >> ... B >> A ...)
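
One possible reading of the oracle idea, as a very rough sketch: concepts whose articles are featured in other language editions are promoted ahead of the local ranking. The ORACLES list, the language codes and the promotion rule are all assumptions made for illustration; nothing here is taken from an existing implementation.

```python
# Hypothetical oracle data: (language edition, concept whose article is featured
# there), listed in the order the oracles are consulted.
ORACLES = [("de", "A"), ("hu", "B")]


def rerank_with_oracles(local_ranking, oracles=ORACLES):
    """Promote oracle-endorsed concepts, keeping the local order for the rest."""
    oracle_order = [concept for _, concept in oracles]
    promoted = [c for c in oracle_order if c in local_ranking]
    rest = [c for c in local_ranking if c not in promoted]
    return promoted + rest


# English ranking before the oracles are applied: X >> Y >> Z >> B >> A
print(rerank_with_oracles(["X", "Y", "Z", "B", "A"]))
```

A real ranking would presumably blend such a signal with the existing score rather than override it outright.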

I mention in passing that I have begun to develop a dataset for use with Open
Relevance to systematically review and evaluate dramatic changes to relevance
due to changes in the search engine. I will post on this in due course as
it matures, since I am working on a number of smaller projects I'd like to
demo at Wikimania.

On Fri, Apr 6, 2012 at 6:01 PM, Gautham Shankar <
gautham.shan...@hiveusers.com> wrote:

> Robert Stojnic  gmail.com> writes:
>
> >
> >
> > Hi Gautham,
> >
> > I think mining wiktionary is an interesting project. However, about the
> > more practical Lucene part: at some point I tried using wordnet to
> > expand queries however I found that it introduces too many false
> > positives. The most challenging part I think it *context-based*
> > expansion. I.e. a simple synonym-based expansion is of no use because it
> > introduces too many meanings that the user didn't quite have in mind.
> > However, if we could somehow use the words in the query to find a
> > meaning from a set of possible meanings that could be really helpful.
> >
> > You can look into existing lucene-search source to see how I used
> > wordnet. I think in the end I ended up using it only for very obvious
> > stuff (e.g. 11 = eleven, UK = United Kingdom, etc..).
> >
> > Cheers, r.
> >
> > On 06/04/12 01:58, Gautham Shankar wrote:
> > > Hello,
> > >
> > > Based on the feedback i received i have updated my proposal page.
> > >
> > > https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc
> > >
> > > There is about 20 Hrs for the deadline and any final feedback would be
> > > useful.
> > > I have also submitted the proposal at the GSOC page.
> > >
> > > Regards,
> > > Gautham Shankar
> > > ___
> > > Wikitech-l mailing list
> > > Wikitech-l  lists.wikimedia.org
> > > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> > >
> >
>
> Hi Robert,
>
> Thank you for your feedback.
> Like you pointed out, query expansion using the wordnet data directly,
> reduces
> the quality of the search.
>
> I found this research paper very interesting.
> www.sftw.umac.mo/~fstzgg/dexa2005.pdf
> They have built a TSN (Term Semantic Network) for the given query based on
> the
> usage of words in the documents. The expansion words obtained from the
> wordnet
> are then filtered out based on the TSN data.
>
> I did not add this detail to my proposal since i thought it deals more
> with the
> creation of the wordnet. I would love to implement the TSN concept once the
> wordnet is complete.
>
> Regards,
> Gautham Shankar
>
>
>
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>

Hi again

-- 

Oren Bochman

Office tel. 061 4921492
Mobile +36 30 866 6706
skype id: orenbochman
e-mail: o...@romai-horizon.com
site http://www.riverport.hu


Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-06 Thread Gautham Shankar
Robert Stojnic  gmail.com> writes:

> 
> 
> Hi Gautham,
> 
> I think mining wiktionary is an interesting project. However, about the 
> more practical Lucene part: at some point I tried using wordnet to 
> expand queries however I found that it introduces too many false 
> positives. The most challenging part I think it *context-based* 
> expansion. I.e. a simple synonym-based expansion is of no use because it 
> introduces too many meanings that the user didn't quite have in mind. 
> However, if we could somehow use the words in the query to find a 
> meaning from a set of possible meanings that could be really helpful.
> 
> You can look into existing lucene-search source to see how I used 
> wordnet. I think in the end I ended up using it only for very obvious 
> stuff (e.g. 11 = eleven, UK = United Kingdom, etc..).
> 
> Cheers, r.
> 
> On 06/04/12 01:58, Gautham Shankar wrote:
> > Hello,
> >
> > Based on the feedback i received i have updated my proposal page.
> >
> > https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc
> >
> > There is about 20 Hrs for the deadline and any final feedback would be
> > useful.
> > I have also submitted the proposal at the GSOC page.
> >
> > Regards,
> > Gautham Shankar
> > ___
> > Wikitech-l mailing list
> > Wikitech-l  lists.wikimedia.org
> > https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> >
> 

Hi Robert,

Thank you for your feedback.
As you pointed out, query expansion using the wordnet data directly reduces 
the quality of the search.

I found this research paper very interesting:
www.sftw.umac.mo/~fstzgg/dexa2005.pdf
The authors build a TSN (Term Semantic Network) for the given query, based on 
the usage of words in the documents. The expansion words obtained from the 
wordnet are then filtered based on the TSN data.
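
To make the filtering idea concrete, here is a much simplified sketch. It is not the method from the paper: it swaps in a plain document-level co-occurrence count for the TSN, and the function names and the min_count threshold are assumptions.

```python
import re
from collections import Counter
from itertools import combinations


def build_cooccurrence(documents):
    """Count, for each unordered pair of terms, the documents containing both."""
    counts = Counter()
    for doc in documents:
        terms = sorted(set(re.findall(r"\w+", doc.lower())))
        counts.update(combinations(terms, 2))
    return counts


def filter_expansions(query_terms, candidates, counts, min_count=1):
    """Keep only candidates that co-occur with the query terms in the corpus."""
    kept = []
    for cand in candidates:
        # Pairs are stored in sorted order, so sort before looking them up.
        support = sum(counts[tuple(sorted((cand, q)))] for q in query_terms)
        if support >= min_count:
            kept.append(cand)
    return kept


docs = [
    "The river bank flooded after the rain",
    "The bank raised its interest rate",
    "Deposit money at the bank for interest",
]
counts = build_cooccurrence(docs)
print(filter_expansions(["bank", "interest"], ["deposit", "shore", "rate"], counts))
```

Here "shore" is dropped because it never appears alongside the query terms in the toy corpus, while "deposit" and "rate" are kept.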

I did not add this detail to my proposal since I thought the proposal deals 
more with the creation of the wordnet. I would love to implement the TSN 
concept once the wordnet is complete.

Regards,
Gautham Shankar





Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-06 Thread Robert Stojnic


Hi Gautham,

I think mining Wiktionary is an interesting project. However, about the 
more practical Lucene part: at some point I tried using wordnet to 
expand queries, but I found that it introduces too many false 
positives. The most challenging part, I think, is *context-based* 
expansion. I.e. a simple synonym-based expansion is of no use because it 
introduces too many meanings that the user didn't quite have in mind. 
However, if we could somehow use the words in the query to pick a 
meaning from a set of possible meanings, that could be really helpful.
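
A minimal sketch of what context-based expansion might look like, using a simplified gloss-overlap heuristic (in the spirit of the Lesk algorithm). The SENSES inventory is a made-up example, and none of this is taken from the lucene-search source.

```python
# Hypothetical sense inventory: each sense has a gloss and its own synonyms.
SENSES = {
    "bank": [
        {"gloss": "financial institution that accepts deposits and lends money",
         "synonyms": ["lender"]},
        {"gloss": "sloping land beside a river or lake",
         "synonyms": ["shore", "riverside"]},
    ],
}


def pick_sense(word, query):
    """Choose the sense whose gloss shares the most words with the rest of the query."""
    context = set(query.lower().split()) - {word}
    best, best_overlap = None, 0
    for sense in SENSES.get(word, []):
        overlap = len(context & set(sense["gloss"].split()))
        if overlap > best_overlap:
            best, best_overlap = sense, overlap
    return best


def expand_in_context(word, query):
    """Return synonyms only for the sense that the query context supports."""
    sense = pick_sense(word, query)
    return sense["synonyms"] if sense else []


print(expand_in_context("bank", "river bank erosion"))
print(expand_in_context("bank", "bank"))
```

When the rest of the query gives no evidence for any sense, the sketch returns no expansions at all, which avoids the false positives that blind synonym expansion introduces.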


You can look into the existing lucene-search source to see how I used 
wordnet. I think in the end I used it only for very obvious 
stuff (e.g. 11 = eleven, UK = United Kingdom, etc.).


Cheers, r.

On 06/04/12 01:58, Gautham Shankar wrote:

Hello,

Based on the feedback i received i have updated my proposal page.

https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

There is about 20 Hrs for the deadline and any final feedback would be
useful.
I have also submitted the proposal at the GSOC page.

Regards,
Gautham Shankar


[Wikitech-l] GSOC 2012: "Lucene Automatic Query Expansion From Wikipedia Text"

2012-04-06 Thread Gautham Shankar
Hi,

I have addressed the issues in my talk page and added a 'Future Project
Maintenance' section to address maintenance needs.

https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

Kindly let me know if there are any other changes I have to make.

Thank you for your support,

Regards,
Gautham Shankar


Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-05 Thread Gregory Varnum
Also a reminder for folks that this and some other proposals need mentors.

Gautham - thank you for the updated proposal page. I would also solicit 
feedback in our IRC channel if you can, and connect with interested mentors:
https://www.mediawiki.org/wiki/GSOC#Mentor_signup

https://www.mediawiki.org/wiki/MediaWiki_on_IRC

-Greg aka varnent

___
Sent from my iPad. Apologies for any typos. A more detailed response may be 
sent later.

On Apr 5, 2012, at 8:58 PM, Gautham Shankar  
wrote:

> Hello,
> 
> Based on the feedback i received i have updated my proposal page.
> 
> https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc
> 
> There is about 20 Hrs for the deadline and any final feedback would be
> useful.
> I have also submitted the proposal at the GSOC page.
> 
> Regards,
> Gautham Shankar
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l



[Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion From Wikipedia Text

2012-04-05 Thread Gautham Shankar
Hello,

Based on the feedback I received, I have updated my proposal page.

https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

There are about 20 hours until the deadline, and any final feedback would be
useful.
I have also submitted the proposal on the GSOC page.

Regards,
Gautham Shankar


Re: [Wikitech-l] GSOC 2012

2012-04-05 Thread Markus Krötzsch

Dear Sudeep,

Note that the proposal for "Semantic form rules" is proposed and 
mentored by Stephan Gambke, who is a contributor to MW/SMW. I am not 
sure if he is reading this list. I have contacted him now, but if you do 
not hear from him very soon, I suggest focusing on one of the other 
projects that you are interested in.


Markus


On 04/04/12 04:39, Gregory Varnum wrote:

Sudeep,

Thank you for your interest in our GSOC effort this year!!

I'm intrigued by some of your ideas - especially the backwards compatibility 
extension (although I imagine that has more benefit for third-party wikis than 
WMF wikis).

Google's deadline is quickly approaching: April 6th.  I'd suggest going to our 
GSOC page on the MW.org site - https://www.mediawiki.org/wiki/GSOC - reviewing 
the info there, posting your proposal on your Userpage, then posting it on this 
listserv and in our IRC room - https://www.mediawiki.org/wiki/MediaWiki_on_IRC

-greg aka varnent


On Apr 3, 2012, at 2:47 PM, Sudeep Singh  wrote:


Hi,

I am Sudeep. I am a final-year student at the Indian Institute of Technology,
Kharagpur, in the computer science department.

I am interested in applying for the following projects for GSOC 2012:

1. Lucene automatic query expansion from wikipedia text
2. Backwards compatibility extension
3. Semantic form rules
4. Index transcluded text in search

I have a strong background in information retrieval and machine learning. I
have worked previously with Yahoo Research Labs in the area of information
retrieval. We extracted association rules and attribute-value pairs from
web pages using an unsupervised approach.

I have also worked on another project with Yahoo, which involved emotion
detection for YouTube videos based on the users' comments. We used
various ML, statistics and IR techniques to achieve our goal.

Last year I successfully completed GSOC 2011 with OSGEO, and I have good
experience in open source development.

Kindly let me know how I should proceed with my application.

Thanks
regards
Sudeep


Re: [Wikitech-l] GSOC 2012

2012-04-05 Thread Oren Bochman
Hi, we are running out of time.

Thanks for your interest in our project at Wikimedia.
The GSOC proposals should be mostly specified by the student.
Those of you who have not done so should draft proposals and place them at
www.mediawiki.org in your user space, then post a link here, or email me so
we can process them.

1.  I have expanded the requirements of my project ideas a bit. However, I
have left room for your ideas.
There is plenty of similar work published on these subjects. Research it
and refine your proposals with the tools/algorithms you would like to use and
your preferred formats, so that the deliverables will be widely reused.
2.  I am contacting two researchers who have worked on similar projects
to check if they wish to cooperate by contributing code and helping with the
linguistics side of the mentoring.
3.  I can answer specific questions you have about expectations.

To optimally match you with a suitable high-impact project, please let us
know:
*Your development experience: what projects have you done, and where?

Especially, what is your experience with:
*Java and other programming languages
*PHP
*Apache Lucene or Solr
*Natural Language Processing
*Data Mining
*Corpus Linguistics
*WordNet

Since these projects are highly multilingual, please tell us what your
native language is and what other languages you
can use (on a scale from 1, beginner, to 5, near native).






Background,
Ability in programming,


Operation Manager 
E-mail: o...@romai-horizon.com
Mobil: +36 30 866 6706



Római Horizon Kft. 
H-1039 Budapest 
Királyok útja  291. D. ép. fszt. 2.
Tel:   +36 1 492 1492
Fax:  +36 1 266 5529


-Original Message-
From: wikitech-l-boun...@lists.wikimedia.org
[mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Sudeep Singh
Sent: Tuesday, April 03, 2012 8:48 PM
To: wikitech-l@lists.wikimedia.org
Subject: [Wikitech-l] GSOC 2012

Hi,

I am Sudeep. I am a final-year student at the Indian Institute of Technology,
Kharagpur, in the computer science department.

I am interested in applying for the following projects for GSOC 2012:

1. Lucene automatic query expansion from wikipedia text
2. Backwards compatibility extension
3. Semantic form rules
4. Index transcluded text in search

I have a strong background in information retrieval and machine learning. I
have worked previously with Yahoo Research Labs in the area of information
retrieval. We extracted association rules and attribute-value pairs from
web pages using an unsupervised approach.

I have also worked on another project with Yahoo, which involved emotion
detection for YouTube videos based on the users' comments. We used
various ML, statistics and IR techniques to achieve our goal.

Last year I successfully completed GSOC 2011 with OSGEO, and I have good
experience in open source development.

Kindly let me know how I should proceed with my application.

Thanks
regards
Sudeep


[Wikitech-l] GSOC 2012: Mentees interested in the 'Who's been awesome' proposal and mentors still looking for help

2012-04-04 Thread James Alexander
Hey everyone,

I've had an awful lot of interest in the Who's Been Awesome/Get merchandise
to reward the community extension I proposed and we can only really take
one in the end so I wanted to make sure that everyone knew the score and
mentors still looking for help could chime in and let us know.

There have been 8 or 9 people asking about the project and we have 1, almost
full, proposal so far. Part of that has been me being slow in responding,
but if you're interested I encourage you to either submit a proposal soon
or look at other options (or both!) so that we can get as many people in as
possible! If you are still waiting for answers from me or you have other
questions, feel free to shoot me an email; I'll be setting time aside
tomorrow (bed soon) to go through them all.

Other mentors if you're still looking for help please let us know so that
we can get as many of these great candidates as possible!

James

-- 
James Alexander
Manager, Merchandise
Wikimedia Foundation
(415) 839-6885 x6716 @jamesofur


Re: [Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion from Wikipedia Text

2012-04-04 Thread Gregory Varnum
Greetings,

Thank you for putting this proposal together.

I would expand a bit on how you plan to implement this.  The why and what seem 
reasonably clear to me in your proposal, but I'd be curious what others think.

You'll also want to look at the GSOC page on MW.org and in the IRC channel to 
aid your efforts to find an interested mentor.

-greg


On Apr 4, 2012, at 6:25 PM, Gautham Shankar  
wrote:

> Hello,
> 
> I'm Gautham Shankar from India pursuing my 4th year bachelors in computer
> science and engineering.I find the project proposal "Lucene Automatic Query
> Expansion from Wikipedia Text" in GSOC 2012 very interesting and would love
> to work on it.
> 
> i have created a proposal for the idea
> 
> https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc
> 
> I have experience in data mining and have built a recommendation framework
> using the heat diffusion principle which has been tested on the AOL search
> dataset to recommend better queries that can be typed
> for a given input query.It has been implemented in java. Since it is a
> framework it can be used to recommend different types of data. for example
> the same framework can be used to recommend movies as well as music.Im
> currently working on an extension of this project to add social network
> graphs so as to recommend people who have the same interests in movie,
> music etc when a query is typed.
> 
> I have also built a web based product "hive" which is a networking platform
> for members of the power generation industry. The users can share their
> experiences and it is a open forum where members interact with one another
> to effectively run their machines and solve common problems. The product
> has been implemented using PHP, mysql, javascript (inc ajax). Lucene is the
> search engine and phpbb is used for forums.
> 
> it would be very helpful if anyone could give a feedback and guide me in
> improving the proposal.
> 
> Eagerly awaiting a response.
> 
> Regards,
> Gautham Shankar
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




[Wikitech-l] GSOC 2012 : Lucene Automatic Query Expansion from Wikipedia Text

2012-04-04 Thread Gautham Shankar
Hello,

I'm Gautham Shankar from India, in the 4th year of my bachelor's in computer
science and engineering. I find the project proposal "Lucene Automatic Query
Expansion from Wikipedia Text" in GSOC 2012 very interesting and would love
to work on it.

I have created a proposal for the idea:

https://www.mediawiki.org/wiki/User:Gautham_shankar/Gsoc

I have experience in data mining and have built a recommendation framework
using the heat diffusion principle, which has been tested on the AOL search
dataset to recommend better queries for a given input query. It is
implemented in Java. Since it is a framework, it can be used to recommend
different types of data; for example, the same framework can recommend
movies as well as music. I'm currently working on an extension of this
project that adds social network graphs, so as to recommend people who share
the same interests in movies, music, etc. when a query is typed.

I have also built a web-based product, "hive", which is a networking platform
for members of the power generation industry. Users can share their
experiences in an open forum where members interact with one another
to run their machines effectively and solve common problems. The product
is implemented using PHP, MySQL and JavaScript (including Ajax). Lucene is the
search engine and phpBB is used for the forums.

It would be very helpful if anyone could give feedback and guide me in
improving the proposal.

Eagerly awaiting a response.

Regards,
Gautham Shankar


Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools (Oren Bochman) (Amir E. Aharoni)(Gregory Varnum)

2012-04-04 Thread Gregory Varnum
This looks much more in-depth and helpful. I think your best next step is to, 
if you haven't already, connect with potential mentors and indicate who those 
folks are within your proposal.

-Greg
___
Sent from my iPad. Apologies for any typos. A more detailed response may be 
sent later.

On Apr 4, 2012, at 10:31 AM, karthik prasad  wrote:

> Dear Sirs,
> I am grateful for your valuable feedback and suggestions.
> 
> I have updated my proposal based on the inputs given by you. The split-up
> of the deliverables on the ideas page indeed helped me understand the
> requirements more clearly.
> 
> The link to my updated proposal is
> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal
> 
> I request you and everyone to kindly skim through my proposal once again
> and suggest changes/additions.
> I am very excited about this project and working with you; and truth be
> told, 23rd April seems like ages ahead.
> 
> Thanking you,
> Yours sincerely,
> Karthik
> 
> 
>> Date: Wed, 4 Apr 2012 11:49:41 +0200
>> From: "Oren Bochman" 
>> To: "'Wikimedia developers'" 
>> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
>> Message-ID: <007f01cd1248$42ee6f40$c8cb4dc0$@com>
>> Content-Type: text/plain;   charset="utf-8"
>> 
>> You do understand correctly!
>> 
>> The main idea about NLP components is with POS tagger as an example:
>> 
>> 1. a fall back system that does unsupervised POS tagging.
>> 2. the ability to plug in an existing POS tagger as these become
>> available for specific languages.
>> 
>> I would as supervisor would recommend working with 3 languages.
>> English, Hebrew, and the GSOC native language.
>> 
>> If we could get QA from other native speakers we would incorporate them
>> into the workflow.
>> 
>> I think that by using a deletion/reversion based heuristic we may also be
>> able to make a spam corpus to boost the accuracy of the corpuses.
>> 
>> 
>> Operation Manager
>> E-mail: o...@romai-horizon.com
>> Mobil: +36 30 866 6706
>> 
>> 
>> 
>> Római Horizon Kft.
>> H-1039 Budapest
>> Királyok útja  291. D. ép. fszt. 2.
>> Tel:   +36 1 492 1492
>> Fax:  +36 1 266 5529
>> 
>> -Original Message-
>> From: wikitech-l-boun...@lists.wikimedia.org [mailto:
>> wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Amir E. Aharoni
>> Sent: Tuesday, April 03, 2012 10:19 PM
>> To: Wikimedia developers
>> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
>> 
>> 2012/4/3 karthik prasad :
>>> Hello,
>>> I am a GSoC aspirant and have compiled a proposal for one of the
>>> project ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman] I
>>> would sincerely appreciate if you could kindly go through it and
>>> suggest corrections/additions so that I can settle with a coherent
>> proposal.
>>> 
>>> Link to my proposal :
>>> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal
>> 
>> Nice, but why only English?
>> 
>> If i understand the proposal correctly, this project is supposed to be
>> able to work with almost any language with very little effort.
>> 
>> --
>> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
>> http://aharoni.wordpress.com “We're living in pieces, I want to live in
>> peace.” – T. Moore
>> 
>> ___
>> Wikitech-l mailing list
>> Wikitech-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>> 
>> 
>> 
>> 
>> --
>> 
>> 
>> Date: Wed, 4 Apr 2012 12:58:11 +0300
>> From: "Amir E. Aharoni" 
>> To: Wikimedia developers 
>> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
>> Message-ID:
>>   >> 
>> Content-Type: text/plain; charset=UTF-8
>> 
>> 2012/4/4 Oren Bochman :
>>> You do understand correctly!
>>> 
>>> The main idea about NLP components is with POS tagger as an example:
>> 
>> Just to make sure, POS = part of speech, isn't it?
>> 
>> It's one of the most confusing TLAs in computing :)
>> 
>>> If we could get QA from other native speakers we would incorporate them
>> into the workflow.
>> 
>> Good. As long as there is a way to plug other languages and a way for
>> speakers of other languages to contribute QA, i'm very happy.
>> 
>>

Re: [Wikitech-l] GSOC-2012(proposal)Convention Extension

2012-04-04 Thread Gregory Varnum
As a reminder to folks: please remember that students are in need of mentors. 
It's key to their applications. Please designate your interest on the MW.org 
GSOC page to help connect with students. You're also welcome to use this list 
to reach out to folks like Akshay who you may feel are addressing a known 
problem on our sites. :)

https://www.mediawiki.org/wiki/GSOC

-Greg
___
Sent from my iPad. Apologies for any typos. A more detailed response may be 
sent later.

On Apr 4, 2012, at 4:12 PM, akshay chugh  wrote:

> Hi everyone,
> 
> I would like to build a Convention extension as my Google Summer of Code
> project. I have set up a proposal for it; here is the link:
> http://www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal(2012)
> I haven't found a mentor to work with me on this project yet, so if anyone
> feels the need for this extension the way I do, please feel free to
> add feedback to the proposal page, or reply here.
> More information about this extension can be found here:
> http://www.mediawiki.org/wiki/User:Chughakshay16/ConventionExtension
> Thanks,
> Akshay Chugh
> (irc - chughakshay16)
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


[Wikitech-l] GSOC-2012(proposal)Convention Extension

2012-04-04 Thread akshay chugh
Hi everyone,

I would like to build a Convention extension as my Google Summer of Code
project. I have set up a proposal for it; here is the link:
http://www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal(2012)
I haven't found a mentor to work with me on this project yet, so if anyone
feels the need for this extension the way I do, please feel free to
add feedback to the proposal page, or reply here.
More information about this extension can be found here:
http://www.mediawiki.org/wiki/User:Chughakshay16/ConventionExtension
Thanks,
Akshay Chugh
(irc - chughakshay16)


Re: [Wikitech-l] GSoC 2012 - Integration of Flickr and enhancing Geolocation support in Upload Wizard

2012-04-04 Thread drecodeam




On Wednesday 4 April 2012 at 9:18 AM, Gregory Varnum wrote:

> drecodeam,
> 
> This proposal and the other one I just responded to about Flickr seem to have 
> some overlaps. I suspect only one would be accepted - and this one does 
> appear to be more developed at this stage.
> 
> However, similar comments as before - has input been sought from folks 
> administering existing Flickr tools on Commons and Toolserver - also - is the 
> assumption that Flickr has everything ready on their end for us to do this 
> integration a correct one on my part?
> 
Thanks for the reply, Greg. 
I had seen one of these tools before, but have not yet been in touch with the 
developers. I will contact them ASAP (since the deadline is closing 
in) and take their input on the project. 
I have been discussing the project with my mentor, and as far as I can see, 
the Flickr API seems to fulfill the requirements for the project. 
> 
> -greg aka varnent
> 
> 
> 
> On Apr 2, 2012, at 2:31 AM, drecodeam wrote:
> 
> > Hi, 
> > I am Ankur Anand( drecodeam ). I have been interested in working on the 
> > idea of integrating Flickr and enhancing the current Geolocation support in 
> > the Upload Wizard extension. 
> > I have been talking to my mentor Ryan Kaldari, and have drafted my 
> > proposal: http://www.mediawiki.org/wiki/User:Drecodeam/GSoC_2012_Application
> > I know it's close to the final submission date, but it would be 
> > great to have feedback and comments from the community on my proposal. 
> > 
> > Thanks
> > -- 
> > drecodeam
> > 
> > 
> 
> 
> 
> 
> 




Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools (Oren Bochman) (Amir E. Aharoni)(Gregory Varnum)

2012-04-04 Thread karthik prasad
Dear Sirs,
I am grateful for your valuable feedback and suggestions.

I have updated my proposal based on the inputs given by you. The split-up
of the deliverables on the ideas page indeed helped me understand the
requirements more clearly.

The link to my updated proposal is
https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal

I request you and everyone to kindly skim through my proposal once again
and suggest changes/additions.
I am very excited about this project and working with you; and truth be
told, 23rd April seems like ages ahead.

Thanking you,
Yours sincerely,
Karthik


> Date: Wed, 4 Apr 2012 11:49:41 +0200
> From: "Oren Bochman" 
> To: "'Wikimedia developers'" 
> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
> Message-ID: <007f01cd1248$42ee6f40$c8cb4dc0$@com>
> Content-Type: text/plain;   charset="utf-8"
>
> You do understand correctly!
>
> The main idea about NLP components is with POS tagger as an example:
>
> 1. a fall back system that does unsupervised POS tagging.
> 2. the ability to plug in an existing POS tagger as these become
>  available for specific languages.
>
> As supervisor, I would recommend working with 3 languages:
> English, Hebrew, and the GSoC student's native language.
>
> If we could get QA from other native speakers we would incorporate them
> into the workflow.
>
> I think that by using a deletion/reversion based heuristic we may also be
> able to make a spam corpus to boost the accuracy of the corpuses.
>
>
> Operation Manager
> E-mail: o...@romai-horizon.com
> Mobil: +36 30 866 6706
>
>
>
> Római Horizon Kft.
> H-1039 Budapest
> Királyok útja  291. D. ép. fszt. 2.
> Tel:   +36 1 492 1492
> Fax:  +36 1 266 5529
>
> -Original Message-
> From: wikitech-l-boun...@lists.wikimedia.org [mailto:
> wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Amir E. Aharoni
> Sent: Tuesday, April 03, 2012 10:19 PM
> To: Wikimedia developers
> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
>
> 2012/4/3 karthik prasad :
> > Hello,
> > I am a GSoC aspirant and have compiled a proposal for one of the
> > project ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman] I
> > would sincerely appreciate if you could kindly go through it and
> > suggest corrections/additions so that I can settle with a coherent
> proposal.
> >
> > Link to my proposal :
> > https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal
>
> Nice, but why only English?
>
> If i understand the proposal correctly, this project is supposed to be
> able to work with almost any language with very little effort.
>
> --
> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
> http://aharoni.wordpress.com “We're living in pieces, I want to live in
> peace.” – T. Moore
>
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>
>
>
>
> --
>
>
> Date: Wed, 4 Apr 2012 12:58:11 +0300
> From: "Amir E. Aharoni" 
> To: Wikimedia developers 
> Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
> Message-ID:
> >
> Content-Type: text/plain; charset=UTF-8
>
> 2012/4/4 Oren Bochman :
> > You do understand correctly!
> >
> > The main idea about NLP components is with POS tagger as an example:
>
> Just to make sure, POS = part of speech, isn't it?
>
> It's one of the most confusing TLAs in computing :)
>
> > If we could get QA from other native speakers we would incorporate them
> into the workflow.
>
> Good. As long as there is a way to plug other languages and a way for
> speakers of other languages to contribute QA, i'm very happy.
>
> --
> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
> http://aharoni.wordpress.com
> “We're living in pieces,
> I want to live in peace.” – T. Moore
>


Date: Wed, 4 Apr 2012 00:28:29 -0400
From: Gregory Varnum 
To: Wikimedia developers 
Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools
Message-ID: 
Content-Type: text/plain; charset=utf-8

Whoops - I meant that email to be directed to Karthik - although Amir
you're welcome to read it as well.  :)

-greg


On Apr 3, 2012, at 11:24 PM, Gregory Varnum 
wrote:

> Amir,
>
> Thank you for your GSOC proposal!  :)
>
> Between now and Google's submission deadline on April 6th - you are
invited to further modify your proposals.  The GSOC page on MW.org -
https://www.mediawiki.org/wiki/GSOC - and our IRC rooms -
https://www.mediawiki.org/wiki/MediaWiki_

Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-04 Thread Amir E. Aharoni
2012/4/4 Oren Bochman :
> You do understand correctly!
>
> The main idea about NLP components is with POS tagger as an example:

Just to make sure, POS = part of speech, isn't it?

It's one of the most confusing TLAs in computing :)

> If we could get QA from other native speakers we would incorporate them into 
> the workflow.

Good. As long as there is a way to plug other languages and a way for
speakers of other languages to contribute QA, i'm very happy.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬


Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-04 Thread Oren Bochman
You do understand correctly!

The main idea for the NLP components, taking the POS tagger as an example, is:

1. a fallback system that does unsupervised POS tagging.
2. the ability to plug in an existing POS tagger as these become available for 
specific languages.
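
The two-part design above (an unsupervised fallback plus pluggable
language-specific taggers) could be sketched roughly as follows. This is only
an illustration, not code from the proposal: the names `register_tagger`,
`fallback_tagger` and `tag` are hypothetical, and the fallback here is a
trivial placeholder that labels every token `UNK` rather than a real
unsupervised tagger.

```python
from typing import Callable, Dict, List, Tuple

Tagged = List[Tuple[str, str]]          # (token, POS label) pairs
Tagger = Callable[[List[str]], Tagged]  # a tagger maps tokens to tagged tokens

# Hypothetical registry: language code -> language-specific tagger.
_TAGGERS: Dict[str, Tagger] = {}


def register_tagger(lang: str, tagger: Tagger) -> None:
    """Plug in an existing tagger for one language."""
    _TAGGERS[lang] = tagger


def fallback_tagger(tokens: List[str]) -> Tagged:
    """Stand-in for the unsupervised tagger: labels every token 'UNK'."""
    return [(token, "UNK") for token in tokens]


def tag(lang: str, tokens: List[str]) -> Tagged:
    """Use the registered tagger for `lang` if there is one, else fall back."""
    return _TAGGERS.get(lang, fallback_tagger)(tokens)


# Toy English "tagger" used only to show the plug-in path.
register_tagger("en", lambda tokens: [(token, "NOUN") for token in tokens])
```

In this toy setup, `tag("en", ...)` goes through the registered tagger, while a
language with nothing registered (for example `"he"`) falls through to the
placeholder fallback.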

As supervisor, I would recommend working with 3 languages:
English, Hebrew, and the GSoC student's native language.

If we could get QA from other native speakers we would incorporate them into 
the workflow.

I think that by using a deletion/reversion based heuristic we may also be able 
to make a spam corpus to boost the accuracy of the corpora.


Operation Manager 
E-mail: o...@romai-horizon.com
Mobil: +36 30 866 6706



Római Horizon Kft. 
H-1039 Budapest 
Királyok útja  291. D. ép. fszt. 2.
Tel:   +36 1 492 1492
Fax:  +36 1 266 5529

-Original Message-
From: wikitech-l-boun...@lists.wikimedia.org 
[mailto:wikitech-l-boun...@lists.wikimedia.org] On Behalf Of Amir E. Aharoni
Sent: Tuesday, April 03, 2012 10:19 PM
To: Wikimedia developers
Subject: Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012/4/3 karthik prasad :
> Hello,
> I am a GSoC aspirant and have compiled a proposal for one of the 
> project ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman] I 
> would sincerely appreciate if you could kindly go through it and 
> suggest corrections/additions so that I can settle with a coherent proposal.
>
> Link to my proposal :
> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal

Nice, but why only English?

If i understand the proposal correctly, this project is supposed to be able to 
work with almost any language with very little effort.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com 
‪“We're living in pieces, I want to live in peace.” – T. Moore‬


Re: [Wikitech-l] GSOC-2012 proposal(Convention Extension)

2012-04-04 Thread akshay chugh
Hi varnent,

Before setting up this proposal, I had some discussions with ^demon
and saper about the scope and need for such an extension. Not only did they
suggest some features that they would want to see as part of this
extension, but they also pointed out some issues that I would face while
developing it. After reading the links they provided for further
clarity, I narrowed down the scope of this project, keeping in mind the
most important deliverables that need to be met.
I didn't get a chance to talk to other Wikimania folks about the web
forms as such, but they did provide some links showing the
implementation of such features in Wikimania.
These are the implementation links:

   - scholarship template(wikimania 2006) -
   http://wikimania2006.wikimedia.org/wiki/Scholarships/Application
   - scholarships code used for wikimania-
   https://github.com/saper/Scholarship

Discussion with saper -
https://www.mediawiki.org/wiki/User_talk:Chughakshay16/ConventionExtension


Re: [Wikitech-l] GSOC 2012 proposal - Ajax functions for editing, blocking, deleting and checking recent changes

2012-04-03 Thread Gregory Varnum
Excellent - you're welcome to share it here for folks to review.  :)

-greg


On Apr 4, 2012, at 2:35 AM, Marco  wrote:

> 
>> 
>> Looking over your application - I'd suggest fleshing out more details on 
>> your proposed schedule and what exactly the advantages of your proposed 
>> project are.  The mockup helps convey the what - but the how and why are a 
>> bit vague to me.  I think connecting the functional benefits of Ajax could 
>> help.
>> 
> Hello. I've just done that, thanks for the attention.
> 
> 
> 
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




Re: [Wikitech-l] GSOC 2012 proposal - Ajax functions for editing, blocking, deleting and checking recent changes

2012-04-03 Thread Marco




> Looking over your application - I'd suggest fleshing out more details on your 
> proposed schedule and what exactly the advantages of your proposed project are. 
> The mockup helps convey the what - but the how and why are a bit vague to me. 
> I think connecting the functional benefits of Ajax could help.


Hello. I've just done that, thanks for the attention.





Re: [Wikitech-l] GSOC 2012 - Automatic Taxobox Usability and Feature Enhancements

2012-04-03 Thread Gregory Varnum
Hey Connor,

Thank you for posting a proposal on MW.org and for your interest in GSOC.

Anything seeking deployment on Wikipedia is certainly an ambitious plan.  I 
would also consider investigating other WMF projects that may benefit as well.  
Sometimes deploying there first helps build the case for enWP 
deployment - which, given the editor community and the high volume of the site, 
tends to involve some legwork and selling of the idea.  :)

Regarding your GSOC proposal, this is a good place to solicit some feedback - 
you can also try our IRC room:  https://www.mediawiki.org/wiki/MediaWiki_on_IRC

Be sure to fill out the other sections as they'll all be reviewed and 
considered when the deadline (April 6th) comes.

Thanks,
-greg aka varnent


On Apr 4, 2012, at 1:28 AM, Bacon Prime  wrote:

> Hi,
> 
> I'm writing a proposal to improve the usability of the automatic taxobox on
> Wikipedia; specifically, to create a GUI such that all future taxoboxes
> could be created with it, and most current taxoboxes could be edited
> without ever touching the templating syntax. In the optimal case, this
> would obviate the need for the extensive support needed (~10 archived talk
> pages over two years!) and might spark a conversion from the manual taxobox
> to the automatic (which is desirable to the micro-optimizing programmer in
> me) -- in the suboptimal case, such an improvement would still save many
> people many hours of work and even out the learning curve.
> 
> As an ancillary benefit, the wizard-like processes described in the help
> pages could be easily converted into actual wizards, which would also cut
> down on errors (i.e. when steps are applied incorrectly, out of order or
> inappropriately).
> 
> A quick blurb about myself: I'm 19, nearly into my third year of university
> and majoring in Computational Linguistics at Simon Fraser University. Also,
> I'd really, really like to be given the chance to contribute something
> positive to Wikipedia's community!
> 
> Proposal Page 
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




[Wikitech-l] GSOC 2012 - Automatic Taxobox Usability and Feature Enhancements

2012-04-03 Thread Bacon Prime
Hi,

I'm writing a proposal to improve the usability of the automatic taxobox on
Wikipedia; specifically, to create a GUI such that all future taxoboxes
could be created with it, and most current taxoboxes could be edited
without ever touching the templating syntax. In the optimal case, this
would obviate the need for the extensive support needed (~10 archived talk
pages over two years!) and might spark a conversion from the manual taxobox
to the automatic (which is desirable to the micro-optimizing programmer in
me) -- in the suboptimal case, such an improvement would still save many
people many hours of work and even out the learning curve.

As an ancillary benefit, the wizard-like processes described in the help
pages could be easily converted into actual wizards, which would also cut
down on errors (i.e. when steps are applied incorrectly, out of order or
inappropriately).

A quick blurb about myself: I'm 19, nearly into my third year of university
and majoring in Computational Linguistics at Simon Fraser University. Also,
I'd really, really like to be given the chance to contribute something
positive to Wikipedia's community!

Proposal Page 


Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-03 Thread Gregory Varnum
Whoops - I meant that email to be directed to Karthik - although Amir you're 
welcome to read it as well.  :)

-greg


On Apr 3, 2012, at 11:24 PM, Gregory Varnum  wrote:

> Amir,
> 
> Thank you for your GSOC proposal!  :)
> 
> Between now and Google's submission deadline on April 6th - you are invited 
> to further modify your proposals.  The GSOC page on MW.org - 
> https://www.mediawiki.org/wiki/GSOC - and our IRC rooms - 
> https://www.mediawiki.org/wiki/MediaWiki_on_IRC
> 
> Looking over your proposal - I think you've got good background information 
> on yourself.  However, I think you should flesh out more details on the 
> proposed project.  Without more familiarity with corpus (and with no links to 
> find that info) - it's hard for everyone to weigh in equally or to make sure 
> your project gets the full consideration you'd like.
> 
> -greg aka varnent
> 
> 
> On Apr 3, 2012, at 4:18 PM, Amir E. Aharoni  
> wrote:
> 
>> 2012/4/3 karthik prasad :
>>> Hello,
>>> I am a GSoC aspirant and have compiled a proposal for one of the project
>>> ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman]
>>> I would sincerely appreciate if you could kindly go through it and suggest
>>> corrections/additions so that I can settle with a coherent proposal.
>>> 
>>> Link to my proposal :
>>> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal
>> 
>> Nice, but why only English?
>> 
>> If i understand the proposal correctly, this project is supposed to be
>> able to work with almost any language with very little effort.
>> 
>> --
>> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
>> http://aharoni.wordpress.com
>> ‪“We're living in pieces,
>> I want to live in peace.” – T. Moore‬
>> 
>> ___
>> Wikitech-l mailing list
>> Wikitech-l@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
> 



Re: [Wikitech-l] GSoC 2012 - Integration of Flickr and enhancing Geolocation support in Upload Wizard

2012-04-03 Thread Gregory Varnum
drecodeam,

This proposal and the other one I just responded to about Flickr seem to have 
some overlaps.  I suspect only one would be accepted - and this one does appear 
to be more developed at this stage.

However, similar comments as before - has input been sought from folks 
administering existing Flickr tools on Commons and Toolserver - also - is the 
assumption that Flickr has everything ready on their end for us to do this 
integration a correct one on my part?

-greg aka varnent



On Apr 2, 2012, at 2:31 AM, drecodeam  wrote:

> Hi, 
> I am Ankur Anand( drecodeam ). I have been interested in working on the idea 
> of integrating Flickr and enhancing the current Geolocation support in the 
> Upload Wizard extension. 
> I have been talking to my mentor Ryan Kaldari, and have drafted my proposal: 
> http://www.mediawiki.org/wiki/User:Drecodeam/GSoC_2012_Application
> I know it's close to the final submission date, but it would be 
> great to have feedback and comments from the community on my proposal. 
> 
> Thanks
> -- 
> drecodeam
> 
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




Re: [Wikitech-l] GSOC-2012 proposal(Convention Extension)

2012-04-03 Thread Gregory Varnum
Hi there Akshay,

I've talked with a few Wikimania folks - and I think there's some interest in 
this.  My impression is our current system is a series of patch jobs done to 
evolve the original system to our growing needs.  A complete rewrite seems 
fitting, from what I've seen and heard.

I'm curious if you've had a chance to speak with Wikimania folks as well - 
especially the ops folks doing the web forms and such.  I imagine they'd have a 
wealth of opinions on what has worked and what hasn't worked thus far.  I would 
also be curious what some Hackathon and other event organizers might have to 
say.  Ideally a rewrite will cover all the various needs (well - as 
many as reasonably possible..) from these folks so no more patchwork would 
need to be done in the foreseeable future.

Here's the IRC channel for Wikimania and other contact info:
http://wikimania2012.wikimedia.org/wiki/IRC
http://wikimania2012.wikimedia.org/wiki/Organizing_team

-greg aka varnent


On Apr 3, 2012, at 10:17 AM, akshay chugh  wrote:

> Max,
> I have taken your points into consideration, and have made changes to my
> proposal. I have also narrowed down the scope.
> Updated proposal:
> www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal(2012)
>> How are you going to write special pages and API modules before the
>> database access layer?
> Honest mistake :) (corrected that in my project schedule)
> 
> Thanks,
> Akshay Chugh
> (irc- chughakshay16)
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




Re: [Wikitech-l] GSOC 2012

2012-04-03 Thread Gregory Varnum
Sudeep,

Thank you for your interest in our GSOC effort this year!!

I'm intrigued by some of your ideas - especially the backwards compatibility 
extension (although I imagine that has more benefit for third-party wikis than 
WMF wikis).  

Google's deadline is quickly approaching - April 6th.  I'd suggest going to our 
GSOC page on the MW.org site - https://www.mediawiki.org/wiki/GSOC - reviewing 
the info there, posting your proposal on your Userpage, then post it on this 
listserv and in our IRC room - https://www.mediawiki.org/wiki/MediaWiki_on_IRC

-greg aka varnent


On Apr 3, 2012, at 2:47 PM, Sudeep Singh  wrote:

> Hi,
> 
> I am Sudeep, a final-year student at the Indian Institute of Technology,
> Kharagpur, in the computer science department.
> 
> I am interested in applying for the following projects for GSoC 2012:
> 
> 1. Lucene automatic query expansion from wikipedia text
> 2. Backwards compatibility extension
> 3. Semantic form rules
> 4. Index transcluded text in search
> 
> I have a strong background in information retrieval and machine learning. I
> have worked previously with Yahoo Research Labs in the area of information
> retrieval. We extracted association rules and attribute-value pairs from
> webpages using an unsupervised approach.
> 
> I have also worked on another project with Yahoo, which involved emotion
> detection for YouTube videos, based on the users' comments. We used
> various ML, statistics and IR techniques to achieve our goal.
> 
> Last year I successfully completed GSoC 2011 with OSGeo, and have good
> experience in open source development.
> 
> Kindly let me know how shall I proceed with my application.
> 
> Thanks
> regards
> Sudeep
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




Re: [Wikitech-l] GSOC 2012 proposal - Ajax functions for editing, blocking, deleting and checking recent changes

2012-04-03 Thread Gregory Varnum
Marco,

Thank you for putting these together - I think the test wiki helps convey what 
you're proposing.

Do you have a proposal on MediaWiki.org as well?  There's info on doing this 
at:  https://www.mediawiki.org/wiki/GSOC#Student_applications

Looking over your application - I'd suggest fleshing out more details on your 
proposed schedule and what exactly the advantages of your proposed project are. 
 The mockup helps convey the what - but the how and why are a bit vague to me. 
 I think connecting the functional benefits of Ajax could help.

Thank you,
-greg aka varnent


On Apr 3, 2012, at 4:12 PM, Marco  wrote:

> Following my proposal at 
> http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/marco_inacio/1
> 
> 
> I created a test wiki http://wikitest.freehosting.com/ 
>  for you to take a 
> look at the functionalities of the script I intend to adapt, 
> internationalize and use as base for my gsoc project (details at the 
> proposal). It was a quick translation and much of it is still in Portuguese, 
> but I think it's ok for a preview.
> 
> Any considerations would be appreciated.
> 
> 
> 
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l




Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-03 Thread Gregory Varnum
Amir,

Thank you for your GSOC proposal!  :)

Between now and Google's submission deadline on April 6th - you are invited to 
further modify your proposals.  The GSOC page on MW.org - 
https://www.mediawiki.org/wiki/GSOC - and our IRC rooms - 
https://www.mediawiki.org/wiki/MediaWiki_on_IRC

Looking over your proposal - I think you've got good background information on 
yourself.  However, I think you should flesh out more details on the proposed 
project.  Without more familiarity with corpus (and with no links to find that 
info) - it's hard for everyone to weigh in equally or to make sure your project 
gets the full consideration you'd like.

-greg aka varnent


On Apr 3, 2012, at 4:18 PM, Amir E. Aharoni  
wrote:

> 2012/4/3 karthik prasad :
>> Hello,
>> I am a GSoC aspirant and have compiled a proposal for one of the project
>> ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman]
>> I would sincerely appreciate if you could kindly go through it and suggest
>> corrections/additions so that I can settle with a coherent proposal.
>> 
>> Link to my proposal :
>> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal
> 
> Nice, but why only English?
> 
> If i understand the proposal correctly, this project is supposed to be
> able to work with almost any language with very little effort.
> 
> --
> Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
> http://aharoni.wordpress.com
> ‪“We're living in pieces,
> I want to live in peace.” – T. Moore‬
> 
> ___
> Wikitech-l mailing list
> Wikitech-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikitech-l



Re: [Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-03 Thread Amir E. Aharoni
2012/4/3 karthik prasad :
> Hello,
> I am a GSoC aspirant and have compiled a proposal for one of the project
> ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman]
> I would sincerely appreciate if you could kindly go through it and suggest
> corrections/additions so that I can settle with a coherent proposal.
>
> Link to my proposal :
> https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal

Nice, but why only English?

If i understand the proposal correctly, this project is supposed to be
able to work with almost any language with very little effort.

--
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
http://aharoni.wordpress.com
‪“We're living in pieces,
I want to live in peace.” – T. Moore‬


[Wikitech-l] GSOC 2012 proposal - Ajax functions for editing, blocking, deleting and checking recent changes

2012-04-03 Thread Marco
Following up on my proposal at
http://www.google-melange.com/gsoc/proposal/review/google/gsoc2012/marco_inacio/1

I created a test wiki, http://wikitest.freehosting.com/, for you to
take a look at the functionality of the script I intend to adapt,
internationalize and use as the base for my GSoC project (details in the
proposal). It was a quick translation and much of it is still in
Portuguese, but I think it's OK for a preview.

Any comments would be appreciated.





[Wikitech-l] GSoC 2012: Proposal-Wikipedia Corpus Tools

2012-04-03 Thread karthik prasad
Hello,
I am a GSoC aspirant and have compiled a proposal for one of the project
ideas - Wikipedia Corpus Tools. [Mentor : Oren Bochman]
I would sincerely appreciate if you could kindly go through it and suggest
corrections/additions so that I can settle with a coherent proposal.

Link to my proposal :
https://www.mediawiki.org/wiki/User:Karthikprasad/gsoc2012proposal

Thanking you.

Best Regards,
Karthik.


[Wikitech-l] GSOC 2012

2012-04-03 Thread Sudeep Singh
Hi,

I am Sudeep, a final-year student in the computer science department at the
Indian Institute of Technology, Kharagpur.

I am interested in applying for the following GSoC 2012 projects:

1. Lucene automatic query expansion from wikipedia text
2. Backwards compatibility extension
3. Semantic form rules
4. Index transcluded text in search

I have a strong background in information retrieval and machine learning. I
previously worked with Yahoo Research Labs in the area of information
retrieval, where we extracted association rules and attribute-value pairs from
web pages using an unsupervised approach.

I also worked on another project with Yahoo, which involved detecting the
emotion of YouTube videos based on user comments. We used various ML,
statistics and IR techniques to achieve our goal.

Last year I successfully completed GSoC 2011 with OSGeo, and I have good
experience in open source development.

Kindly let me know how I should proceed with my application.

Thanks
regards
Sudeep


Re: [Wikitech-l] GSOC-2012 proposal(Convention Extension)

2012-04-03 Thread akshay chugh
Max,
I have taken your points into consideration and have made changes to my
proposal. I have also narrowed down the scope.
Updated proposal:
www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal(2012)
> How are you going to write special pages and API modules before the
> database access layer?
Honest mistake :) (I have corrected that in my project schedule.)

Thanks,
Akshay Chugh
(irc- chughakshay16)


Re: [Wikitech-l] GSOC-2012 proposal(Convention Extension)

2012-04-03 Thread Max Semenik
On 03.04.2012, 11:51 akshay wrote:

> Hi everyone,

Hi, please see https://meta.wikimedia.org/wiki/Mailing_lists#Using_digests

> I am an engineering student from BITS Pilani (India), currently in my 5th
> and final year. I seek to apply for GSoC this year under MediaWiki. From
> the list of ideas given on the MediaWiki GSoC page, I wish to work on
> building a convention extension which would help convert any wiki into a
> conference-like website such as Wikimania. After discussions over IRC
> channels regarding the features that this extension should possess, and
> some feedback that I got from other developers, I have written a proposal
> for this extension. I would really appreciate any feedback in the short
> time left, as it would help me in setting the right deliverables
> for this project.
> The proposal page -
> http://www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal%282012%29
> The other details about this extension can be found on the following pages:
> 1. implementation details (+UI mockups) -
> http://www.mediawiki.org/wiki/User:Chughakshay16/ConventionExtension
> 2. database details -
> http://www.mediawiki.org/wiki/User:Chughakshay16/databasedetails
> The talk pages for the above can also be used for the feedback.


Some comments on proposal details:

> 1. user profile pages for the attendees
We already have userpages, don't we? ;)

> 4. special badge feature for the attendees
What is it?

> 7. payment gateway - for managing the financial transactions occurring during 
> the conference
Seems overkillish for a summer project.

> 8. some custom tags to design the conference wiki pages.
What is it and why not just use templates?

> 9. poll management - for setting up polls across the conference
There are existing solutions for polls.

How are you going to write special pages and API modules before
the database access layer?

-- 
Best regards,
  Max Semenik ([[User:MaxSem]])




[Wikitech-l] GSOC-2012 proposal(Convention Extension)

2012-04-03 Thread akshay chugh
Hi everyone,

I am an engineering student from BITS Pilani (India), currently in my 5th
and final year. I seek to apply for GSoC this year under MediaWiki. From
the list of ideas given on the MediaWiki GSoC page, I wish to work on
building a convention extension which would help convert any wiki into a
conference-like website such as Wikimania. After discussions over IRC
channels regarding the features that this extension should possess, and
some feedback that I got from other developers, I have written a proposal
for this extension. I would really appreciate any feedback in the short
time left, as it would help me in setting the right deliverables
for this project.
The proposal page -
http://www.mediawiki.org/wiki/User:Chughakshay16/GSOCProposal%282012%29
The other details about this extension can be found on the following pages:
1. implementation details (+UI mockups) -
http://www.mediawiki.org/wiki/User:Chughakshay16/ConventionExtension
2. database details -
http://www.mediawiki.org/wiki/User:Chughakshay16/databasedetails
The talk pages for the above can also be used for the feedback.

Thanks,
Akshay Chugh
(irc - chughakshay16)

On Tue, Apr 3, 2012 at 4:46 AM, wrote:

> Today's Topics:
>
>   1. Re: Time to redirect to https by default? (Ryan Lane)
>   2. Re: Time to redirect to https by default? (Ryan Lane)
>   3. Re: Time to redirect to https by default? (Ryan Lane)
>   4. Re: Time to redirect to https by default? (MZMcBride)
>   5. Re: Committing followups: please no --amend (Chad)
>   6. Re: Time to redirect to https by default? (Platonides)
>   7. Re: Time to redirect to https by default? (Ryan Lane)
>   8. rsync on scap/sync reporting 'no space left on device'for a
>  lot of hosts (Arthur Richards)
>   9. Re: Time to redirect to https by default? (Antoine Musso)
>
>
> --
>
> Message: 1
> Date: Tue, 3 Apr 2012 03:34:13 +0900
> From: Ryan Lane 
> To: Wikimedia developers 
> Subject: Re: [Wikitech-l] Time to redirect to https by default?
> Message-ID:
> >
> Content-Type: text/plain; charset=ISO-8859-1
>
> On Mon, Apr 2, 2012 at 12:33 PM, Tim Starling 
> wrote:
> > On 02/04/12 06:14, Ryan Lane wrote:
> >> TL;DR: we have no plans for anonymous HTTPS by default, but will
> >> eventually default to HTTPS for logged-in users.
> >>
> >> 1. It would require an ssl terminator on every frontend cache. The ssl
> >> terminators eat memory, which is also what the frontend caches do.
> >
> > Once we enable it by default for logged-in users, we will care a lot
> > more if someone tries to take it down with a DoS attack. Unless the
> > redirection can be disabled without actually logging in, a DoS attack
> > on the HTTPS frontend would prevent any authenticated activity.
> >
> > It suggests a need for a robust, overprovisioned service, with tools
> > and procedures in place for identifying and blocking or throttling
> > malicious traffic.
> >
>
> Indeed. We're already pretty over provisioned. We have 4 servers per
> datacenter, each of which is very bored. All they are doing is acting
> as a transparent proxy, after ssl termination. We're using RC4 by
> default (due to BEAST), and AES is also available (the processors we
> are using have AES support).
>
> Ideally we'll be using STS for logged in users. This will mean it's
> impossible to turn off the redirection for users that have already
> logged in for whatever period of time we have STS headers set. We need
> to consider blocking a DoS from the SSL proxies, the LVS servers, or
> the routers.
>
> >> 3. Some countries may completely block HTTPS, but allow HTTP to our
> >> sites so that they can track users. Is it better for us to provide
> >> them content, or protect their privacy?
> >> 4. It's still possible for governments to see that people are going to
> >> wikimedia sites when using HTTPS, so it's still possible to oppress
> >> people for trying to visit sites that are disallowed.
> >
> > It's also possible for governments to snoop on HTTPS communications,
> > by using a private key from a trusted CA to perform a
> > man-in-the-middle attack. Apparently the government of Iran has
> > done this.
> >
>
> We really should publish our certificate fingerprints. An attack like
> this can be detected. An end-user being attacked can see if the
> certificate they are being handed is different from the one we
> advertise. We could also pr

Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining (Oren Bochman)

2012-04-02 Thread karthik prasad
Dear Oren Bochman,

I am very pleased to hear from you.

My familiarity with the requirements, *on a scale of 5*, is as follows:

   1. Java and other programming languages :: *4.5* ... I have taken
   courses on Java, C and C++, and I have used Python extensively in my
   projects. I am very comfortable with the syntax and semantics, and
   understanding different libraries won't be difficult.
   2. PHP :: *3.5* ... I have used PHP in my project and am taking a
   course on it at my university.
   3. Apache Lucene :: *2* ... I was not very familiar with this library
   until recently. However, I am very willing to learn it as soon as
   possible and to be comfortable with it before the coding period starts.
   4. Natural Language Processing :: *4* ... Language processing and data
   are my major interest, and I have done all my projects on NLP. I have
   taken up the NLP course being offered at coursera.org. NLP is also what I
   discuss with my professors at my university.
   5. Computational Linguistics and WordNet :: *4* ... I am using the
   principles of computational linguistics and WordNet in my current
   project, an automatic essay grader. Also, I have chosen Data Mining as an
   elective and am comfortable with the field.


I am looking for some clarifications regarding the proposed ideas:

   1. Regarding the first project :: "a framework for handling different
   languages." ... How exactly should we be looking at 'handling' languages?
   What kind of framework is expected?
   2. Regarding the second project :: "Make a Lucene filter which uses such
   a wordnet to expand search terms." ... Does this project aim at building
   everything from scratch or at revamping the existing code?

My understanding of proposed idea 1 is: "To extract the corpus
from Wikipedia and to apply the deliverables on them." Please correct
me if I am missing something.
Also, I was wondering whether you have a specific approach in mind, or
whether it would be OK if I come up with an approach and describe it in my
proposal.

Some more details regarding my Essay Grader project: the grader does take
care of essay coherence. Spelling and grammar are, as you pointed out,
important, but not too informative when it comes to the "relatedness" of
the essay. The essays are also graded based on structure. We tried to
analyse the statistics of the essay to come up with a measure for grading
essay structure.

I am very excited about this and am eagerly looking forward to hearing from
you.

Thank you.

Best Regards,
Karthik


> Date: Mon, 2 Apr 2012 11:46:21 +0200
> From: "Oren Bochman" 
> To: 
> Subject: Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining
> Message-ID: <017401cd10b5$769f9fb0$63dedf10$@com>
> Content-Type: text/plain;   charset="us-ascii"
>
> Dear Karthik Prasad & other GSoC candidates,
>
> I was not getting this list but I am now.
>
> The GSoC proposal should be specified by the student.
>
> I can expand the details on these projects.
> I can answer specific questions you have about expectations.
>
> To optimally match you with a suitable high-impact project: to what
> extent are you familiar with:
>
> * Java and other programming languages?
> * PHP?
> * Apache Lucene?
> * Natural Language Processing?
> * Corpus Linguistics?
> * WordNet?
>
> The listed projects would be either wrapped as services, or consumed by
> downstream projects, or both.
>
> The corpus is the simplest but requires lots of attention to detail. When
> successful, it would be picked up by lots of researchers and companies who
> do not have the resources for doing such CPU-intensive tasks.
>
> For WMF it would provide us with a standardized body for future NLP work. A
> part-of-speech tagged corpus would be immediately useful for an 80%
> accurate word sense disambiguation in the search engine.
>
> Automatic summaries are not a strategic priority AFAIK:
>
> 1. Most articles provide a kind of abstract in their intro.
> 2. Something like this is already provided in the dumps for Yahoo.
> 3. I have been using a great pop-up preview widget in Wiktionary for a
>    year or so.
>
> I do think it would be a great project for learning how to become a
> MediaWiki developer, but it is small for a GSoC.
> However, I cannot speak for Jeblad and other mentors in cellular and other
> teams who might be interested in this.
>
> If your essay grader is working it could be the basis of another very
> exciting GSoC project aimed at article quality.
>
> A NLP savvy "smart" article quality assessment

Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining

2012-04-02 Thread Oren Bochman
Dear Karthik Prasad & other GSoC candidates,

I was not getting this list but I am now.

The GSoC proposal should be specified by the student.

I can expand the details on these projects.
I can answer specific questions you have about expectations.

To optimally match you with a suitable high-impact project: to what
extent are you familiar with:

* Java and other programming languages?
* PHP?
* Apache Lucene?
* Natural Language Processing?
* Corpus Linguistics?
* WordNet?

The listed projects would be either wrapped as services, or consumed by
downstream projects, or both.

The corpus is the simplest but requires lots of attention to detail. When
successful, it would be picked up by lots of researchers and companies who
do not have the resources for doing such CPU-intensive tasks.

For WMF it would provide us with a standardized body for future NLP work. A
part-of-speech tagged corpus would be immediately useful for an 80%
accurate word sense disambiguation in the search engine.

Automatic summaries are not a strategic priority AFAIK:

1. Most articles provide a kind of abstract in their intro.
2. Something like this is already provided in the dumps for Yahoo.
3. I have been using a great pop-up preview widget in Wiktionary for a
   year or so.

I do think it would be a great project for learning how to become a
MediaWiki developer, but it is small for a GSoC.
However, I cannot speak for Jeblad and other mentors in cellular and other
teams who might be interested in this.

If your essay grader is working it could be the basis of another very
exciting GSoC project aimed at article quality.

A NLP-savvy "smart" article quality assessment service could improve/expand
the current bots grading articles.
Grammar and spelling are two good indicators (features). However, a full
assessment of Wikipedia articles would require more details, both stylistic
and information-based. Once you have covered sufficient features,
building discriminators based on samples of graded articles would require
some data mining ability.

However, since there is an existing bot undergoing upgrades, we would have
to check with its small dev team what it is currently doing.
It would also be subject to community oversight.

Yours sincerely,

Oren Bochman

MediaWiki Search Developer



[Wikitech-l] GSoC 2012 - Integration of Flickr and enhancing Geolocation support in Upload Wizard

2012-04-01 Thread drecodeam
Hi,
I am Ankur Anand (drecodeam). I am interested in working on the idea of
integrating Flickr and enhancing the current Geolocation support in the Upload
Wizard extension.
I have been talking to my mentor Ryan Kaldari, and have drafted my proposal:
http://www.mediawiki.org/wiki/User:Drecodeam/GSoC_2012_Application
I know the final submission date is very close, but it would be
great to have feedback and comments from the community on my proposal.

Thanks
-- 
drecodeam



Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining (John Erling Blad)

2012-03-31 Thread karthik prasad
Thank you very much for your feedback Jeblad.

I will immediately look into how this can best be implemented by extending
the MediaWiki API.
Kindly let me know about my other ideas too, so that I can shape my proposal
well.

The mentor for the ideas I am interested in is Oren Bochman, but I couldn't
find him on IRC.
I would love to interact with him or any other mentor and discuss my ideas
in detail.

I am reachable at
Email  : karthikprasad...@gmail.com
SkypeID  : prasadkarthik
Facebook: facebook.com/prasadkarthik
Google+  : gplus.to/karthikprasad
twitter  : twitter.com/_karthikprasad


> Date: Sat, 31 Mar 2012 12:05:00 +0200
> From: John Erling Blad 
> To: Wikimedia developers 
> Subject: Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining
> Message-ID:
> >
> Content-Type: text/plain; charset=windows-1252
>
> Your point (a) "Implementing a wikiSumarizer widget which will give the
> summary of the page being read by the user" could be extremely useful for
> a hover/help-bubble functionality where bubbles with small explanations
> are created within external articles. Such functionality implies creating an
> extension to the MediaWiki API.
>
> Jeblad
>
> On Sat, Mar 31, 2012 at 11:09 AM, karthik prasad <
> karthikprasad...@gmail.com
> > wrote:
>
> > Hello,
> > I am Karthik from India - currently pursuing 3rd year Bachelors in
> > Computer Science and Engineering in PESIT, Bangalore.
> >
> > I am interested in some of the projects proposed for Google SOC 2012 and
> > would love to work and contribute the same to the open-source world.
> >
> > I am very interested in Text Processing and Data Mining. I have
> > taken a course in Natural Language Processing. I am currently working on
> > a project, "Automatic Essay Grader": a system that automatically grades
> > English essays based on Spelling, Grammar and Structure, Coherence,
> > Frequent phrases and Vocabulary as weighted parameters. It is realized by
> > implementing a self-designed algorithm – studying the ‘relation graph’ of
> > words of the essay.
> >
> > I have also worked on "Sentiment Analysis on Web": extraction of reviews
> > about a gadget from tech-review forums, analysis of the sentiments of the
> > reviews to predict the sentiment/opinion associated with that gadget,
> > and then generation of an appropriate rating on a scale of 10.
> >
> > The following projects mentioned on the mediawiki's ideas page caught my
> > eye:
> > 1) Wikipedia Corpus Tools
> > 2) Lucene Lemma Analyzers based on Morphology Extraction from Wikipedia
> > Text
> > 3) Lucene Automatic Query Expansion from Wikipedia Text
> > 4) Translation spellchecking
> >
> > Apart from the above projects, I also have the following ideas, which I feel
> > would be of great help if implemented:
> > a) Implementing a wikiSumarizer widget which will give the summary of the
> > page being read by the user.
> > b) An automatic coherence analyser which would make it easy to find out if
> > the article on a given page talks about the same topic.
> > c) Details Aggregator for page.
> >
> > I would be grateful if you could kindly let me know about the specific
> > requirements of the projects and about your thoughts on my ideas so that
> > I can suitably write a proposal.
> >
> > Eagerly waiting for your response.
> >
> > Thanking you.
> >
> > Best Regards,
> > Karthik.
> >
>
>


Re: [Wikitech-l] GSOC 2012 - Text Processing and Data Mining

2012-03-31 Thread John Erling Blad
Your point (a) "Implementing a wikiSumarizer widget which will give the
summary of the page being read by the user" could be extremely useful for
a hover/help-bubble functionality where bubbles with small explanations
are created within external articles. Such functionality implies creating an
extension to the MediaWiki API.

Jeblad

On Sat, Mar 31, 2012 at 11:09 AM, karthik prasad  wrote:

> Hello,
> I am Karthik from India - currently pursuing 3rd year Bachelors in Computer
> Science and Engineering in PESIT, Bangalore.
>
> I am interested in some of the projects proposed for Google SOC 2012 and
> would love to work and contribute the same to the open-source world.
>
> I am very interested in Text Processing and Data Mining. I have
> taken a course in Natural Language Processing. I am currently working on
> a project, "Automatic Essay Grader": a system that automatically grades
> English essays based on Spelling, Grammar and Structure, Coherence,
> Frequent phrases and Vocabulary as weighted parameters. It is realized by
> implementing a self-designed algorithm – studying the ‘relation graph’ of
> words of the essay.
>
> I have also worked on "Sentiment Analysis on Web": extraction of reviews
> about a gadget from tech-review forums, analysis of the sentiments of the
> reviews to predict the sentiment/opinion associated with that gadget,
> and then generation of an appropriate rating on a scale of 10.
>
> The following projects mentioned on the mediawiki's ideas page caught my
> eye:
> 1) Wikipedia Corpus Tools
> 2) Lucene Lemma Analyzers based on Morphology Extraction from Wikipedia
> Text
> 3) Lucene Automatic Query Expansion from Wikipedia Text
> 4) Translation spellchecking
>
> Apart from the above projects, I also have the following ideas, which I feel
> would be of great help if implemented:
> a) Implementing a wikiSumarizer widget which will give the summary of the
> page being read by the user.
> b) An automatic coherence analyser which would make it easy to find out if
> the article on a given page talks about the same topic.
> c) Details Aggregator for page.
>
> I would be grateful if you could kindly let me know about the specific
> requirements of the projects and about your thoughts on my ideas so that I
> can suitably write a proposal.
>
> Eagerly waiting for your response.
>
> Thanking you.
>
> Best Regards,
> Karthik.
>


[Wikitech-l] GSOC 2012 - Text Processing and Data Mining

2012-03-31 Thread karthik prasad
Hello,
I am Karthik from India - currently pursuing 3rd year Bachelors in Computer
Science and Engineering in PESIT, Bangalore.

I am interested in some of the projects proposed for Google SOC 2012 and
would love to work and contribute the same to the open-source world.

I am very interested in Text Processing and Data Mining. I have
taken a course in Natural Language Processing. I am currently working on
a project, "Automatic Essay Grader": a system that automatically grades
English essays based on Spelling, Grammar and Structure, Coherence,
Frequent phrases and Vocabulary as weighted parameters. It is realized by
implementing a self-designed algorithm – studying the ‘relation graph’ of
words of the essay.

I have also worked on "Sentiment Analysis on Web": extraction of reviews
about a gadget from tech-review forums, analysis of the sentiments of the
reviews to predict the sentiment/opinion associated with that gadget,
and then generation of an appropriate rating on a scale of 10.

The following projects mentioned on the mediawiki's ideas page caught my
eye:
1) Wikipedia Corpus Tools
2) Lucene Lemma Analyzers based on Morphology Extraction from Wikipedia Text
3) Lucene Automatic Query Expansion from Wikipedia Text
4) Translation spellchecking

Apart from the above projects, I also have the following ideas, which I feel
would be of great help if implemented:
a) Implementing a wikiSumarizer widget which will give the summary of the
page being read by the user.
b) An automatic coherence analyser which would make it easy to find out if
the article on a given page talks about the same topic.
c) Details Aggregator for page.

I would be grateful if you could kindly let me know about the specific
requirements of the projects and about your thoughts on my ideas so that I
can suitably write a proposal.

Eagerly waiting for your response.

Thanking you.

Best Regards,
Karthik.


[Wikitech-l] [GSoC 2012] Application: Realtime Collaboration on Visual Editor

2012-03-16 Thread Ashish Dubey
Hi Everyone,

So with respect to my GSoC proposal earlier [1], I've written an
application draft under my user page [2].

The scope of the project is now much better defined. I would be really
thankful if I could get some feedback.

Thank you.

Links -
[1] -
http://lists.wikimedia.org/pipermail/wikitech-l/2012-February/058254.html
[2] - http://www.mediawiki.org/wiki/User:Dash1291/GSoC_2012_Application

-- 
Ashish Dubey


[Wikitech-l] GSOC-2012 Proposal(Writing a convention extension for mediawiki)

2012-02-23 Thread akshay chugh
Hi all,

I am an undergraduate student, currently pursuing my integrated degree
(M.Sc. Chemistry and B.Tech Electronics & Instrumentation) at BITS
Pilani (India). I am looking forward to applying for GSoC this year under
MediaWiki. From the list of projects/ideas stated on the MediaWiki GSoC
page, I wish to work on the idea of developing a convention extension which
could be plugged into a wiki running the MediaWiki software. The current
requirement is an extension which would help convert any wiki
into a suitable website for a conference, such as Wikimania.

Wikimania currently offers these features:
   - a separate main page dedicated to listing the conference details
   - features like registration, submission and talk (voting)
   - event management features
   - speaker descriptions
   - session details
   - schedule info
   - ticketing feature (payment gateway)

Wikimania is a site which is hand-tailored for this purpose, so to get a
better understanding of how such conference management software works I
looked into a couple more available on the web, such as
OpenConferenceWare (Ruby on Rails) [1] and wisconDB (Perl) [2].
After looking into the above packages and Wikimania, I have
come up with a list of tasks that should be performed by this extension:
1. integration of a new set of preferences in the preferences menu
(specially dedicated to setting up conference-like features, or even
a separate menu page just to avoid the clutter)
2. creation of template pages for registration, submission and voting, just
like we already have in Wikimania, but instead of creating them by hand this
extension would create them for the admin (this feature would work once
the conference feature is enabled in the first step stated above)
3. creation of separate database tables which would solely handle this
added functionality (just like wisconDB [2])
4. creation of new templates (magic words) that one could use in other
pages as well (just to show some relevant information regarding the
conference or event organized by the admin user)
- an example of such a template already exists on Wikimania [3]
I also had a discussion with ^demon on IRC regarding the approach that one
should take for building such an extension. He gave me some more ideas
that he had thought about for this project and suggested some more features,
such as:
5. creation of special badges
6. an export feature for all the user information, which could be available in
CSV or some other format
I just wanted to provide an introduction to this project, which is why I may
not have provided a detailed description of each point written above.
Further information can be found on this page:
https://meta.wikimedia.org/wiki/ConventionExtension<
https://meta.wikimedia.org/wiki/Books>
which I will be using for further development and plans on this project.
Any sort of feedback would be appreciated.

Extra Info :
[1] https://github.com/igal/openconferenceware
[2] https://code.google.com/p/wiscondb/
[3] https://wikimania2012.wikimedia.org/wiki/Main_Page

Thanks,
Akshay Chugh

On Fri, Feb 24, 2012 at 3:11 AM, wrote:

> Today's Topics:
>
>   1. Re: Bump of minimum required PHP version to 5.3 for   MediaWiki
>  1.20 (Antoine Musso)
>   2. Re: Bump of minimum required PHP version to 5.3 for MediaWiki
>  1.20 (Trevor Parscal)
>   3. Re: Bump of minimum required PHP version to 5.3 for MediaWiki
>  1.20 (Arthur Richards)
>   4. Re: Please Welcome Christian Aistleitner to Technical
>  Operations (Ben Hartshorne)
>   5. Re: Bump of minimum required PHP version to 5.3 for MediaWiki
>  1.20 (Chad)
>   6. Re: Please Welcome Christian Aistleitner to Technical
>  Operations (Manuel Schneider)
>   7. Re: Bump of minimum required PHP version to 5.3 for MediaWiki
>  1.20 (Patrick Reilly)
>   8. Re: Bump of minimum required PHP version to 5.3 for MediaWiki
>  1.20 (Chad)
>   9. Re: Caching of pages with time sensitive magic words (Platonides)
>  10. Re: Git + Gerrit is a toughy (Antoine Musso)
>
>
> --
>
> Message: 1
> Date: Thu, 23 Feb 2012 21:56:29 +0100
> From: Antoine Musso 
> To: wikitech-l@lists.wikimedia.org
> Subject: Re: [Wikitech-l] Bump of minimum required PHP version to 5.3
>for MediaWiki 1.20
> Message-ID: 
> Content-Type: text/plain; charset=ISO-8859-1; format=flowed
>
> Le 2

Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-21 Thread Shivansh Srivastava
Can't the same be done by allowing users to log in only through HTTPS, or,
if they are on HTTP, by redirecting them to HTTPS? Then the script would
only work when the user is on a secure (HTTPS) connection.

On 22 February 2012 01:24, Roan Kattouw  wrote:

> On Mon, Feb 13, 2012 at 5:28 PM, Daniel Friesen
>  wrote:
> > The idea that login is secure because it's on a separate page than the
> rest
> > of the site is actually an old mistake.
> > If a script is included ANYWHERE on the site on the same domain then it's
> > possible to inject in some code that will fake pageviews in a way that
> will
> > let an attacker have a running script when the user follows the login
> link
> > to the login page.
> > So there isn't really any security advantage of a separate login page
> over
> > an ajax login. (well ;) unless you're using the separate login page
> because
> > you have js disabled, then you're safe, heh)
> >
> Basically what the issue was is that if you're on an unencrypted HTTP
> pageview, you cannot trust the login form that gets AJAXed in, even if
> it submits to HTTPS. If the login form is transferred over HTTP (or
> the script that loads the login form is transferred over HTTP, or if
> *anything* comes on HTTP), it's not secure.
>
> Roan
>



-- 
Shivansh Srivastava | +91-955-243-5407 |
http://in.linkedin.com/pub/shivansh-srivastava/17/a50/b18
 Secretary, BITS Alumni Affairs Division
| Web Expert, Newsletter, BITSAA International
3rd Year Undergraduate | B.E. (Hons.) - Electronics & Instrumentation
BITS-Pilani.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-21 Thread Roan Kattouw
On Mon, Feb 13, 2012 at 5:28 PM, Daniel Friesen
 wrote:
> The idea that login is secure because it's on a separate page than the rest
> of the site is actually an old mistake.
> If a script is included ANYWHERE on the site on the same domain then it's
> possible to inject in some code that will fake pageviews in a way that will
> let an attacker have a running script when the user follows the login link
> to the login page.
> So there isn't really any security advantage of a separate login page over
> an ajax login. (well ;) unless you're using the separate login page because
> you have js disabled, then you're safe, heh)
>
Basically what the issue was is that if you're on an unencrypted HTTP
pageview, you cannot trust the login form that gets AJAXed in, even if
it submits to HTTPS. If the login form is transferred over HTTP (or
the script that loads the login form is transferred over HTTP, or if
*anything* comes on HTTP), it's not secure.
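
As a rough illustration of the point above, the rule can be sketched as a
check that treats a login form as trustworthy only when the page itself and
every resource it loads arrive over HTTPS. The function name and its inputs
are hypothetical and are not part of MediaWiki.

```python
from urllib.parse import urlparse


def login_form_is_trustworthy(page_url, resource_urls):
    """Return True only if the page and every loaded resource use HTTPS.

    Hypothetical helper for illustration; not a MediaWiki API.
    """
    all_urls = [page_url, *resource_urls]
    # A single plain-HTTP URL is enough to make the whole page untrustworthy.
    return all(urlparse(url).scheme == "https" for url in all_urls)
```

A form that submits to HTTPS still fails this check if the page or any
script that builds the form was itself fetched over plain HTTP.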

Roan



Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-21 Thread Trevor Parscal
I would be willing to mentor this project pending some further review so
that we could scope it correctly.

- Trevor

On Sun, Feb 19, 2012 at 10:38 AM, Ashish Dubey wrote:

> Hi Platonides
>
> You're right on the fact that its ambitious. And this is the reason I've
> started working early on this. The key part in the project to me would be
> to consolidate the concurrent transactions and avoid conflicts. I've been
> reading considerably on measures to do so, perhaps working to model basic
> concurrency control. Given, that I'm able to do so now, which I hope I will
> be, there should not be much problem prototyping the real thing.
>
> On Mon, Feb 20, 2012 at 12:04 AM, Platonides  wrote:
>
> > Are you confident you would be able to complete it in time?
> > Looks too ambitious to me.
>
>
>
> --
> Ashish Dubey


Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread Ashish Dubey
Hi Platonides

You're right that it's ambitious, and this is the reason I've started
working on it early. The key part of the project, to me, would be to
consolidate the concurrent transactions and avoid conflicts. I've been
reading a good deal about ways to do so, perhaps by first modelling basic
concurrency control. Provided I'm able to do that now, which I hope I will
be, there should not be much problem prototyping the real thing.

On Mon, Feb 20, 2012 at 12:04 AM, Platonides  wrote:

> Are you confident you would be able to complete it in time?
> Looks too ambitious to me.



-- 
Ashish Dubey


Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread Platonides
Are you confident you would be able to complete it in time?
Looks too ambitious to me.






Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread John Du Hart
No, that's not the answer.
On Feb 19, 2012 8:22 AM, "Thomas Gries"  wrote:

> Am 19.02.2012 14:20, schrieb Ashish Dubey:
> > Hi Everyone
> >
> > The idea of realtime collaboration,
>
>
> use Etherpad Lite
> See https://www.mediawiki.org/wiki/Extension:EtherpadLite
>
>
>


Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread Ashish Dubey
Hi Thomas, thanks for the reply.

Great work with Etherpad Lite. It can be used to bring realtime editing to
wiki pages now, but I think building realtime collaboration into the
Visual Editor project has its own set of advantages. As the Visual Editor
evolves, it will support more features of a wiki page, more input methods,
and language support. If realtime collaboration is built as a part of
Visual Editor, its features can be used directly while collaborating;
otherwise, Etherpad forks would need patching every now and then for any
feature that is very much Wikimedia-ish.

On Sun, Feb 19, 2012 at 6:52 PM, Thomas Gries  wrote:

> Am 19.02.2012 14:20, schrieb Ashish Dubey:
> > Hi Everyone
> >
> > The idea of realtime collaboration,
>
>
> use Etherpad Lite
> See https://www.mediawiki.org/wiki/Extension:EtherpadLite
>
>
>



-- 
Ashish Dubey


Re: [Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread Thomas Gries
Am 19.02.2012 14:20, schrieb Ashish Dubey:
> Hi Everyone
>
> The idea of realtime collaboration, 


use Etherpad Lite
See https://www.mediawiki.org/wiki/Extension:EtherpadLite





[Wikitech-l] [GSoC 2012] Proposal - Realtime Collaboration on Visual Editor

2012-02-19 Thread Ashish Dubey
Hi Everyone

The idea of realtime collaboration, and the thought of it being implemented
on MediaWiki-based sites, has amazed me ever since I read about it in the
strategic proposal about realtime collaboration here [1].
I really want to work on it as my GSoC project, and several fruitful
discussions with devs on IRC have got me all the more excited about it.

Given the way Visual Editor has been fundamentally constructed, it should
effectively support building realtime collaboration on top of it. The
combination of its transactions and annotations provides a structure that
is similar to what is implemented in Google Wave. The Operational
Transformation and composition concepts used by Google Wave can be put
into action on Visual Editor to consolidate concurrent operations and
prevent conflicts.

The project should include implementing a collaboration server (built on
Node.js), which would serve the wiki page content to the connected clients
in the form of a DOM structure, receive the editor transactions from the
clients, transform them against the converging document state, and
broadcast them to the other clients. A client adapter would also be built,
which would hook into the editor's transaction system and transmit the
transactions to the server.
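
To make the receive/transform/broadcast loop described above more concrete,
here is a minimal sketch of operational transformation. It is written in
Python rather than Node.js for brevity, and it is not taken from the
prototype. It assumes plain-text insert operations only; `Insert`,
`transform`, `apply_op` and `CollaborationServer` are hypothetical names,
and real Visual Editor transactions are richer than this.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Insert:
    """Hypothetical stand-in for an editor transaction: insert text at an offset."""
    pos: int
    text: str
    client: int  # used only to break ties deterministically


def transform(op, against):
    """Shift `op` so it can be applied after `against` has already been applied."""
    lands_before = against.pos < op.pos or (
        against.pos == op.pos and against.client < op.client
    )
    if lands_before:
        return Insert(op.pos + len(against.text), op.text, op.client)
    return op


def apply_op(doc, op):
    """Apply an insert operation to a plain-text document."""
    return doc[:op.pos] + op.text + doc[op.pos:]


class CollaborationServer:
    """Keeps the converging document state and a history of applied operations."""

    def __init__(self, doc):
        self.doc = doc
        self.history = []  # transformed ops, in the order the server applied them

    def receive(self, op, base_revision):
        # Transform against every operation the sending client had not seen yet.
        for applied in self.history[base_revision:]:
            op = transform(op, applied)
        self.doc = apply_op(self.doc, op)
        self.history.append(op)
        return op  # this is what would be broadcast to the other clients
```

In this sketch, two clients that both edit revision 0 of the same text end
up with the same document regardless of which operation the server receives
first, which is the property the concurrency control has to guarantee.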

With the help of suggestions by Trevor and Gabriel, I've been able to start
some work on this idea. I've wireframed a raw collaboration server and the
client adapter, which works by naively communicating the transactions among
the clients via the server. I'm now working on implementing some sort of
concurrency control on the transactions. All my experimentation work lives
in my GitHub repository[2].

Inviting feedback. Thank you.

Links:
[1] -
http://strategy.wikimedia.org/wiki/Proposal:Collaborative_Google_Wave-based_Wikipedia_editing
[2] - https://github.com/dash1291/VE-collaboration

More:
http://www.mediawiki.org/wiki/Future/Real-time_collaboration
http://www.mediawiki.org/wiki/Visual_editor/Realtime_collaboration

-- 
Ashish Dubey


Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-17 Thread Jeremias Nuñez
Thanks for all your replies!

Taking into account what you all have said, and after reviewing past
years' projects as Sumana suggested, I realized that maybe what was
stopping me from having a clear understanding of the idea and how to sort
it out was actually its overly wide scope.

Looking at other years' projects made me think about how important it is to
define your project's reach well, and how it will interact with other
components. There is also what Platonides says about everyone
wanting to have "this" or "that" option able to be set globally across
wikis, which is a never-ending discussion.

So I guess I'll be looking into something else for the time being.
Thanks a million for the great feedback.

Jeremias.

2012/2/16 Platonides 

> On 16/02/12 14:11, Jeremias Nuñez wrote:
> > When you say "extensive consultation" are you referring to the UI or to
> > which preferences could be made global? I think we talked about this on
> IRC
> > too but I don't quite remember.
> >
> > I'd also appreciate if you could point me where I could start "diving
> in",
> > so I can get a sense of how things were done.
> >
> > Thanks again.
>
> The problem is the social part. For instance, some people will want
> their skin to be global, while others will moan because they "need" a
> different one between two wikis.
>
> It should be much easier with the "new" (3 yo) preferences system.
> Conceptually, I think the changes needed to support it are easy: instead
> of using the user preference or the wiki default if not available you
> would use the user preference, then the user global preference, and
> finally the wiki default.
>
> The challenge will be at providing a suitable interface for preferences,
> setting explicit preferences (currently we just remove the user
> preference if the user chooses the default), switching between global
> and local preferences (I'd make the page for the global ones the
> default), etc.
>
>


Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Platonides
On 16/02/12 14:11, Jeremias Nuñez wrote:
> When you say "extensive consultation" are you referring to the UI or to
> which preferences could be made global? I think we talked about this on IRC
> too but I don't quite remember.
> 
> I'd also appreciate if you could point me where I could start "diving in",
> so I can get a sense of how things were done.
> 
> Thanks again.

The problem is the social part. For instance, some people will want
their skin to be global, while others will moan because they "need" a
different one between two wikis.

It should be much easier with the "new" (3-year-old) preferences system.
Conceptually, I think the changes needed to support it are easy: instead
of using the user preference, or the wiki default if that is not
available, you would use the user preference, then the user global
preference, and finally the wiki default.
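
The lookup order described above can be sketched as a simple fallback chain.
This is a Python illustration only, not MediaWiki code: the function name
and the three dictionaries are assumptions made for the example.

```python
from collections import ChainMap


def resolve_preference(name, local_prefs, global_prefs, wiki_defaults):
    """Look up a preference: local user value, then global value, then wiki default.

    Hypothetical helper; the three mappings stand in for wherever the
    values are really stored.
    """
    # ChainMap searches its mappings in order and returns the first match.
    return ChainMap(local_prefs, global_prefs, wiki_defaults)[name]
```

For example, with a wiki default skin of "vector" and a global preference
of "monobook", a user with no local setting gets "monobook", while a local
setting of "modern" on one wiki overrides the global value there.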

The challenge will be in providing a suitable interface for preferences,
setting explicit preferences (currently we just remove the user
preference if the user chooses the default), switching between global
and local preferences (I'd make the page for the global ones the
default), etc.
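
The point about explicit preferences can be illustrated with a small Python
sketch. It assumes preferences are stored as plain dictionaries, and all
names in it are hypothetical. If a value equal to the wiki default is simply
not stored, a global preference silently overrides a user who picked the
default on one wiki.

```python
from collections import ChainMap

WIKI_DEFAULTS = {"skin": "vector"}  # assumed default, for illustration only


def save_dropping_defaults(local_prefs, name, value):
    """Behaviour described above: a value equal to the wiki default is not stored."""
    if value == WIKI_DEFAULTS[name]:
        local_prefs.pop(name, None)
    else:
        local_prefs[name] = value


def save_explicit(local_prefs, name, value):
    """Alternative: always store the user's choice, even if it equals the default."""
    local_prefs[name] = value


def effective(name, local_prefs, global_prefs):
    """Resolve local value, then global value, then wiki default."""
    return ChainMap(local_prefs, global_prefs, WIKI_DEFAULTS)[name]
```

With a global skin of "monobook", saving "vector" locally through
`save_dropping_defaults` still resolves to "monobook", whereas
`save_explicit` keeps the local choice.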




Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Platonides
On 16/02/12 17:44, Mark A. Hershberger wrote:
>> 3 - Would it be too big of a project for a single person to take on?
>> (given the amount of time to actually implement it for GSoC)
> 
> If nothing else, I'd like signatures copied.  That should be simple
> enough, but I don't know enough about how the emails are copied.  It
> seems like this could at least be a start to create a method of copying
> preferences if not doing the whole thing.

The email belongs to the SUL account.
In fact, it's not a preference, as it lives in the user table for normal
installs.




Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Robin Pepermans
I am planning to participate in GSoC, and was looking for a project.
Global preferences also came to my mind, and now I see this thread. So
if Jeremias doesn't take this project, I will consider doing it.

2012/2/16 Sumana Harihareswara :
> On 02/15/2012 04:39 PM, Andrew Garrett wrote:
>> On Wed, Feb 15, 2012 at 4:29 AM, Jeremias Nunez wrote:
>>
>>>
>>> Hey everyone,
>>>
>>> Reading around MW and looking into some ideas for GSoC I ran into this
>>> bug, https://bugzilla.wikimedia.org/show_bug.cgi?id=14950, which I
>>> noticed (correct me if i'm wrong) isn't currently being worked on, and as
>>> Roan Kattouw said on one of the last comments, hasn't ever been written yet
>>> (i know it's kind of old, but anyway).
>>>
>>> I wanted to get some feedback on the next couple of things:
>>>
>>> 1 - Is this a feature that could actually be needed/helpful/useful?
>>> 2 - Is it true that it's currently not being worked on?
>>> 3 - Would it be too big of a project for a single person to take on?
>>> (given the amount of time to actually implement it for GSoC)
>>>
>>
>> We discussed this on IRC. A quick recap:
>>
>> I tried to implement this when I completely refactored the preferences
>> system in 2009. It was eventually reverted in
>> r49932.
>> The main blocker was basically considering a way to decide *which*
>> preferences would have their values synchronised. A UI would need to be
>> developed for that and you'd need some extensive consultation on that fact.
>>
>> If you were to implement this, you could potentially use my original
>> implementation as a guide, though it is reasonably "in the guts" of
>> MediaWiki so you'd have to be reasonably confident "code diving" into
>> unfamiliar software packages.
>>
>> I'll defer to Sumana on whether or not the project is of an appropriate
>> size for GSoC.
>>
>> —Andrew
>
> Jeremias:
>
> First, some general advice: look at previous people's experiences.  Past
> projects:
>
> http://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects
>
> And, just as a reminder: You're trying to budget three months of working
> about 40 hours per week, plus some ramping-up time in the spring and
> whatever you can fit around your other commitments between now and late
> April.  The timeline:
>
> http://www.google-melange.com/gsoc/events/google/gsoc2012
>
> Now, specifically about Global User Preferences.  From what Andrew has
> said about this, it's a task that would just be a few hundred lines of
> code, perhaps two weeks' worth of work, for a skilled MediaWiki
> developer with years of experience, comprehensive designs and
> specifications.  I imagine that if you either had the UI specification
> OR years of experience, but not both, the work would take somewhat
> longer.  But you don't have either, so maybe you should spend a few days
> investigating the issue to figure out how big and unwieldy it is, on a
> code level and on an interaction design level.  Then you can think about
> what the majority of the work would be, and perhaps consider whether you
> want to do THIS as a project, or whether it gives you an idea for a
> related project that is less unwieldy.
>
> I asked Andrew for examples of past MediaWiki GSoC projects that were
> well-scoped, and he suggested Visual Diff, Image Rendering Daemon, and
> the SocialProfile work.  So you could look at those.
>
> Looking forward to continuing our conversation as you learn more and
> think about what a well-scoped project would be.
>
> You wrote:
>> When you say "extensive consultation" are you referring to the UI or to
>> which preferences could be made global? I think we talked about this on IRC
>> too but I don't quite remember.
>
> It will be worth your time to install and learn a real IRC client, such
> as xchat, that saves IRC logs.  You can also read IRC channel logs at
> http://toolserver.org/~mwbot/ for #mediawiki and
> http://prototype.wikimedia.org/logs/%23wikimedia-dev/ for #wikimedia-dev.
>
> --
> Sumana Harihareswara
> Volunteer Development Coordinator
> Wikimedia Foundation
>

Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Sumana Harihareswara
On 02/15/2012 04:39 PM, Andrew Garrett wrote:
> On Wed, Feb 15, 2012 at 4:29 AM, Jeremias Nunez wrote:
> 
>>
>> Hey everyone,
>>
>> Reading around MW and looking into some ideas for GSoC I ran into this
>> bug, https://bugzilla.wikimedia.org/show_bug.cgi?id=14950, which I
>> noticed (correct me if i'm wrong) isn't currently being worked on, and as
>> Roan Kattouw said on one of the last comments, hasn't ever been written yet
>> (i know it's kind of old, but anyway).
>>
>> I wanted to get some feedback on the next couple of things:
>>
>> 1 - Is this a feature that could actually be needed/helpful/useful?
>> 2 - Is it true that it's currently not being worked on?
>> 3 - Would it be too big of a project for a single person to take on?
>> (given the amount of time to actually implement it for GSoC)
>>
> 
> We discussed this on IRC. A quick recap:
> 
> I tried to implement this when I completely refactored the preferences
> system in 2009. It was eventually reverted in
> r49932.
> The main blocker was basically considering a way to decide *which*
> preferences would have their values synchronised. A UI would need to be
> developed for that and you'd need some extensive consultation on that fact.
> 
> If you were to implement this, you could potentially use my original
> implementation as a guide, though it is reasonably "in the guts" of
> MediaWiki so you'd have to be reasonably confident "code diving" into
> unfamiliar software packages.
> 
> I'll defer to Sumana on whether or not the project is of an appropriate
> size for GSoC.
> 
> —Andrew

Jeremias:

First, some general advice: look at previous people's experiences.  Past
projects:

http://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects

And, just as a reminder: You're trying to budget three months of working
about 40 hours per week, plus some ramping-up time in the spring and
whatever you can fit around your other commitments between now and late
April.  The timeline:

http://www.google-melange.com/gsoc/events/google/gsoc2012

Now, specifically about Global User Preferences.  From what Andrew has
said about this, it's a task that would just be a few hundred lines of
code, perhaps two weeks' worth of work, for a skilled MediaWiki
developer with years of experience, comprehensive designs and
specifications.  I imagine that if you either had the UI specification
OR years of experience, but not both, the work would take somewhat
longer.  But you don't have either, so maybe you should spend a few days
investigating the issue to figure out how big and unwieldy it is, on a
code level and on an interaction design level.  Then you can think about
what the majority of the work would be, and perhaps consider whether you
want to do THIS as a project, or whether it gives you an idea for a
related project that is less unwieldy.

I asked Andrew for examples of past MediaWiki GSoC projects that were
well-scoped, and he suggested Visual Diff, Image Rendering Daemon, and
the SocialProfile work.  So you could look at those.

Looking forward to continuing our conversation as you learn more and
think about what a well-scoped project would be.

You wrote:
> When you say "extensive consultation" are you referring to the UI or to
> which preferences could be made global? I think we talked about this on IRC
> too but I don't quite remember.

It will be worth your time to install and learn a real IRC client, such
as xchat, that saves IRC logs.  You can also read IRC channel logs at
http://toolserver.org/~mwbot/ for #mediawiki and
http://prototype.wikimedia.org/logs/%23wikimedia-dev/ for #wikimedia-dev.

-- 
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-16 Thread Shivansh Srivastava
Hi All,

I replied to most of the points, mentioning the prudence of each idea.

Could I get some assessment on a couple of things:

1. Are these ideas & features needed or useful?
2. I reckon that, except for jQuery with AJAX (which I will clarify on IRC),
Voice Search & HTML5, the other points have been remarked upon. It would be
very kind if you could let me know whether a bunch of these projects can be
done as part of a single project.
3. Kindly also let me know if there is any scope for introducing HTML5
in the wiki, or whether something of the same sort has been done in the
past that could be improved upon, or whether something entirely new can be
implemented.

With Regards,

On 15 February 2012 03:42, K. Peachey  wrote:

> On Wed, Feb 15, 2012 at 3:19 AM, Ryan Lane  wrote:
> >> 3. jQuery drop down menu - I wanted to implement this functionality on
> >> every page. I had seen the SignUP API & wanted this universally. If
> there
> >> are security issues with AJAX, then there is no need to even implement
> the
> >> jQuery alongside. (Idea dropped)
> >>
> >
> > Well, just because it would be insecure the way the Wikimedia sites
> > are configured doesn't mean it isn't useful for others. Any site that
> > is either 100% http or 100% https could use this.
> >
> > - Ryan
>
> The security issues I was referring to (And Dantman kinda pointed were
> slightly non issues) was any JS can be run on those pages (so people
> could do things to the login forms) where as on the separate login
> page, We are a lot more restrictive about what JS can be run thus
> slightly mitigating that risk.
>



-- 
Shivansh Srivastava | +91-955-243-5407 |
http://in.linkedin.com/pub/shivansh-srivastava/17/a50/b18
 Secretary, BITS Alumni Affairs Division
| Web Expert, Newsletter, BITSAA International
3rd Year Undergraduate | B.E. (Hons.) - Electronics & Instrumentation
BITS-Pilani.


Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Mark A. Hershberger
Jeremias Nunez  writes:

> 1 - Is this a feature that could actually be needed/helpful/useful?

I often have to copy my sig to a new wiki whenever I sign into it to
make a comment.  My email is copied to every wiki automatically, why not
my signature?

I would LOVE to have this implemented.

> 2 - Is it true that it's currently not being worked on?

I don't know of anyone working on this.

> 3 - Would it be too big of a project for a single person to take on?
> (given the amount of time to actually implement it for GSoC)

If nothing else, I'd like signatures copied.  That should be simple
enough, but I don't know enough about how the emails are copied.  It
seems like this could at least be a start to create a method of copying
preferences if not doing the whole thing.

-- 
Mark A. Hershberger
Bugmeister
Wikimedia Foundation
m...@wikimedia.org



Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-16 Thread Jeremias Nuñez
When you say "extensive consultation" are you referring to the UI or to
which preferences could be made global? I think we talked about this on IRC
too but I don't quite remember.

I'd also appreciate it if you could point me to where I could start
"diving in", so I can get a sense of how things were done.

Thanks again.

2012/2/15 Andrew Garrett 

> On Wed, Feb 15, 2012 at 4:29 AM, Jeremias Nunez  >wrote:
>
> >
> > Hey everyone,
> >
> > Reading around MW and looking into some ideas for GSoC I ran into this
> > bug, https://bugzilla.wikimedia.org/show_bug.cgi?id=14950, which I
> > noticed (correct me if i'm wrong) isn't currently being worked on, and as
> > Roan Kattouw said on one of the last comments, hasn't ever been written
> yet
> > (i know it's kind of old, but anyway).
> >
> > I wanted to get some feedback on the next couple of things:
> >
> > 1 - Is this a feature that could actually be needed/helpful/useful?
> > 2 - Is it true that it's currently not being worked on?
> > 3 - Would it be too big of a project for a single person to take on?
> > (given the amount of time to actually implement it for GSoC)
> >
>
> We discussed this on IRC. A quick recap:
>
> I tried to implement this when I completely refactored the preferences
> system in 2009. It was eventually reverted in
> r49932.
> The main blocker was basically considering a way to decide *which*
> preferences would have their values synchronised. A UI would need to be
> developed for that and you'd need some extensive consultation on that fact.
>
> If you were to implement this, you could potentially use my original
> implementation as a guide, though it is reasonably "in the guts" of
> MediaWiki so you'd have to be reasonably confident "code diving" into
> unfamiliar software packages.
>
> I'll defer to Sumana on whether or not the project is of an appropriate
> size for GSoC.
>
> —Andrew
>
> --
> Andrew Garrett
> Wikimedia Foundation
> agarr...@wikimedia.org


Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-15 Thread Andrew Garrett
On Wed, Feb 15, 2012 at 4:29 AM, Jeremias Nunez wrote:

>
> Hey everyone,
>
> Reading around MW and looking into some ideas for GSoC I ran into this
> bug, https://bugzilla.wikimedia.org/show_bug.cgi?id=14950, which I
> noticed (correct me if i'm wrong) isn't currently being worked on, and as
> Roan Kattouw said on one of the last comments, hasn't ever been written yet
> (i know it's kind of old, but anyway).
>
> I wanted to get some feedback on the next couple of things:
>
> 1 - Is this a feature that could actually be needed/helpful/useful?
> 2 - Is it true that it's currently not being worked on?
> 3 - Would it be too big of a project for a single person to take on?
> (given the amount of time to actually implement it for GSoC)
>

We discussed this on IRC. A quick recap:

I tried to implement this when I completely refactored the preferences
system in 2009. It was eventually reverted in
r49932.
The main blocker was basically considering a way to decide *which*
preferences would have their values synchronised. A UI would need to be
developed for that and you'd need some extensive consultation on that fact.

If you were to implement this, you could potentially use my original
implementation as a guide, though it is reasonably "in the guts" of
MediaWiki so you'd have to be reasonably confident "code diving" into
unfamiliar software packages.

I'll defer to Sumana on whether or not the project is of an appropriate
size for GSoC.

—Andrew

-- 
Andrew Garrett
Wikimedia Foundation
agarr...@wikimedia.org


Re: [Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-15 Thread Bináris
2012/2/15 Jeremias Nunez 

>
> 1 - Is this a feature that could actually be needed/helpful/useful?
>

Would be the most useful thing since sliced bread! I miss it very much. Go
on! :-)

-- 
Bináris


[Wikitech-l] GSoC 2012 project thoughts (Global User Preferences)

2012-02-15 Thread Jeremias Nunez

Hey everyone, 

Reading around MW and looking into some ideas for GSoC, I ran into this bug,
https://bugzilla.wikimedia.org/show_bug.cgi?id=14950, which I noticed (correct
me if I'm wrong) isn't currently being worked on and, as Roan Kattouw said in
one of the last comments, hasn't ever been written yet (I know it's kind of
old, but anyway).

I wanted to get some feedback on the next couple of things:

1 - Is this a feature that could actually be needed/helpful/useful?
2 - Is it true that it's currently not being worked on?
3 - Would it be too big of a project for a single person to take on? (given the 
amount of time to actually implement it for GSoC)

Thanks for your time.

Jere.

  


Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-15 Thread Akshay Agarwal
Hi,
 I have made a TODO list of what needs to be done to take this
extension to the next level & get it deployment-ready, and will keep updating
my user page as I progress. My ongoing exams have slowed progress; I
will be back in swing once they are completed at the end of this month.

Thanks!

- Akshay

On Tue, Feb 14, 2012 at 6:59 AM, Sumana Harihareswara  wrote:

> On 02/13/2012 08:10 PM, K. Peachey wrote:
> > On Tue, Feb 14, 2012 at 10:38 AM, Shivansh Srivastava
> >  wrote:
> >> 3. jQuery drop menu for login - (
> >>
> https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#
> )
> >> - Can be integrated with AJAX for an on the page account validation or
> >> creation; without having to go to a different page. (
> >
> > I believe we had a GSOC project not long ago (Last year?) to improve
> > the login process (including the AJAX side of things and API support
> > for it). Although for security reasons I believe we would want to
> > leave the Login stuff on its own page.
>
> Yeah, I think Shivansh didn't look carefully enough at
> http://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects when I
> pointed him there to compare his ideas with other past projects.  :-)
>
> https://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects#Signup_API
>
> Signup API (Original name: "Account Creation, Login Screens and
> AJAX-ification of everything"). Goal: Implement the Account creation
> proposal, which would allow for Ajax-based login. This was implemented
> as the SignupAPI extension.
>
> Status: The SignupAPI extension is currently at "experimental" status.
> Some of Akshay Agarwal's commits have received code critique and await
> improvement. Akshay is responding to those critiques and working towards
> the goal of deploying the extension on WMF wikis.
>
> CC'ing Akshay.  Akshay, how is that going?
>
> --
> Sumana Harihareswara
> Volunteer Development Coordinator
> Wikimedia Foundation


Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-14 Thread K. Peachey
On Wed, Feb 15, 2012 at 3:19 AM, Ryan Lane  wrote:
>> 3. jQuery drop down menu - I wanted to implement this functionality on
>> every page. I had seen the SignUP API & wanted this universally. If there
>> are security issues with AJAX, then there is no need to even implement the
>> jQuery alongside. (Idea dropped)
>>
>
> Well, just because it would be insecure the way the Wikimedia sites
> are configured doesn't mean it isn't useful for others. Any site that
> is either 100% http or 100% https could use this.
>
> - Ryan

The security issue I was referring to (which Dantman kind of pointed out
is somewhat of a non-issue) is that any JS can be run on those pages (so
people could do things to the login forms), whereas on the separate login
page we are a lot more restrictive about what JS can be run, thus
slightly mitigating that risk.



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-14 Thread Ryan Lane
> 3. jQuery drop down menu - I wanted to implement this functionality on
> every page. I had seen the SignUP API & wanted this universally. If there
> are security issues with AJAX, then there is no need to even implement the
> jQuery alongside. (Idea dropped)
>

Well, just because it would be insecure the way the Wikimedia sites
are configured doesn't mean it isn't useful for others. Any site that
is either 100% http or 100% https could use this.

- Ryan



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-14 Thread Helder
Hi!

On Tue, Feb 14, 2012 at 15:02, Shivansh Srivastava
 wrote:
> 1. Page Preview - http://en.wikipedia.org/wiki/MediaWiki:Gadget-popups.js -
> I had seen this, it takes a bit of time to load. I'll try & use/edit/modify
> it on my common.js more, see if I can be a bit more creative/improve upon
> it. Would have to get back on this later.

I was wondering if while you are looking into this code you could help
non-English projects by fixing this bug?
http://en.wikipedia.org/wiki/MediaWiki_talk:Gadget-popups.js#i18_for_Special:Contributions

Best regards,
Helder



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-14 Thread Shivansh Srivastava
Hi All,

Thanks a lot for your inputs.

The reason I proposed more than a few ideas was so that I could get
feedback from the community and narrow/filter them down. I was
a little unsure whether or not these projects could be finished over the
summer (16 weeks, I presumed, from last year's dates of 11th May to 30th
August), & hence narrowing down to the most needed projects would give me a
more focused aim.

With regard to the previous mails, I'll go point by point.

1. Page Preview - http://en.wikipedia.org/wiki/MediaWiki:Gadget-popups.js -
I had seen this, it takes a bit of time to load. I'll try & use/edit/modify
it on my common.js more, see if I can be a bit more creative/improve upon
it. Would have to get back on this later.

2. Image Preview - Can be integrated with Image magnification X times. More
aptly shown here - http://james.padolsey.com/demos/imgPreview/full/ &
http://cssglobe.com/lab/tooltip/02/

3. jQuery drop down menu - I wanted to implement this functionality on
every page. I had seen the SignUP API & wanted this universally. If there
are security issues with AJAX, then there is no need to even implement the
jQuery alongside. (Idea dropped)

5. News Ticker - News Ticker at the top of every page, which shows the
latest headlines, fed from an RSS feed. I saw in the BBC Website (
http://www.bbc.co.uk/news/), & I consider it an informative addition to a
page for any user visiting a page. RSS can be taken from WikiNews.org

6. OnScreen Keypad/Keyboard - I wasn't aware of the Extension (
https://www.mediawiki.org/wiki/Extension:Narayam). But isn't this only for
some specific dialects? What I had in mind was only for En Wikipedia. JS
would enable it whenever the user enables it as a gadget. Should I drop this
idea then?

8. HTML5 Implementation - Kindly find attached a small HTML file that
changes the hue, saturation & colour of text. Could something related/better with
HTML5 be implemented?

7. Voice Search -
(http://en.wikipedia.org/wiki/User:Shivansh13/common.js -Extension
5). I want to implement it in the Search box directly. The Search box can
be seen at the end of the page. Apologies for the poor JS there, it's just a
general idea.

10. Bookmark & Sharethis - A very small application. But guess the
specified Extension Sharebox does the bit. (Idea dropped)

11. Accordion - Again for long paragraphs. I guess I would have to prepend a
div before each para found on the page, & jQuery could be added in the js.
Suggestions?

12. Float Menu - (
https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/FloatingToc/index.html)
- JS + CSS combined on this. I would consider it useful for long pages.
Suggestions?

13. Notified scrollbar - Actually it would be an awesome functionality.
The main intention would, again, be for long pages. The scrollbar would be
divided, & at the top of each part of the scrollbar would be a label with the
div id. Something to think about?

I hope I have replied appropriately & covered all points. Please reply with
suggestions.

I am doing this drill well before the submission list of GSoC, so that I
can have an approx idea/heads up on what I am required to do, if I get a go
ahead.

@Sumana Ma'am - I am still learning the bits of the Open Source World.
Hence, kindly overlook the imprudence of my questions, which may come from
time to time. :-P :-D

With Regards,

On 14 February 2012 06:59, Sumana Harihareswara wrote:

> On 02/13/2012 08:10 PM, K. Peachey wrote:
> > On Tue, Feb 14, 2012 at 10:38 AM, Shivansh Srivastava
> >  wrote:
> >> 3. jQuery drop menu for login - (
> >>
> https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#
> )
> >> - Can be integrated with AJAX for an on the page account validation or
> >> creation; without having to go to a different page. (
> >
> > I believe we had a GSOC project not long ago (Last year?) to improve
> > the login process (including the AJAX side of things and API support
> > for it). Although for security reasons I believe we would want to
> > leave the Login stuff on its own page.
>
> Yeah, I think Shivansh didn't look carefully enough at
> http://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects when I
> pointed him there to compare his ideas with other past projects.  :-)
>
> https://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects#Signup_API
>
> Signup API (Original name: "Account Creation, Login Screens and
> AJAX-ification of everything"). Goal: Implement the Account creation
> proposal, which would allow for Ajax-based login. This was implemented
> as the SignupAPI extension.
>
> Status: The SignupAPI extension is currently at "experimental" status.
> Some of Akshay Agarwal's commits have received code critique and await
> improvement. Akshay is responding to those critiques and working towards
> the goal of deploying the extension on WMF wikis.
>
> CC'ing Akshay.  Akshay, how is that going?
>
> --
> Sumana Harihareswara
> Volunteer Development Coordinator
> Wikimedia Foundation
>

Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-13 Thread Sumana Harihareswara
On 02/13/2012 08:10 PM, K. Peachey wrote:
> On Tue, Feb 14, 2012 at 10:38 AM, Shivansh Srivastava
>  wrote:
>> 3. jQuery drop menu for login - (
>> https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#)
>> - Can be integrated with AJAX for an on the page account validation or
>> creation; without having to go to a different page. (
> 
> I believe we had a GSOC project not long ago (Last year?) to improve
> the login process (including the AJAX side of things and API support
> for it). Although for security reasons I believe we would want to
> leave the Login stuff on its own page.

Yeah, I think Shivansh didn't look carefully enough at
http://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects when I
pointed him there to compare his ideas with other past projects.  :-)

https://www.mediawiki.org/wiki/Summer_of_Code_Past_Projects#Signup_API

Signup API (Original name: "Account Creation, Login Screens and
AJAX-ification of everything"). Goal: Implement the Account creation
proposal, which would allow for Ajax-based login. This was implemented
as the SignupAPI extension.

Status: The SignupAPI extension is currently at "experimental" status.
Some of Akshay Agarwal's commits have received code critique and await
improvement. Akshay is responding to those critiques and working towards
the goal of deploying the extension on WMF wikis.

CC'ing Akshay.  Akshay, how is that going?

-- 
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-13 Thread Daniel Friesen

On Mon, 13 Feb 2012 17:10:47 -0800, K. Peachey  wrote:


On Tue, Feb 14, 2012 at 10:38 AM, Shivansh Srivastava
 wrote:

3. jQuery drop menu for login - (
https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#)
- Can be integrated with AJAX for an on the page account validation or
creation; without having to go to a different page. (


I believe we had a GSOC project not long ago (Last year?) to improve
the login process (including the AJAX side of things and API support
for it). Although for security reasons I believe we would want to
leave the Login stuff on its own page.


The idea that login is secure because it's on a separate page from the
rest of the site is actually an old mistake.
If a script is included ANYWHERE on the site on the same domain then it's  
possible to inject in some code that will fake pageviews in a way that  
will let an attacker have a running script when the user follows the login  
link to the login page.
So there isn't really any security advantage of a separate login page over  
an ajax login. (well ;) unless you're using the separate login page  
because you have js disabled, then you're safe, heh)


--
~Daniel Friesen (Dantman, Nadir-Seen-Fire) [http://daniel.friesen.name]



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-13 Thread Sumana Harihareswara
I talked to Shivansh in IRC earlier today.

On 02/13/2012 07:38 PM, Shivansh Srivastava wrote:
> I understand there are 16 weeks in which I would have to complete my
> project(s). So, kindly also inform me what would happen if the time taken
> by me exceeds & extends after the summer? Would I be allowed to complete
> the remaining projects afterwards?

As I mentioned in IRC, you should write proposals that you can complete
by August 20th, presuming that you start coding on May 21st.  Where did
you get the 16 week figure?  Are you counting the April 23rd-May 21st
"community bonding period"?  You could start coding during that time if
you're ready, but most people have other things they need to do during
that time, like design work, learning coding standards, learning to use
the source control system and code review system, and so on.
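
For reference, the week counts behind these dates can be checked with a few lines of Python. This is only a sketch of the arithmetic: the dates are the ones quoted above (coding from May 21st to August 20th, community bonding starting April 23rd), and the helper name `weeks_between` is made up for illustration.

```python
from datetime import date


def weeks_between(start: date, end: date) -> float:
    """Return the number of weeks from start to end (hypothetical helper)."""
    return (end - start).days / 7


# Dates quoted in this message for GSoC 2012.
coding = weeks_between(date(2012, 5, 21), date(2012, 8, 20))
bonding = weeks_between(date(2012, 4, 23), date(2012, 5, 21))

print(f"coding period:     {coding:.0f} weeks")
print(f"community bonding: {bonding:.0f} weeks")
print(f"both together:     {coding + bonding:.0f} weeks")
```

So the coding period on its own is 13 weeks, and it only reaches 17 weeks if the community bonding period is counted as well.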

The timeline:

http://www.google-melange.com/gsoc/events/google/gsoc2012

As I had mentioned in IRC, if you set goals for your proposal and then
you don't finish them by August 20th, you may fail GSoC, which would
mean that you wouldn't get the second half of the payment.  Of course
you could continue working on any suggestions or features that you want
-- I'm not sure where you got the idea that you would not be allowed to.
Anyone can try to work on a suggested improvement to MediaWiki at any
time, with or without Google Summer of Code.  And even if you don't get
accepted into GSoC, you can contribute whatever you want and work to
improve MediaWiki.

> Also, it would be great if there are any more ideas that can be included
> here to improve User Interactivity.

The reason I suggested that you talk to Brandon is that he is the senior
designer at the Wikimedia Foundation.  Here are some of his projects:
https://wikimediafoundation.org/wiki/User:Jorm  Look at them and think
about whether they would affect any of your designs or suggestions.


-- 
Sumana Harihareswara
Volunteer Development Coordinator
Wikimedia Foundation



Re: [Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-13 Thread K. Peachey
On Tue, Feb 14, 2012 at 10:38 AM, Shivansh Srivastava
 wrote:
> 1. Page Preview (onmouseover) - A small Dialog box, that could display the
> Intro para of the hypertext/link in an iframe or related text. (1 week +
> testing) . I'll need to discuss this more

There is a nice little user-based gadget on En.Wikipedia that already
does this and a bit extra, called Navigation Popups[1], which you might
want to look at playing with rather than starting completely from
scratch.


> 2. Image Preview (onclick) or (Rollover Effect to provide more description)
> or (URL description snapshot preview) - A dialog or lightbox, that gives a
> preview of Image or provides a description for it. It would be useful when
> it's difficult to find the corresponding description for the same in lengthy
> paragraphs. (4-5 days +1 day testing)

It would be slightly hard to find a "description" of the file, since
not all file description pages contain one, or have it in an easy-to-find
location on the page (we don't store that information separately,
only as part of the overall wikipage). And then there is
also the possibility that the description on that page isn't exactly the
same as what they are trying to convey on the other wikipage/article.



> 3. jQuery drop menu for login - (
> https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#)
> - Can be integrated with AJAX for an on the page account validation or
> creation; without having to go to a different page. (

I believe we had a GSOC project not long ago (Last year?) to improve
the login process (including the AJAX side of things and API support
for it). Although for security reasons I believe we would want to
leave the Login stuff on its own page.

> 6. OnScreen Keypad/Keyboard - for people who mostly depend on mouse over
> keyboard, on screen keyboard would be displayed, when clicked on the Search
> box. (4-5 Days + 2Days testing)

We already have an extension to do this (Extension:Narayam[5]), although
that is primarily designed to help with editing in language scripts other
than English. If a user depended on or preferred a mouse over a keyboard,
I would imagine that they would use something locally rather than
hoping the webpage provided this functionality?

> 10. Bookmark & ShareThis - Should be present on all pages. Sharing on
> common networking sites, tweeting, etc.

There are already a few extensions (e.g. Extension:Addthis[3]) and
gadgets (e.g. Sharebox[4]) that provide this functionality, so I'm not quite
sure what benefit there would be in doing this as part of your GSOC project.

> 11. Accordion - (For paras) - Long paragraphs can be hidden/shown on click
> to save time, space for scroll. (1 Week)

We do this on the mobile site[2] because of the smaller screens, but
on larger screens I'm not sure it would be that helpful, because I think
we would want it to default to non-collapsed on page load.

> 13. Notified ScrollBar (jQuery) - A scrollbar at the right side of the
> screen, which will distinctly be divided into minibars, depending on the
> div id & length of the paragraphs/text. (1.5 Weeks)
> I could send a modified page to explain it better.

How would this fare usability-wise? I would have thought that would be
bad for usability.

[1]. 
[2]. 
[3]. 
[4]. 
[5]. 



[Wikitech-l] GSoC 2012 Ideas/Projects

2012-02-13 Thread Shivansh Srivastava
Hi Sir,

Kindly find below the abstract of what I want to propose for GSoC '12.
Could you please review it & suggest how to make it better before I mail it to
the Wikitech-l mailing list? Kindly tell me whether I need to be more
elaborate & whether they can be taken up as projects under GSoC '12.

Since GSoC 2012 has been announced officially, I wanted to give an abstract
of what I want to implement and propose a few ideas I have in mind for the
same. Kindly consider the ideas below & review the same. I have also
mentioned the approx time I might be taking for each idea.

I understand there are 16 weeks in which I would have to complete my
project(s). So, kindly also inform me what would happen if the time taken
by me exceeds & extends after the summer? Would I be allowed to complete
the remaining projects afterwards?

*Overview- *

My main aim is improving the Front end of the Wiki for improved User
Interactivity.
I have spent some time in understanding the concepts of Gadgets &
Extensions. I haven't authored any gadgets yet & currently, I implement the
javascript in common.js. I would mostly be implementing the following
ideas, if allowed, as gadgets.

I have been implementing/checking different things with jQuery/JavaScript &
CSS.
I have only been working on the common.js -
http://en.wikipedia.org/wiki/User:Shivansh13/common.js &
http://en.wikipedia.org/wiki/User:Shivansh13/common.css
(I am still working on the 4th & 5th extensions there)

implementing the abstract I tried to show here at the WikiMedia Conference
in Mumbai -
https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/

*My ideas/proposals are:-*

1. Page Preview (onmouseover) - A small Dialog box, that could display the
Intro para of the hypertext/link in an iframe or related text. (1 week +
testing) . I'll need to discuss this more

2. Image Preview (onclick) or (Rollover Effect to provide more description)
or (URL description snapshot preview) - A dialog or lightbox, that gives a
preview of Image or provides a description for it. It would be useful when
it's difficult to find the corresponding description for the same in lengthy
paragraphs. (4-5 days +1 day testing)

3. jQuery drop menu for login - (
https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/Login/index.html#)
- Can be integrated with AJAX for an on the page account validation or
creation; without having to go to a different page. (

4. Lightbox (onclick) - To view a particular image enlarged, embedded
audio, videos, flash in a dialog box. (1 week + 2 days testing)

5. News Ticker - I want to implement a News Ticker at the top of every
page, which shows the latest headlines, fed from an RSS feed. (1 Week + 1
day testing)

6. OnScreen Keypad/Keyboard - for people who mostly depend on mouse over
keyboard, on screen keyboard would be displayed, when clicked on the Search
box. (4-5 Days + 2Days testing)

7. Voice Search - I am working on adding a Voice Search through Javascript;
(http://en.wikipedia.org/wiki/User:Shivansh13/common.js -Extension 5). I
want to implement it in the Search box directly.

8. HTML5 - Need help in implementing this & ideas for the same. For a
start, I want to change hue, saturation, colour, etc of the screen & text
dynamically - I am currently working on the javascript.

9. OnMouseOver - Enlarge an image position by position & magnify it X times
& show it in an iframe above the mouse pointer. (1 Week)

10. Bookmark & ShareThis - Should be present on all pages. Sharing on
common networking sites, tweeting, etc.

11. Accordion - (For paras) - Long paragraphs can be hidden/shown on click
to save time, space for scroll. (1 Week)

12. Float Menu - (
https://svn.wikimedia.org/svnroot/mediawiki/trunk/mockups/ajax-mockups/FloatingToc/index.html)
- Would need inputs, as it can be really helpful for long pages (Size
>2-2.5 MB) (1 Week)

13. Notified ScrollBar (jQuery) - A scrollbar at the right side of the
screen, which will distinctly be divided into minibars, depending on the
div id & length of the paragraphs/text. (1.5 Weeks)
I could send a modified page to explain it better.



Also, it would be great if there are any more ideas that can be included
here to improve User Interactivity.

Would I be allowed to take up all these under the name of a single project?

Waiting for a reply.

With Regards,
--
Shivansh Srivastava | +91-955-243-5407 |
http://in.linkedin.com/pub/shivansh-srivastava/17/a50/b18
 Secretary, BITS Alumni Affairs Division
| Web Expert, Newsletter, BITSAA International
3rd Year Undergraduate | B.E. (Hons.) - Electronics & Instrumentation
BITS-Pilani.