Re: [Wikitech-l] having trouble with importing XML dumps into database

2009-06-23 Thread Lars Aronsson
sr...@ischool.berkeley.edu wrote: > I am primarily interested in extracting, what image is linked > from the infobox of an article (if there is a infobox in the > article page). Initially i thought of parsing the xml for this http://meta.wikimedia.org/wiki/User:LA2/Extraktor > but then after

Re: [Wikitech-l] Image hovering effects

2009-06-23 Thread Roan Kattouw
2009/6/23 Brianna Laugher : > Also, for inline images without explicitly defined tooltips, the >> image name is used as the tooltip even though it is also shown in the URL >> when mousing over the image. Neither of these automatic tooltips are really >> useful, and they slow down page load time on

Re: [Wikitech-l] Image hovering effects

2009-06-23 Thread Daniel Kinzler
Tim Larson schrieb: > Remember the dot wrote: >> What do you think? Should we keep the redundant tooltips, or start leaving >> them out? > > I'm in the camp that considers them redundant. If the title isn't adding > anything that isn't already visible, it's not helping. > > One suggestion that h

Re: [Wikitech-l] Extending wikilinks syntax

2009-06-23 Thread Aryeh Gregor
On Tue, Jun 23, 2009 at 12:11 AM, Steve Bennett wrote: > Any better examples of why this would be a good thing? The other > example provided, coord strikes me as exactly the kind of weird > special case (it generally displays in the title area!) that deserves > to generate weird special xhtml... S

[Wikitech-l] Public repositories for research dumps

2009-06-23 Thread Felipe Ortega
Hello. Since just a few hours ago, a new public repository has been created to host WikiXRay database dumps, containing info extracted from public Wikipedia dbdumps. The image is hosted by RedIRIS (in short, the Spanish equivalent of Kennisnet in Netherlands). http://sunsite.rediris.es/mirror

Re: [Wikitech-l] Image hovering effects

2009-06-23 Thread Aryeh Gregor
On Mon, Jun 22, 2009 at 10:42 PM, Remember the dot wrote: > In Håkon Wium Lie's recent analysis of Wikipedia image markup ( > http://www.princexml.com/howcome/2009/wikipedia/image/), he makes a good > point: we include image captions both below images and again in the images' > tooltips. Also, for

Re: [Wikitech-l] Public repositories for research dumps

2009-06-23 Thread Bilal Abdul Kader
Hi Felipe,Thanks for the great effort. This will save us hours of downloading and importing older dumps. bilal On Tue, Jun 23, 2009 at 12:26 PM, Felipe Ortega wrote: > > Hello. > > Since just a few hours ago, a new public repository has been created to > host WikiXRay database dumps, containing

Re: [Wikitech-l] Different apostrophe signs and MediaWiki internal search

2009-06-23 Thread Brion Vibber
Steve Bennett wrote: > So, apostrophe (U+0027) -> curved right single quote (U+2019): yes, probably. > The other way around...probably not, unless that U+2019 exists on any > keyboards. > > Hyphen-minus (U+002D) -> em dash (U+2014): I would say no. If you > search for "clock-work", you probably d

Re: [Wikitech-l] Image hovering effects

2009-06-23 Thread Brion Vibber
Remember the dot wrote: > Hello fellow developers, > > In Håkon Wium Lie's recent analysis of Wikipedia image markup ( > http://www.princexml.com/howcome/2009/wikipedia/image/), he makes a good > point: we include image captions both below images and again in the images' > tooltips. Also, for inli

[Wikitech-l] subst'ing #if parser functions loses line breaks, and other oddities

2009-06-23 Thread William Allen Simpson
I've not done much template work since parser functions were new. Grabbing some old code examples, found it didn't work anymore. Workaround? === Ancient code (expected single space): {subst|}}}#if:{{{par1|}}}|[[Category:{{{par1}}}{subst|}}}#if:{{{key1|}}}{subst|}}}!}}{{{key1}]]

Re: [Wikitech-l] Different apostrophe signs and MediaWiki internal search

2009-06-23 Thread Andrew Dunbar
2009/6/23 Brion Vibber : > Steve Bennett wrote: >> So, apostrophe (U+0027) -> curved right single quote (U+2019): yes, probably. >> The other way around...probably not, unless that U+2019 exists on any >> keyboards. >> >> Hyphen-minus (U+002D) -> em dash (U+2014): I would say no. If you >> search

[Wikitech-l] Chinese-language search fixes

2009-06-23 Thread Brion Vibber
I've made some fixes to the MySQL search backend for Chinese and other languages using variants. Some languages don’t use word spacing, like Chinese and Japanese. To let the search index know where word boundaries are, we have to internally insert spaces between some characters: 维基百科 ->

Re: [Wikitech-l] subst'ing #if parser functions loses line breaks, and other oddities

2009-06-23 Thread Tim Starling
William Allen Simpson wrote: > {subst|}}}#if:{{{par1|}}}|[[Category:{{{par1}}}{subst|}}}#if:{{{key1|}}}{subst|}}}!}}{{{key1}]] > > }}{subst|}}}#if:{{{par2|}}}|[[Category:{{{par2}}}{subst|}}}#if:{{{key2|}}}{subst|}}}!}}{{{key2}]] > > }}{subst|}}}#if:{{{par3|}}

Re: [Wikitech-l] Image hovering effects

2009-06-23 Thread Remember the dot
On Tue, Jun 23, 2009 at 8:37 AM, Roan Kattouw wrote: > > 2009/6/23 Brianna Laugher : > > Also, for inline images without explicitly defined tooltips, the > >> image name is used as the tooltip even though it is also shown in the URL > >> when mousing over the image. Neither of these automatic tool