Re: [Wikitech-l] Project Idea " Extension: Offline MediaWiki "

2015-09-22 Thread Emmanuel Engelhart
On 22.09.2015 19:39, Brian Wolff wrote: > On 9/22/15, adisha porwal wrote: >> Greeting, >> I want to contribute to wikimedia and for that Outreachy >> intership program looks perfect fit for >> me. > > The most prominent is Kiwix ( http://www.kiwix.org/wiki/Main_

Re: [Wikitech-l] Modifications list and automated (cron?) task.

2015-02-14 Thread Emmanuel Engelhart
On 14.02.2015 14:39, Georges DICK wrote: I installed Mediawiki (and a couple of extensions) as an intranet website. Everyone in the company is entitled to publish, so my fear is people missing important information. My idea is to send a weekly e-mail to list every modified pages (link to page, a

Re: [Wikitech-l] Idea for new desktop / mobile kiwix like application

2015-01-26 Thread Emmanuel Engelhart
On 26.01.2015 13:09, Petr Bena wrote: 1) There may be no ZIM's for the wiki they want to use and they have no idea how to create one. They won't be able to use kiwix here. We propose a ZIM file for most of the "important" projects. The problem is that we still don't have the resources to gener

Re: [Wikitech-l] Idea for new desktop / mobile kiwix like application

2015-01-26 Thread Emmanuel Engelhart
Dear Petr On 23.01.2015 11:59, Petr Bena wrote: Some of you probably know kiwix - kiwix.org which is offline wikipedia reader. I think the idea of this reader is cool, most of you probably sometimes wanted to access wikipedia while being offline somewhere, but couldn't. Kiwix can help with this,

Re: [Wikitech-l] [MediaWiki-l] Google Code-in: First week achievements!

2014-12-28 Thread Emmanuel Engelhart
On 10.12.2014 15:30, Jonathan Aquilina wrote: Hey guys reading through some of these tasks, I am wondering has the wiki media foundation taken into account mobile devices and started taking media wiki down a mobile friendly path through using bootstrap etc? They are a dozen of tasks (mostly for

Re: [Wikitech-l] [Offline-l] Bug day: Book tool/Collection/PDF, 2014-10-08, 14–22 UTC

2014-10-09 Thread Emmanuel Engelhart
On 09.10.2014 00:35, Federico Leva (Nemo) wrote: And it's over! We reached our immediate goal, "closing" all the lost PediaPress tickets (80 before the bug day); and about 40 new bugzilla reports were filed, including some tricky ones about language support. https://www.mediawiki.org/wiki/Bug_ma

Re: [Wikitech-l] State of the DumpHTML extension

2014-10-02 Thread Emmanuel Engelhart
On 02.10.2014 19:15, Chris McMahon wrote: I may be mistaken, but isn't this done by Kiwix now? There was some discussion of that at http://www.kiwix.org/wiki/Mediawiki_DumpHTML_extension_improvement, and recent discussion here: https://blog.wikimedia.org/2014/09/12/emmanuel-engelhart-invent

Re: [Wikitech-l] Engineers in residence

2014-08-09 Thread Emmanuel Engelhart
On 08/09/2014 03:27 PM, Gilles Dubuc wrote: This is an idea I've had for a while, and I'd like to see if there's any interest, or on the contrary concerns, about it. I would like to explore (and if I have official blessing, champion) the idea of asking corporations with software engineering staff

Re: [Wikitech-l] Tech questions for outreach presentation

2014-06-01 Thread Emmanuel Engelhart
On 01.06.2014 08:44, ENWP Pine wrote: I would like to talk with someone who has broad knowledge on the tech side of the Wikimedia universe so I can ask some questions about hardware, Labs, and MediaWiki. An IRC conversation would be ideal because then I would have a written record that I could

Re: [Wikitech-l] download.wikimedia.org, dumps.wikimedia.org moves

2014-03-26 Thread Emmanuel Engelhart
Le 26/03/2014 08:36, Ariel T. Glenn a écrit : > These names will be moved so that requests to them go to our server in > the eqiad data center. This should not cause any service interruptions > but you may notice more current files available for download as the > switch goes into effect. > > Time

Re: [Wikitech-l] [Offline-l] The Whole Wikipedia in English with pictures in one 40GB big file

2014-03-08 Thread Emmanuel Engelhart
Le 07/03/2014 19:25, Asaf Bartov a écrit : > btw, are these new improved tools documented anywhere? > http://kiwix.org/wiki/Development does not seem to point in the right > direction. The usage is pretty straightforward (for IT people) and IMO everything necessary is explained in the READMEs: *

Re: [Wikitech-l] [Offline-l] The Whole Wikipedia in English with pictures in one 40GB big file

2014-03-02 Thread Emmanuel Engelhart
Le 02/03/2014 01:33, Samuel Klein a écrit : > Brilliant. Congrats to everyone who is working on this! > What is needed to scrape categories? 0 - For all dumped pages (so at least NS_MAIN and NS_CATEGORY pages), download the list of categories they belong to (with the MW API). 1 - For each dumped

Re: [Wikitech-l] The Whole Wikipedia in English with pictures in one 40GB big file

2014-03-01 Thread Emmanuel Engelhart
Le 01/03/2014 19:26, James Forrester a écrit : > On Saturday, March 1, 2014, Emmanuel Engelhart wrote: > fix a few issues on mwoffliner: >> * Recreate the "table of content" based on the HTML DOM (*) > > We are currently working on doing similar work to this in Visu

[Wikitech-l] The Whole Wikipedia in English with pictures in one 40GB big file

2014-03-01 Thread Emmanuel Engelhart
Hi For the first time, we have achieved to release a complete dump of all encyclopedic articles of the Wikipedia in English, *with thumbnails*. This ZIM file is 40 GB big and contains the current 4.5 million articles with their 3.5 millions pictures: http://download.kiwix.org/zim/wikipedia_en_all

Re: [Wikitech-l] Hong-Kong citizens being unable to upload big content to Commons

2014-02-12 Thread Emmanuel Engelhart
Le 11/02/2014 21:02, Antoine Musso a écrit : > Le 11/02/2014 19:06, Emmanuel Engelhart a écrit : >> It seems that people in Hong-Kong are not able to upload big content >> (for example videos) to Commons due to (what looks like to be) a lack of >> connectivity between their

[Wikitech-l] Hong-Kong citizens being unable to upload big content to Commons

2014-02-11 Thread Emmanuel Engelhart
Hi It seems that people in Hong-Kong are not able to upload big content (for example videos) to Commons due to (what looks like to be) a lack of connectivity between their Internet provider and the Wikimedia datacenters (but Youtube works well). I reported about that on Bugzilla: https://bugzilla

Re: [Wikitech-l] QRCode - QRpedia

2013-12-09 Thread Emmanuel Engelhart
Hi Rodrigo, I have open a bug for this: https://bugzilla.wikimedia.org/show_bug.cgi?id=58204 Your request is legitimate, it seems important too me to go ahead on this. If someone is motivated to implemented this feature, it would be a pleasure for me to review his patch. Otherwise, I will try to

Re: [Wikitech-l] QRCode - QRpedia

2013-12-04 Thread Emmanuel Engelhart
Hi Rodrigo Le 04/12/2013 15:44, Rodrigo Padula a écrit : > I'm starting some conversations here in Brazil about the GLAM and other > touristic ideas under the Brazilian Education Program. > > So, we are very interested to use the QRCodes linking to wikipedia > articles, so we are evaluating QRped

Re: [Wikitech-l] Re-implementing PDF support

2013-11-13 Thread Emmanuel Engelhart
Le 13/11/2013 17:10, Tyler Romeo a écrit : > On Wed, Nov 13, 2013 at 12:45 AM, Erik Moeller wrote: > >> Most likely, we'll end up using Parsoid's HTML5 output, transform it >> to add required bits like licensing info and prettify it, and then >> render it to PDF via phantomjs, but we're still loo

Re: [Wikitech-l] FOSDEM update

2013-09-30 Thread Emmanuel Engelhart
Le 25/09/2013 19:39, Quim Gil a écrit : > Wikimedia wants to have a stand, and we have received an offer to help > from the nascent Wikimedia Belgium chapter. Probably more help can be > aggregated from CH, DE, FR, NL, UK + other tech contributors in the > region? Let's do something really cool! To

Re: [Wikitech-l] OxygenGuide (Wikivoyage offline): Android app seeking new maintainer

2013-08-21 Thread Emmanuel Engelhart
Le 21/08/2013 22:15, Sumana Harihareswara a écrit : > (I'd love a forward to offline-l and mobile-l.) > > https://en.wikivoyage.org/wiki/Wikivoyage:Travellers%27_pub#OxygenGuide_.28offline_Wikivoyage.29_updated.21 > > http://code.google.com/p/oxygenguide > > "OxygenGuide is Wikivoyage in the for

Re: [Wikitech-l] FLAC support in Mediawiki/Commons

2013-06-14 Thread Emmanuel Engelhart
Le 13/06/2013 01:11, Matthew Flaschen a écrit : > On 06/12/2013 01:06 PM, Emmanuel Engelhart wrote: >> I have found two related features requests: >> * https://bugzilla.wikimedia.org/show_bug.cgi?id=20252 >> * https://bugzilla.wikimedia.org/show_bug.cgi?id=39867 > &

[Wikitech-l] FLAC support in Mediawiki/Commons

2013-06-12 Thread Emmanuel Engelhart
Hi I'm not an audio expert but AFAIK, FLAC is probably the best solution to store lossless audio streams. Similar to TIFF for pictures. The difference (with TIFF) is that we don't have FLAC support for now in Mediawiki... and it seems there is no way at all to upload audio streams in a lossless f

Re: [Wikitech-l] GSoC Project

2013-04-29 Thread Emmanuel Engelhart
Dear Kiran Before commenting your proposal, let me thank: * Quim for having renamed this thread... I wouldn't have got a chance to read it otherwise. * Gnosygnu and Sumana for their previous answers. Your emails points three problems: (1) The size of the offline dumps (2) Server mode of the offli

Re: [Wikitech-l] Get ArchiveLinks the last step to completion

2012-11-18 Thread Emmanuel Engelhart
Really essential extension to finish and bring in prod! Unfortunately, no time to work on that :( Emmanuel Le 18/11/2012 13:36, Sumana Harihareswara a écrit : > The Internet Archive wants to particularly make sure to archive pages > that Wikipedians use as citations. A GSoC project last year got

Re: [Wikitech-l] Media Author/License information in the database

2012-10-17 Thread Emmanuel Engelhart
Le 17/10/2012 10:07, David Gerard a écrit : > On 17 October 2012 09:02, Platonides wrote: > >> Note however that there are some pictures with multiple authors >> (derivative works, collages...) and those are harder to determine and >> store (a simple field for the author is not enough). > > And

Re: [Wikitech-l] Media Author/License information in the database

2012-10-17 Thread Emmanuel Engelhart
Le 17/10/2012 10:07, David Gerard a écrit : > On 17 October 2012 09:02, Platonides wrote: > >> Note however that there are some pictures with multiple authors >> (derivative works, collages...) and those are harder to determine and >> store (a simple field for the author is not enough). > > > A

Re: [Wikitech-l] Media Author/License information in the database

2012-10-17 Thread Emmanuel Engelhart
Le 17/10/2012 10:07, David Gerard a écrit : > On 17 October 2012 09:02, Platonides wrote: > >> Note however that there are some pictures with multiple authors >> (derivative works, collages...) and those are harder to determine and >> store (a simple field for the author is not enough). > > > A

Re: [Wikitech-l] Media Author/License information in the database

2012-10-17 Thread Emmanuel Engelhart
Le 16/10/2012 23:04, Platonides a écrit : > On 11/10/12 17:46, Strainu wrote: >> I did something last year for exporting the files from WLMRO to >> Europeana: >> http://code.google.com/p/wikiro/source/browse/trunk/robots/python/pywikipedia/monumente/europeana_image_list.py >> It was done very quic

Re: [Wikitech-l] Media Author/License information in the database

2012-10-11 Thread Emmanuel Engelhart
Le 11/10/2012 17:46, Strainu a écrit : > Having this information (along with other meta-data like coordinates > etc.) in the database and API would be useful I obviously agree, but I want to insist on one point: Author/license are not metadata like the others. Although it's *legal* to reuse/spread

[Wikitech-l] Media Author/License information in the database

2012-10-11 Thread Emmanuel Engelhart
Hi, I massively re-use medias from commons and I'm unable to simply (automatically) get the related author and license "attached" to each document I copy. As far as I know this is not possible (for example using the API, or dealing directly with information in the DB coming with the dumps). I'm s

Re: [Wikitech-l] GLAMwiki Toolset Project : Request for Comments - Technical Architecture

2012-09-26 Thread Emmanuel Engelhart
On 09/25/2012 12:17 PM, dan entous wrote: I thought the lab instance was only an incubation environment and that the final goal was to put gwtoolset on the WMF prod. servers. Isn't it? the final goal, as far as i understand it, is to have it run as its own application in its own environment, s

Re: [Wikitech-l] GLAMwiki Toolset Project : Request for Comments - Technical Architecture

2012-09-25 Thread Emmanuel Engelhart
/Request_for_Comments/Technical_Architecture. with kind regards, dan On Sep 24, 2012, at 9:54 PM, Emmanuel Engelhart wrote: Hi Dan, I have a few questions about the choice of the Zend Framework: * Why exactly using the Zend Framework? would like to use an open-source mvc framework that is used widely, has

Re: [Wikitech-l] GLAMwiki Toolset Project : Request for Comments - Technical Architecture

2012-09-24 Thread Emmanuel Engelhart
On 09/20/2012 04:34 PM, dan entous wrote: dear all, as some of you may already know, the GLAMwiki Toolset Project, http://outreach.wikimedia.org/wiki/GLAM/Toolset_project, is a collaboration between Wikimedia Nederland, Wikimedia UK, Wikimedia France and Europeana, with the goal of providing

Re: [Wikitech-l] HTML wikipedia dumps: Could you please provide them, or make public the code for interpreting templates?

2012-09-13 Thread Emmanuel Engelhart
Le 14/09/2012 05:26, Roberto Flores a écrit : > In all frankness, I don't see how can it be complicated or mind-blowing > to generate a HTML dump when the software is there already to produce a > wikimarkup one. > WikiMarkup dumps are of very limited use and mainly to yourselves alone. > > No need

Re: [Wikitech-l] HTML wikipedia dumps: Could you please provide them, or make public the code for interpreting templates?

2012-09-11 Thread Emmanuel Engelhart
Dear Roberto Le 09/09/2012 20:34, Roberto Flores a écrit : > I have developed an offline Wikipedia, Wikibooks, Wiktionary, etc. app for > the iPhone, which does a somewhat decent job at interpreting the wiki > markup into HTML. Great idea, but why reinventing the wheel concerning the format and n

[Wikitech-l] [KIWIX] 0.9 Release candidate 1 is out!

2012-07-16 Thread Emmanuel Engelhart
Hi We publish the first release candidate of Kiwix 0.9 (Kiwix 0.9 rc1). The most important improvements are: * Official support of Sugar * Official support of armv5 for kiwix-serve * Debian packages [1] * Multiple UI fixes on all systems * aria2c download process mgmt now 100% OK * Singleton-wind

Re: [Wikitech-l] Quick Mumbai hackathon followup

2011-12-06 Thread Emmanuel Engelhart
On 30/11/2011 04:28, Sumana Harihareswara wrote: > I'm asking Emmanuel to send an offline-related summary to > https://lists.wikimedia.org/mailman/listinfo/offline-l . The Hackathon was this time more open than for example in Berlin, so we had the visit of many people which mostly were at the same

Re: [Wikitech-l] Html dump for Wikipedia

2011-12-05 Thread Emmanuel Engelhart
On 03/12/2011 13:18, Tim Starling wrote: > On 03/12/11 08:58, Platonides wrote: >> On 02/12/11 22:33, Khalida BEN SIDI AHMED wrote: >>> Hello, >>> I need an html dump of Wikipedia but the link http://static.wikipedia.org/ >>> does >>> not work. >>> I'd appreciate any explanation or suggestion. >>>

Re: [Wikitech-l] [openZIM dev-l] Phonegap and Wikimedia mobile apps

2011-09-24 Thread Emmanuel Engelhart
On 24/09/2011 14:24, Christian Pühringer wrote: > The JAVA liblzma performance is pretty bad: To increase efficiency of > compression in the zim-format articles (and also all > other data like images) are stored in clusters. Cluster size is apparently > about > 1 MB. This implies that loading an

Re: [Wikitech-l] WMF XML dump title case problem

2011-06-26 Thread Emmanuel Engelhart
On 06/26/2011 05:22 PM, MZMcBride wrote: > Emmanuel Engelhart wrote: >> Titles should be stored in the table "page" with a first letter uppercased. >> http://en.wikipedia.org/wiki/Wikipedia:Naming_conventions_%28technical_restric >> tions%29#Lower_case_first_lette

[Wikitech-l] WMF XML dump title case problem

2011-06-26 Thread Emmanuel Engelhart
Sorry, now correctly cross posted. Emmanuel Original Message Subject:WMF XML dump title case problem Date: Sun, 26 Jun 2011 17:07:19 +0200 From: Emmanuel Engelhart To: Mailing list for Wikimedia CH , offlin...@lists.wikimedia.org Hi Titles should be stored

Re: [Wikitech-l] Question about external links CSS

2010-11-09 Thread Emmanuel Engelhart
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 On 09/11/2010 20:21, Aryeh Gregor wrote: > On Tue, Nov 9, 2010 at 4:51 AM, wrote: >> Hi, >> >> FTP external link CSS looks like that: >> >> #bodyContent a.external[href ^="ftp://";], >> .link-ftp { >>background: url(file_icon.gif) center righ