Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Chengbin Zheng
On Tue, Sep 1, 2009 at 9:00 PM, Platonides platoni...@gmail.com wrote:

  Chengbin Zheng wrote:
  I bring this old issue up because I want to know if (or if not) progress
 (or
  plans) are made to update the static HTML version of Wikipedia.
  BH photos just leaked the next generation of Archos portable media
 players.
  Unbelievably, the rumors of a 500GB version is true! This is already
  tempting (especially the price at $420). Just waiting for specs on
 September
  15, the Archos event. I really hope it will support NTFS so I can use the
  compression feature.
 
  It would be really cool and convenient to have an offline copy of
 Wikipedia
  anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
 
  BTW, does anyone know what is the size of the current static HTML English
  Wikipedia version uncompressed? Thanks.

 I don't think a static dump is the best way to keep wikipedia on your hd.


 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l



It is the only way actually.  Although I'm curious on what other ways one
can use to keep Wikipedia

Archos PMPs are not computers, but they do have the ability to go on the
Internet, and read an HTML file offline through the hard drive.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Have wikimedia lucene-search config been published?

2009-09-02 Thread Anon Sricharoenchai
Then, could you please publish lsearch-global.conf onto NOC?

On 8/28/09, Robert Stojnic rainma...@gmail.com wrote:

  We could publish lsearch-global.conf since it doesn't contain any
  private info, and there is only one. However, lsearch.conf is different
  for various searchers and contains passwords to access the OAI feed.

  r.


  Anon Sricharoenchai wrote:
   Where can I see the lucene-search configurations (lsearch.conf,
   lsearch-global.conf) of wikimedia?
   Could you published them in http://noc.wikimedia.org/conf/ ?
  

  ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
  
  


  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l


___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] How we can speed up double brace substitution, Was: how to chang {{SITENAME}}

2009-09-02 Thread Ilmari Karonen
Robert Rohde wrote:
 
 As I suggested yesterday, perhaps not very clearly, I think the
 sensible thing to do is to bypass the Parser on most calls to messages
 like Welcome to {{SITENAME}} by caching the post-transformed version
 of the message, e.g Welcome to Wikipedia, in the MessageCache
 instead.  This should be possible if the message has no context
 dependent substitutions (e.g. if the message avoids things like
 substitution variables and context dependent magic words like
 {{NAMESPACE}}).
 
 It would mean teaching the parser to look for context dependent
 elements and having some way to communicate that to the MessageCache
 to allow it to decide whether to cache the pre-transformed or
 post-transformed version.  It would also mean categorizing magic words
 to identify whether caching is allowed, but that seems like a
 straightforward extension of the TTL hinting already done in
 MagicWord.  The magic word caching hints could also be used to help
 decide how long the post-transformed version is likely to be good for.

My (somewhat vague) idea of how this could be done would be to add a new 
constant-folding mode to the parser which only does brace substitution 
for magic words that it knows to be essentially constant (and whose 
parameters, if any, can be folded down to something that contains no 
braces, numbered message parameters nor anything else potentially 
variable) and leaves everything else alone.

It's been quite some time since I last looked at the parser code in any 
detail, but surely that can't be *that* hard -- after all, we already 
have the pre-save transform which does something quite similar.

The localization cache can just run everything through the parser in 
constant-folding mode, cache the output, and then use the current check 
(which I think simply checks for the presence of {{) to determine if 
the message needs to be reparsed when it's actually used.

(Although Roan's suggestion of also folding things like {{CURRENTYEAR}} 
and passing their expiration time to the cache may also be worth 
considering.  We could then just treat e.g. {{SITENAME}} as having an 
infinite expiration time, and any truly uncacheable magic words as 
expiring immediately.)

-- 
Ilmari Karonen

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Manuel Schneider
Hi Chengbin, hi list,

static.wikimedia.org is currently not being updated and while the dumps 
processing has been assigned to and completely rewritten by Tomasz Finc 
(developer at WMF), there has not been made any assignment concerning HTML 
dumps.

We had a Wikipedia Offline meeting at Wikimania last week and discussed 
several issues. One issue is the fact, that WMF wants to see the ZIM file 
format being used for offline dumps and has suggested to include it into the 
regular dumping process.
So one question was: When will that happen, what is the status of WMF ZIM 
dumping? 
As ZIM uses HTML extracts Tomasz clarified that once static.wikimedia.org has 
been rebuild to be stable and sutainable, integrating ZIM would be trivial. 
But he also informed us that this task has not yet been assigned.

As Brion Vibber and Erik Möller have been at the meeting as well we hope that 
this assignment will be made soon and this task has got higher priority.

This said I may also advise you not to you use the pure HTML dumps but the ZIM 
files for your Archos, because that's what they are meant for.
A ZIM file containing all german Wikipedia articles (900,000) is 1,4 GB, an 
additional full text search index takes another 1 GB.

Greets,


Manuel



Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
 I bring this old issue up because I want to know if (or if not) progress
 (or plans) are made to update the static HTML version of Wikipedia.
 BH photos just leaked the next generation of Archos portable media
 players. Unbelievably, the rumors of a 500GB version is true! This is
 already tempting (especially the price at $420). Just waiting for specs on
 September 15, the Archos event. I really hope it will support NTFS so I can
 use the compression feature.

 It would be really cool and convenient to have an offline copy of Wikipedia
 anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?

 BTW, does anyone know what is the size of the current static HTML English
 Wikipedia version uncompressed? Thanks.
 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l



-- 
Regards
Manuel Schneider

Wikimedia CH - Verein zur Förderung Freien Wissens
Wikimedia CH - Association for the advancement of free knowledge
www.wikimedia.ch

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] How we can speed up double brace substitution, Was: how to chang {{SITENAME}}

2009-09-02 Thread Niklas Laxström
2009/9/2 Ilmari Karonen nos...@vyznev.net:
 (Although Roan's suggestion of also folding things like {{CURRENTYEAR}}
 and passing their expiration time to the cache may also be worth
 considering.  We could then just treat e.g. {{SITENAME}} as having an
 infinite expiration time, and any truly uncacheable magic words as
 expiring immediately.)

When using CDB, the cache is constant. You can't do incremental
updates. And what if WMF uses *single* cache for all projects? Then
{{SITENAME}} wouldn't be constant anymore, and we would not get better
performance where it is needed most. It could help other MediaWiki
installations a bit, just how much... I don't know.

-- 
Niklas Laxström

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Manuel Schneider
Hi Chengbin,

ZIM is an upcoming standard for using HTML contents offline. It is derived 
from the Zeno file format used on the german Wikipedia DVDs since 2006 (ZIM = 
Zeno IMproved).

There are currently several reader applications for it, for instance the 
zimreader made by the openZIM project or Kiwix.
There are some ports around like Kiwix on Windows and zimreader on openmoko / 
ARM.

The zimreader by openZIM works like a small webserver, it serves the contents 
of the ZIM file locally.

Once the HTML dump on static.wikimedia.org is fixed and ZIM file creation has 
been integrated you will be able to download fresh ZIM files of all Wikimedia 
projects directly from download.wikimedia.org.

Currently the Kiwix team has created some ZIM files and we try to build a ZIM 
file directory:
http://openzim.org/ZIM_File_Archive

ZIM actually stores the article text portion of the HTML output of the Wiki in 
a compressed cluster. It can hold also all type of other MIME types such as 
images, CSS files etc.
http://openzim.org/ZIM_File_Format

It is an open standard and has currently been developed and implemented by the 
openZIM team (sponsored by Wikimedia CH) in C++. There is a library (zimlib) 
which can be integrated in other reader or dumping applications to make them 
ZIM-aware.

Using the open documentation ZIM can be implemented in any other language as 
well.
The idea of ZIM is to make the data files freely interchangeable with any 
reader application. It is also flexible enough to store other works than only 
data from Wikipedia/MediaWiki. Then it tries to keep the reader application 
as simple and stupid as possible. There is only uncompression and HTML 
rendering to be done while a HTML renderer should be available on nearly all 
devices.

Greets,


Manuel


Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
 On Wed, Sep 2, 2009 at 8:13 AM, Manuel Schneider 

 manuel.schnei...@wikimedia.ch wrote:
  Hi Chengbin, hi list,
 
  static.wikimedia.org is currently not being updated and while the dumps
  processing has been assigned to and completely rewritten by Tomasz Finc
  (developer at WMF), there has not been made any assignment concerning
  HTML dumps.
 
  We had a Wikipedia Offline meeting at Wikimania last week and discussed
  several issues. One issue is the fact, that WMF wants to see the ZIM file
  format being used for offline dumps and has suggested to include it into
  the
  regular dumping process.
  So one question was: When will that happen, what is the status of WMF ZIM
  dumping?
  As ZIM uses HTML extracts Tomasz clarified that once
  static.wikimedia.orghas been rebuild to be stable and sutainable,
  integrating ZIM would be trivial. But he also informed us that this task
  has not yet been assigned.
 
  As Brion Vibber and Erik Möller have been at the meeting as well we hope
  that
  this assignment will be made soon and this task has got higher priority.
 
  This said I may also advise you not to you use the pure HTML dumps but
  the ZIM
  files for your Archos, because that's what they are meant for.
  A ZIM file containing all german Wikipedia articles (900,000) is 1,4 GB,
  an
  additional full text search index takes another 1 GB.
 
  Greets,
 
 
  Manuel
 
  Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
   I bring this old issue up because I want to know if (or if not)
   progress (or plans) are made to update the static HTML version of
   Wikipedia. BH photos just leaked the next generation of Archos
   portable media players. Unbelievably, the rumors of a 500GB version is
   true! This is already tempting (especially the price at $420). Just
   waiting for specs
 
  on
 
   September 15, the Archos event. I really hope it will support NTFS so I
 
  can
 
   use the compression feature.
  
   It would be really cool and convenient to have an offline copy of
 
  Wikipedia
 
   anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
  
   BTW, does anyone know what is the size of the current static HTML
   English Wikipedia version uncompressed? Thanks.
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
  --
  Regards
  Manuel Schneider
 
  Wikimedia CH - Verein zur Förderung Freien Wissens
  Wikimedia CH - Association for the advancement of free knowledge
  www.wikimedia.ch
 
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l

 I'm not familiar with the file extension .zim. What is that? Some sort of
 compressed html format like .chm? Where can I get a .zim file? I need to
 get check if this format is compatible with my Archos's Opera browser.
 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l



-- 
Regards

Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Chengbin Zheng
On Wed, Sep 2, 2009 at 8:45 AM, Manuel Schneider 
manuel.schnei...@wikimedia.ch wrote:

 Hi Chengbin,

 ZIM is an upcoming standard for using HTML contents offline. It is derived
 from the Zeno file format used on the german Wikipedia DVDs since 2006 (ZIM
 =
 Zeno IMproved).

 There are currently several reader applications for it, for instance the
 zimreader made by the openZIM project or Kiwix.
 There are some ports around like Kiwix on Windows and zimreader on openmoko
 /
 ARM.

 The zimreader by openZIM works like a small webserver, it serves the
 contents
 of the ZIM file locally.

 Once the HTML dump on static.wikimedia.org is fixed and ZIM file creation
 has
 been integrated you will be able to download fresh ZIM files of all
 Wikimedia
 projects directly from download.wikimedia.org.

 Currently the Kiwix team has created some ZIM files and we try to build a
 ZIM
 file directory:
 http://openzim.org/ZIM_File_Archive

 ZIM actually stores the article text portion of the HTML output of the Wiki
 in
 a compressed cluster. It can hold also all type of other MIME types such as
 images, CSS files etc.
 http://openzim.org/ZIM_File_Format

 It is an open standard and has currently been developed and implemented by
 the
 openZIM team (sponsored by Wikimedia CH) in C++. There is a library
 (zimlib)
 which can be integrated in other reader or dumping applications to make
 them
 ZIM-aware.

 Using the open documentation ZIM can be implemented in any other language
 as
 well.
 The idea of ZIM is to make the data files freely interchangeable with any
 reader application. It is also flexible enough to store other works than
 only
 data from Wikipedia/MediaWiki. Then it tries to keep the reader application
 as simple and stupid as possible. There is only uncompression and HTML
 rendering to be done while a HTML renderer should be available on nearly
 all
 devices.

 Greets,


 Manuel


 Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
  On Wed, Sep 2, 2009 at 8:13 AM, Manuel Schneider 
 
  manuel.schnei...@wikimedia.ch wrote:
   Hi Chengbin, hi list,
  
   static.wikimedia.org is currently not being updated and while the
 dumps
   processing has been assigned to and completely rewritten by Tomasz Finc
   (developer at WMF), there has not been made any assignment concerning
   HTML dumps.
  
   We had a Wikipedia Offline meeting at Wikimania last week and discussed
   several issues. One issue is the fact, that WMF wants to see the ZIM
 file
   format being used for offline dumps and has suggested to include it
 into
   the
   regular dumping process.
   So one question was: When will that happen, what is the status of WMF
 ZIM
   dumping?
   As ZIM uses HTML extracts Tomasz clarified that once
   static.wikimedia.orghas been rebuild to be stable and sutainable,
   integrating ZIM would be trivial. But he also informed us that this
 task
   has not yet been assigned.
  
   As Brion Vibber and Erik Möller have been at the meeting as well we
 hope
   that
   this assignment will be made soon and this task has got higher
 priority.
  
   This said I may also advise you not to you use the pure HTML dumps but
   the ZIM
   files for your Archos, because that's what they are meant for.
   A ZIM file containing all german Wikipedia articles (900,000) is 1,4
 GB,
   an
   additional full text search index takes another 1 GB.
  
   Greets,
  
  
   Manuel
  
   Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
I bring this old issue up because I want to know if (or if not)
progress (or plans) are made to update the static HTML version of
Wikipedia. BH photos just leaked the next generation of Archos
portable media players. Unbelievably, the rumors of a 500GB version
 is
true! This is already tempting (especially the price at $420). Just
waiting for specs
  
   on
  
September 15, the Archos event. I really hope it will support NTFS so
 I
  
   can
  
use the compression feature.
   
It would be really cool and convenient to have an offline copy of
  
   Wikipedia
  
anywhere I go without the need of Wi-Fi. What am I gonna do with
 500GB?
   
BTW, does anyone know what is the size of the current static HTML
English Wikipedia version uncompressed? Thanks.
___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
  
   --
   Regards
   Manuel Schneider
  
   Wikimedia CH - Verein zur Förderung Freien Wissens
   Wikimedia CH - Association for the advancement of free knowledge
   www.wikimedia.ch
  
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
  I'm not familiar with the file extension .zim. What is that? Some sort of
  compressed html format like .chm? Where can I get a .zim file? I need to
  get check if this format is 

Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Manuel Schneider
Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
 Well, as I said, Archos devices are not computers. They're merely portable
 video players with an internet browser. That's why I seek the static HTML
 version of Wikipedia.

I see. But maybe it is possible to install a reader or at least the zimreader 
as webserver, which can be used with the built-in browser. At least it won't 
take much ressources.

 Will there be easy extraction of zim to HTML? Extracting a dump is too
 difficult.

Of course it is possible, that's exactly what the zimreader does when serving 
pages. But as the HTML dump working on Wikimedia clusters is a requirement 
for ZIM file creation you can just go with the HTML dump from there as well.

Greets,


Manuel
-- 
Regards
Manuel Schneider

Wikimedia CH - Verein zur Förderung Freien Wissens
Wikimedia CH - Association for the advancement of free knowledge
www.wikimedia.ch

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


[Wikitech-l] 5 millions

2009-09-02 Thread zh509
Hi, everyone,

Wikimedia Commons, the media repository site used by Wikipedia, today just 
reached the 5 million media files milestone. Every one of these media files 
is available under a free license, such that anyone can use them for any 
purpose. Wikimedia Commons is the largest free media repository on the 
internet.

Zeyi He 

Wikimedia UK

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Gerard Meijssen
Hoi,
For you information Okawix is localised at translatewiki.net.
Thanks,
 GerardM

http://translatewiki.net/wiki/Translating:Okawix

2009/9/2 Manuel Schneider manuel.schnei...@wikimedia.ch

 Hi Chengbin, hi list,

 static.wikimedia.org is currently not being updated and while the dumps
 processing has been assigned to and completely rewritten by Tomasz Finc
 (developer at WMF), there has not been made any assignment concerning HTML
 dumps.

 We had a Wikipedia Offline meeting at Wikimania last week and discussed
 several issues. One issue is the fact, that WMF wants to see the ZIM file
 format being used for offline dumps and has suggested to include it into
 the
 regular dumping process.
 So one question was: When will that happen, what is the status of WMF ZIM
 dumping?
 As ZIM uses HTML extracts Tomasz clarified that once static.wikimedia.orghas
 been rebuild to be stable and sutainable, integrating ZIM would be trivial.
 But he also informed us that this task has not yet been assigned.

 As Brion Vibber and Erik Möller have been at the meeting as well we hope
 that
 this assignment will be made soon and this task has got higher priority.

 This said I may also advise you not to you use the pure HTML dumps but the
 ZIM
 files for your Archos, because that's what they are meant for.
 A ZIM file containing all german Wikipedia articles (900,000) is 1,4 GB,
 an
 additional full text search index takes another 1 GB.

 Greets,


 Manuel



 Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
  I bring this old issue up because I want to know if (or if not) progress
  (or plans) are made to update the static HTML version of Wikipedia.
  BH photos just leaked the next generation of Archos portable media
  players. Unbelievably, the rumors of a 500GB version is true! This is
  already tempting (especially the price at $420). Just waiting for specs
 on
  September 15, the Archos event. I really hope it will support NTFS so I
 can
  use the compression feature.
 
  It would be really cool and convenient to have an offline copy of
 Wikipedia
  anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
 
  BTW, does anyone know what is the size of the current static HTML English
  Wikipedia version uncompressed? Thanks.
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l



 --
 Regards
 Manuel Schneider

 Wikimedia CH - Verein zur Förderung Freien Wissens
 Wikimedia CH - Association for the advancement of free knowledge
 www.wikimedia.ch

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Manuel Schneider
I want to add that Okawix uses code from the pre-ZIM GPL'ed ZenoReader and 
ZenoWriter which has been developed by the openZIM team before we started 
ZIM, but they changed it to be incompatible with Zeno and ZIM.

So Okawix can be regarded to be as proprietary as well as also a GPL 
violation.

We have been contacted by Linterweb (the company behind Okawix) several times 
and we also invited them to the developers meeting, but actually they do not 
seem to be able for a collaboration with an open source community.
The Wikimedia Foundation had a similar experience when trying to work with 
them.

/Manuel


Am Mittwoch, 2. September 2009 schrieb Gerard Meijssen:
 Hoi,
 For you information Okawix is localised at translatewiki.net.
 Thanks,
  GerardM

 http://translatewiki.net/wiki/Translating:Okawix

 2009/9/2 Manuel Schneider manuel.schnei...@wikimedia.ch

  Hi Chengbin, hi list,
 
  static.wikimedia.org is currently not being updated and while the dumps
  processing has been assigned to and completely rewritten by Tomasz Finc
  (developer at WMF), there has not been made any assignment concerning
  HTML dumps.
 
  We had a Wikipedia Offline meeting at Wikimania last week and discussed
  several issues. One issue is the fact, that WMF wants to see the ZIM file
  format being used for offline dumps and has suggested to include it into
  the
  regular dumping process.
  So one question was: When will that happen, what is the status of WMF ZIM
  dumping?
  As ZIM uses HTML extracts Tomasz clarified that once
  static.wikimedia.orghas been rebuild to be stable and sutainable,
  integrating ZIM would be trivial. But he also informed us that this task
  has not yet been assigned.
 
  As Brion Vibber and Erik Möller have been at the meeting as well we hope
  that
  this assignment will be made soon and this task has got higher priority.
 
  This said I may also advise you not to you use the pure HTML dumps but
  the ZIM
  files for your Archos, because that's what they are meant for.
  A ZIM file containing all german Wikipedia articles (900,000) is 1,4 GB,
  an
  additional full text search index takes another 1 GB.
 
  Greets,
 
 
  Manuel
 
  Am Mittwoch, 2. September 2009 schrieb Chengbin Zheng:
   I bring this old issue up because I want to know if (or if not)
   progress (or plans) are made to update the static HTML version of
   Wikipedia. BH photos just leaked the next generation of Archos
   portable media players. Unbelievably, the rumors of a 500GB version is
   true! This is already tempting (especially the price at $420). Just
   waiting for specs
 
  on
 
   September 15, the Archos event. I really hope it will support NTFS so I
 
  can
 
   use the compression feature.
  
   It would be really cool and convenient to have an offline copy of
 
  Wikipedia
 
   anywhere I go without the need of Wi-Fi. What am I gonna do with 500GB?
  
   BTW, does anyone know what is the size of the current static HTML
   English Wikipedia version uncompressed? Thanks.
   ___
   Wikitech-l mailing list
   Wikitech-l@lists.wikimedia.org
   https://lists.wikimedia.org/mailman/listinfo/wikitech-l
 
  --
  Regards
  Manuel Schneider
 
  Wikimedia CH - Verein zur Förderung Freien Wissens
  Wikimedia CH - Association for the advancement of free knowledge
  www.wikimedia.ch
 
  ___
  Wikitech-l mailing list
  Wikitech-l@lists.wikimedia.org
  https://lists.wikimedia.org/mailman/listinfo/wikitech-l

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l



-- 
Regards
Manuel Schneider

Wikimedia CH - Verein zur Förderung Freien Wissens
Wikimedia CH - Association for the advancement of free knowledge
www.wikimedia.ch

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l

[Wikitech-l] Exclude entries from RecentChange and NewPages list

2009-09-02 Thread Uwe Baumbach
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1

Hello,

we try to exclude from
- - RSS/Atom feeds of RC and New Pages list generally and
- - Special Pages of RC and New Pages for users not in a custom user group

all entries or __at least__ the diffs and new page content for all pages
of a custom namespace.

Where is the right place to hook in and check the entries (or part of
content for them) to be potentially excluded?

Where can we configure Feed content/structure?

Uwe (Baumbach)
u.baumb...@web.de


Besuchen Sie den 61. Deutschen Genealogentag!
11.-14. September 2009
http://www.genealogentag.de
-BEGIN PGP SIGNATURE-
Version: GnuPG v1.4.7 (MingW32)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iD8DBQFKnoHNFEbayCH8zXkRAvbiAJ9O+rhrpVolxSscs0Xvl7QkndM7QwCg8oY8
vpHzObrmzt5mZlPGMwn0KJY=
=AnhX
-END PGP SIGNATURE-

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Roan Kattouw
2009/9/2 Gerard Meijssen gerard.meijs...@gmail.com:
 Hoi,
 Why do you say that it is proprietary and why do you state that there is a
 GPL violation ? Making accusations like this without providing evidence is
 not what I expect.

[snip]

 2009/9/2 Manuel Schneider manuel.schnei...@wikimedia.ch
 I want to add that Okawix uses code from the pre-ZIM GPL'ed ZenoReader and
 ZenoWriter which has been developed by the openZIM team before we started
 ZIM, but they changed it to be incompatible with Zeno and ZIM.


Taking code from a GPLed project and putting it in a non-GPLed (or
non-GPL-compatible) project is a violation of the GPL.

Roan Kattouw (Catrope)

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Siebrand Mazeland
-Oorspronkelijk bericht-
Van: Roan Kattouw
Verzonden: woensdag 2 september 2009 18:10

 Taking code from a GPLed project and putting it in a non-GPLed (or
 non-GPL-compatible) project is a violation of the GPL.

Well, that is stating the obvious. The question is: what makes Okawix
violate the GPL license? Or was it slander?

Siebrand


___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] Any news to update static HTML Wikipedia?

2009-09-02 Thread Gerard Meijssen
Hoi,
I referred to a place where the code can be found. The code states that it
is GPL code. So what are you saying... That it is not ???
Thanks,
   GerardM

2009/9/2 Roan Kattouw roan.katt...@gmail.com

 2009/9/2 Gerard Meijssen gerard.meijs...@gmail.com:
  Hoi,
  Why do you say that it is proprietary and why do you state that there is
 a
  GPL violation ? Making accusations like this without providing evidence
 is
  not what I expect.

 [snip]

  2009/9/2 Manuel Schneider manuel.schnei...@wikimedia.ch
  I want to add that Okawix uses code from the pre-ZIM GPL'ed ZenoReader
 and
  ZenoWriter which has been developed by the openZIM team before we
 started
  ZIM, but they changed it to be incompatible with Zeno and ZIM.
 

 Taking code from a GPLed project and putting it in a non-GPLed (or
 non-GPL-compatible) project is a violation of the GPL.

 Roan Kattouw (Catrope)

 ___
 Wikitech-l mailing list
 Wikitech-l@lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikitech-l

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] 5 millions

2009-09-02 Thread Robert Rohde
On Wed, Sep 2, 2009 at 6:14 AM, zh...@york.ac.uk wrote:
 Hi, everyone,

 Wikimedia Commons, the media repository site used by Wikipedia, today just
 reached the 5 million media files milestone. Every one of these media files
 is available under a free license, such that anyone can use them for any
 purpose. Wikimedia Commons is the largest free media repository on the
 internet.

Not counting the ~5000 non-free files that are currently identified as
Copyright by Wikimedia.

And not worrying about the 18M CC-BY / CC-BY-SA images on Flickr
(which is arguably still the largest free content image repository,
though that's not the only way Flickr is used).

-Robert Rohde

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] How we can speed up double brace substitution, Was: how to chang {{SITENAME}}

2009-09-02 Thread Platonides
Niklas Laxström wrote:
 Localisation cache caches only static content, not in-wiki
 customisations. On the other hand, it needs some trickery if the set
 of extensions differ between wikis, but should still be possible. But
 I'm just guessing, I don't know how they are going to set it up.

How is it going to improve the current message files?


___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l


Re: [Wikitech-l] How we can speed up double brace substitution, Was: how to chang {{SITENAME}}

2009-09-02 Thread Aryeh Gregor
On Wed, Sep 2, 2009 at 8:34 AM, Niklas
Laxströmniklas.laxst...@gmail.com wrote:
 When using CDB, the cache is constant. You can't do incremental
 updates.

You can, you just have to write an entirely new database every time.
This will already have to be done every time the messages change.
Changes to {{SITENAME}} should be much rarer, so it shouldn't be a big
burden.

 And what if WMF uses *single* cache for all projects?

If the cache is only for default messages (is it?), then of course it
doesn't help here.  We could still use memcached.

On Wed, Sep 2, 2009 at 6:51 PM, Platonidesplatoni...@gmail.com wrote:
 How is it going to improve the current message files?

Because it would be able to read only the messages it needed and not
have to execute megabytes of useless PHP code on every request, I
assume.

___
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l