Hello,

I've attached questions about the dump files pages-meta-history and
pages-meta-current.

Sorry for the details, questions are marked with (1) and (2).

Please, let me know, if I misunderstood something.

Thanks a lot.

===============================

(1) Is the following statement correct? The dump file
"pages-meta-history" lists all versions (revisions) of each page and
the dump file "pages-meta-current-yyyymmdd" contains for each page
only one of these versions (revisions): the last version of the page
before the day yyyymmdd


I try to test this statement on a small wiki and download from iawiki
(interlingua) these two files:
iawiki-20090201-pages-meta-history.xml.7z
iawiki-20080308-pages-meta-current.xml.bz2

Now I search for the page "Actualitates" in both files:

$ cat iawiki-20080308-pages-meta-current.xml | grep \<title | grep Actualitates
    <title>Actualitates</title>
    <title>Wikipedia:Actualitates</title>
    <title>Categoria:Actualitates</title>
    <title>Discussion:Actualitates</title>

$ 7z e -so iawiki-20090201-pages-meta-history.xml.7z | grep \<title |
grep Actualitates
    <title>Portal:Actualitates</title>
    <title>Categoria:Actualitates</title>
    <title>Discussion Portal:Actualitates</title>

A look at http://ia.wikipedia.org/wiki/Actualitates reveals that the
page "Actualitates" existed earlier on iawiki, but it has been
deleted.


(2) Is the following statement correct? : The dump file
"pages-meta-history-yyyymmdd" contains the histories of ___exactly___
those (not more and not fewer) pages that exist on the selected day
(yyyymmdd).

===============================

_______________________________________________
MediaWiki-l mailing list
MediaWiki-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l

Reply via email to