https://bugzilla.wikimedia.org/show_bug.cgi?id=26223

--- Comment #2 from Ariel T. Glenn <ar...@wikimedia.org> 2011-01-24 18:21:50 
UTC ---
I had a look at these.  I checked en wp carefully, and all of the incidents
that aren't pretty old, before a certain date in 2008 (i.e. rev 242099935 on,
and that's most of them) are moves.  It turns out that this is true for most
revs that I spot checked on the other projects as well.

So what happens with these moves?  There are two revisions with the same move
recorded in the log and the history; however three of them make it into the
revision table.

Here's a sample from en wp:

rev_id    | rev_page | rev_text_id | rev_comment 
398410443 | 11005908 |   399836293 | moved [[Wikipedia:Tutorial
(Editing)/sandbox]] to [[Wikipedia:Tutorial/Editing/sandbox]]...
398410417 | 11005908 |   399836293 | moved [[Wikipedia:Tutorial
(Editing)/sandbox]] to [[Wikipedia:Tutorial/Editing)/sandbox]]...

those show up in the history, and they are the "good" ones, as they have a page
id attached.  The "bad" one is

398410444 |        0 |   399843549 | moved [[Wikipedia:Tutorial
(Editing)/sandbox]] to [[Wikipedia:Tutorial/Editing/sandbox]]...

I looked at a number of these and they all display the same characteristics:

the third rev is the bad one, it has the same time stamp as the previous one,
and its text content is the redirect left behind by the move. 

Ie. the revision length of the bad one in the above is 48 and the text content
is #REDIRECT [[Wikipedia:Tutorial/Editing/sandbox]]
where the rev length of the other two revisions is 2806 and they contain the
actual page content.

This move issue is an outstanding issue, that is, it is not due to the master
slave issue we had recently or any of that, That's clear from the timestamps,
which in the above example well predate that outage.  In case someone might
think that the revision used to have the page id once upon a time and that the
corruption occurred later, I checked the history dumps from July and Sept of
last year for a couple of these revisions with earlier time stamps, and the two
good ones in each case appeared in the file but not the bad one.  That makes me
pretty sure this is a failure at the time of the move, and probably still a bug
in the code running now.

I hope that's enough information for someone who knows the innards of the
move/delete stuff to hazard a guess at the problem.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug.
You are on the CC list for the bug.

_______________________________________________
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l

Reply via email to