[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2013-09-17 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Gabriel Wicke  changed:

   What|Removed |Added

 CC||gwi...@wikimedia.org
   See Also||https://bugzilla.wikimedia.
   ||org/show_bug.cgi?id=851

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2013-04-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Andre Klapper  changed:

   What|Removed |Added

   Keywords|bugsmash|

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are watching all bug changes.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2012-10-29 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Nemo_bis  changed:

   What|Removed |Added

 Blocks||41492

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2012-06-11 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

rai...@rillke.eu  changed:

   What|Removed |Added

 Blocks||37465

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-05-09 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Mark A. Hershberger  changed:

   What|Removed |Added

   Keywords||bugsmash

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-05-09 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Mark A. Hershberger  changed:

   What|Removed |Added

   Keywords|triage  |

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-05-06 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Mark A. Hershberger  changed:

   What|Removed |Added

   Keywords||triage
 CC||m...@everybody.org

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-04-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

John Mark Vandenberg  changed:

   What|Removed |Added

 Blocks||26122

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-04-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

John Mark Vandenberg  changed:

   What|Removed |Added

 Blocks||28047

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-04-20 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

John Mark Vandenberg  changed:

   What|Removed |Added

 Blocks||18493

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2011-03-21 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Dmcdevit  changed:

   What|Removed |Added

 Blocks||18840

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-07-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

--- Comment #12 from FT2  2010-07-09 04:02:57 UTC ---
As an interim option, can we at least have "&oldid=" work with
Special:Undelete?

Most deleted revisions have a revision id and the field is indexed. The few old
deleted revisions that don't have a revision id can easily be given one.

Revid/oldid is universally used everywhere else to identify a revision, except
when it comes to deleted revisions. It would be a fairly simple change to have
Mediawiki correctly handle links of the form:
http://en.wikipedia.org/w/index.php?title=Special:Undelete&oldid=12345

as an equally valid alternative to the existing:
http://en.wikipedia.org/w/index.php?title=Special:Undelete&target=PAGENAMEƗtamp=TIMESTAMP

Advantages:

1/ Whether a revision is deleted or undeleted the revisionid stays the same so
it can still be used to by admins to find the revision;

2/ Allows a failback to be added to Mediawiki that if &oldid= is given in a
link, and the revision isn't in the current revisions table, it can easily and
automatically be searched for in the deleted revisions table instead (and vice
versa), so diff links will "dead end" less often.

3/ It's simple to do and probably trivial on effort (if Special:Undelete args
include a page+timestamp look up the data by those, if the args are an oldid
then look up the data by that in the same table);

4/ Makes it easier to transition in future to using rev_id as the common index
field for revisions whether deleted or visible, which is a simplifying
direction for Mediawiki. Pagename/timestamp would continue to work so nothing
"breaks", but allowing oldid to work will make a future transition easier.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-07-08 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

--- Comment #11 from John Mark Vandenberg  2010-07-09 
00:17:27 UTC ---
*** Bug 23695 has been marked as a duplicate of this bug. ***

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-05-12 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Alexandre Emsenhuber [IAlex]  changed:

   What|Removed |Added

 Blocks||23489

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-02-15 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Guillaume Paumier  changed:

   What|Removed |Added

 CC||gpaum...@wikimedia.org

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are watching all bug changes.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-02-06 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

Aryeh Gregor  changed:

   What|Removed |Added

 CC||simetrical+wikib...@gmail.c
   ||om

--- Comment #10 from Aryeh Gregor  2010-02-06 
23:55:27 UTC ---
(In reply to comment #7)
> (The
> explanation for the other 185 k isn't clear. Delete/restore activity on old
> revisions??) 

AUTOINCREMENT columns aren't guaranteed to be allocated sequentially; values
can be skipped.  In particular, if a transaction inserts a row, then gets
rolled back rather than committed, there will be a gap, since the id is
assigned at insert time and not at commit time.  Autoincrementing is visible
immediately, even across transaction boundaries -- which would be a violation
of transactional semantics if id's were guaranteed to be sequential, but they
aren't.

(This probably doesn't account for *that* many missing revisions, though.)

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are watching all bug changes.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-02-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

OverlordQ  changed:

   What|Removed |Added

 CC||overlo...@gmail.com

--- Comment #9 from OverlordQ  2010-02-05 16:24:44 UTC ---
The duplicate revision id's I've opened as bug 22392, I put it in Wikimedia as
I wasn't 100% on the product.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are watching all bug changes.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-02-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

--- Comment #8 from Graham87  2010-02-05 16:04:24 UTC ---
The 185 k leftover revision ID's are probably due to the fact that the deletion
archive was cleared twice, in June 2004 and December 2003. It was created in
August 2002, so a negligible number of edits were permanently deleted. See:
http://en.wikipedia.org/wiki/Wikipedia:Viewing_and_restoring_deleted_pages#Deletion_archive

The numbers surprise me a bit. They seem to indicate that about three times as
many edits were deleted between June 2004 and June 2005 than in the entire
period up to June 2004. Perhaps revision ID numbers were reset or reused at
some point; It'd be best to ask Brion or Tim about this. The numbers are
particularly surprising because those old deleted revisions would presumably
include edits to the Wikipedia sandbox, which was routinely moved to bizarre
titles by newbies or outright deleted; a page move of a page with many
revisions in MW 1.4 and below was a much bigger disaster than it is now, and
move protection wasn't introduced until December 2004, see:
http://en.wikipedia.org/wiki/Wikipedia_talk:Protected_page/Archive2#Protection_against_page_moves

I once rescued some old sandbox revisions that go back to June 2004; the ones
before then were deleted and are now irretrievable. I used to believe that
there were about 50,000 irretrievable sandbox revisions, but with the numbers
you have presented here, that may be an exaggeration. For the old sandbox
edits, see:
http://en.wikipedia.org/wiki/Wikipedia:Historical_archive/Sandbox

Can I have an example of some revisions having the same ID in the deletion and
regular tables? That is very bizarre.

Out-of-order revision IDs cause problems for diffs; the number of intermediate
revisions is misreported (as that function works by rev ID), and since the
prev/next links in diffs are also ordered by rev ID, they are also affected.
See this diff as an example:
http://en.wikipedia.org/w/index.php?title=Talk:Netherlands&diff=11427366&oldid=229588957

More rarely, revisions have the opposite problem; they have the correct
revision ID vbut the wrong date. See bug 2219.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are watching all bug changes.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2010-02-05 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104

--- Comment #7 from FT2  2010-02-05 09:42:08 UTC ---
OverlordQ and I took a look at this on the toolserver. Some of this may be
obvious or well known - I don't know how much MediaWiki stuff from 2005 would
be "common knowledge".

The latest enwiki deleted revision with no rev_id is timestamped 20050627053602
(June 27 2005, 5.36 am) as Aaron and Graham say. 511728 deleted revisions have
no rev_id. 

(Around 2356 revisions also have an entry with the same rev_id in both current
and deleted revisions tables. This is presumably due to old data slippage.)

Deleted revisions from before June 2005 which were restored apparently got
allocated a new rev_id. Eg, compare the dates for enwiki revision id's 1570
(June 14 2005), 1571 (June 9 2005), 1630 (May 1 2005), and 1701
(Dec 8 2004). It doesn't seem to cause problems though.

There appear to have been around ~ 17,739,500 revisions on enwiki prior to the
changeover of June 2005. Because rev_ids were reallocated you have to go back
and forth by 50 or 100 at a time to get an idea what rev__id was reached at
roughly what sort of time. It turns out that there were about 17.74 m enwiki
revisions at the changeover.

Oversight and Developer deletions would have been negligible up to that point.
So in principle, there were approximately 17.7 m revisions prior to the
changeover of which 17,043,322 can be traced to "Live data", leaving 696k
revision ids untraced.

The conclusion is that the 511 k of old deleted revisions with rev_id = NULL
can be sequenced into the 17.7 m known rev_ids prior to the changeover, and
there are 696 k rev_ids of deleted revisions which they map into. (The
explanation for the other 185 k isn't clear. Delete/restore activity on old
revisions??) 

It looks like all the deleted revisions with a null value can be matched fairly
accurately by time order against existing gaps in the current revisions and
assigned a suitable rev_id that's currently not taken. It might not be perfect
but it'll be close, and allocating a time-sequenced old rev_id is probably
helpful for admins and the like.

Deletions are quite interspersed with undeleted revisions so this isn't a job
requiring human guesswork. It could be a once-off task suited to a script.

This would at least mean every deleted revision had a rev_id, which is a first
step in fixing the problems.

-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are watching all bug changes.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-10-28 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104





--- Comment #6 from Graham87   2009-10-28 10:21:57 UTC ---
Yeah, probably. If this is done, they'll most likely get a high rev ID as if
they're a new edit ... about 323,000,000 or so. It would be nice if they could
get their old rev_ID's from when they weren't deleted, but I don't think that's
possible.


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-10-28 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104





--- Comment #5 from FT2   2009-10-28 10:03:49 UTC ---
Presumably that can be fixed by undelete and redelete then? Or a once-off
script that fills that field?


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-10-27 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104


Graham87  changed:

   What|Removed |Added

 CC||graha...@jazi.net




--- Comment #4 from Graham87   2009-10-28 05:53:03 UTC ---
More specifically, any revs in the archive table that were deleted before
Wikipedia was upgraded to MediaWiki 1.5 in June 2005, which were never
undeleted after the upgrade, will have a nul ar_rev_id value.


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-10-25 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104





--- Comment #3 from FT2   2009-10-25 15:47:55 UTC ---
See also bug 21279.


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-03-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104





--- Comment #2 from Aaron Schulz   2009-03-22 13:47:13 
UTC ---
Note that many older revs have a NULL ar_rev_id value


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-03-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104


Aaron Schulz  changed:

   What|Removed |Added

 CC||jschulz_4...@msn.com
   Keywords||schema-change




-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 18104] Schema request change so deleted edits are identified by revisionID not timestamp (prevents DIFFs from breaking)

2009-03-22 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=18104


FT2  changed:

   What|Removed |Added

 CC||ft2.w...@gmail.com




--- Comment #1 from FT2   2009-03-22 13:21:02 UTC ---
(Note:  This is somewhat related to, but not the same as, bug 18068. 

They overlap in that in each case the request involves ability to view deleted
diffs, but in this case "traditional deletion" also identifies the diffs in a
manner that breaks linkage from diff (revisionID) to deleted diff, and makes it
very hard to compare two arbitrary diffs if one or both are "traditionally
deleted".


-- 
Configure bugmail: https://bugzilla.wikimedia.org/userprefs.cgi?tab=email
--- You are receiving this mail because: ---
You are the assignee for the bug.
You are on the CC list for the bug.

___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l