https://bugzilla.wikimedia.org/show_bug.cgi?id=34568

--- Comment #4 from Aaron Schulz <aschulz4...@gmail.com> 2012-07-14 03:35:45 UTC ---
(In reply to comment #3)
> PendingChanges uses either the flaggedpages table (pending items are indexed)
> or the flaggedpages_pending table (which is a denormalization). Both work well
> for this purpose and don't require scanning massive portions of the page
> table. The number of pending pages tends to range from zero to a few dozen
> thousand, so filtering by namespace is easy.
> 
> The only way to do this well is to page on page_id rather than page_title. It
> would be confusing to page differently depending on whether a namespace is
> provided, so we would always have to page by page_id, which would be slower if
> I specify a namespace that only gets a small portion of edits.
> 
> UnreviewedPages is really only useful for getting really old pages that still
> haven't been reviewed. It seems like NewPages would be more useful (filtered
> for unpatrolled). Maybe that can have a category selector?

Another problem with paging on page_id is that it increases the average number
of rows to be scanned when "oldest" is selected, since older pages are more
likely to be reviewed (whereas paging on page_title is more random).

Another trick would be to still page on page_title but do a UNION query (like
recentchanges) on all reviewable namespaces. Given how many rows this special
page already has to scan, I'd be a bit hesitant to do that either, though it
seems to still be fairly fast on dewiki.
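
A rough sketch of the UNION idea, for illustration only. It is not the actual
FlaggedRevs query. It assumes a stripped-down schema (page plus flaggedpages,
where a page with no flaggedpages row counts as unreviewed) and uses SQLite so
it can run standalone. The function names make_demo_db and unreviewed_pages are
made up for this example. Each reviewable namespace gets its own small
title-ordered, limited subquery, and the outer query merges them and keeps the
first "limit" rows.

```python
import sqlite3


def make_demo_db():
    """Build a tiny in-memory stand-in for the page/flaggedpages tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE page (
            page_id INTEGER PRIMARY KEY,
            page_namespace INTEGER NOT NULL,
            page_title TEXT NOT NULL
        );
        CREATE INDEX name_title ON page (page_namespace, page_title);
        CREATE TABLE flaggedpages (fp_page_id INTEGER PRIMARY KEY);
        """
    )
    conn.executemany(
        "INSERT INTO page VALUES (?, ?, ?)",
        [
            (1, 0, "Apple"),
            (2, 0, "Cherry"),
            (3, 10, "Banana"),
            (4, 10, "Date"),
            (5, 2, "Elder"),  # namespace 2 is treated as not reviewable below
            (6, 0, "Fig"),
        ],
    )
    # Only "Cherry" has been reviewed in this demo data.
    conn.execute("INSERT INTO flaggedpages VALUES (2)")
    return conn


def unreviewed_pages(conn, namespaces, after_title, limit):
    """Page on page_title across several namespaces with one UNION ALL query.

    Simplification: the cursor is the title alone, so two pages with the same
    title in different namespaces could be split badly across a page boundary.
    """
    # One branch per namespace; each can walk the (namespace, title) index and
    # stop after `limit` matching rows.
    branch = (
        "SELECT page_namespace, page_title FROM ("
        " SELECT page_namespace, page_title FROM page"
        " LEFT JOIN flaggedpages ON fp_page_id = page_id"
        " WHERE page_namespace = ? AND page_title > ?"
        " AND fp_page_id IS NULL"
        " ORDER BY page_title LIMIT ?)"
    )
    sql = " UNION ALL ".join([branch] * len(namespaces))
    sql += " ORDER BY page_title, page_namespace LIMIT ?"

    params = []
    for ns in namespaces:
        params += [ns, after_title, limit]
    params.append(limit)
    return conn.execute(sql, params).fetchall()


if __name__ == "__main__":
    db = make_demo_db()
    first = unreviewed_pages(db, [0, 10], "", 2)
    print(first)
    # Use the last title of the previous page as the cursor for the next one.
    print(unreviewed_pages(db, [0, 10], first[-1][1], 2))
```

Each branch still has to skip over reviewed rows until it finds enough
unreviewed ones, so the scanning concern above applies per namespace.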
