Hi,
I am wondering if someone could help me create a list of page titles
and a list of redirect pages from the wikidb database in MySQL. I have
downloaded the XML/SQL dumps, along with the SQL dumps of the remaining
tables, and imported them into the wikidb database in MySQL. (This is the
English version of Wikipedia.)
        
I am looking at the MediaWiki database schema at
http://upload.wikimedia.org/wikipedia/commons/4/41/Mediawiki-database-schema.png

I also know that a list of titles in NS0 is provided along with the
dumps. This does not suit my purpose, because I would like to have all
the titles, i.e. including those outside NS0. I would also like to have
the list of redirects and where they redirect to.

The MediaWiki database schema mentioned above does provide a lot of
information, but I am not experienced enough to make use of it. I hope
some of you can help me.

For the list of page titles, I looked at the page table and simply
selected all of the titles. In SQL I assumed this would be the result of
the query "select page_title from wikidb.page;". The problem is that a
number of titles come back more than once, so I think I am doing
something wrong.
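
One guess on my part, which I have not verified: the repeats might be the same title appearing in more than one namespace. The sketch below is how I would check that. It assumes the page table has a page_namespace column, which is how I read the schema diagram; the LIMIT is only there to keep the output short.

```sql
-- Titles that occur more than once when the namespace is ignored.
-- Assumption: wikidb.page has a page_namespace column (read off the schema diagram).
SELECT page_title, COUNT(*) AS occurrences
FROM wikidb.page
GROUP BY page_title
HAVING COUNT(*) > 1
LIMIT 20;

-- If the guess is right, listing the namespace next to each title
-- should give rows that are no longer repeated.
SELECT page_namespace, page_title
FROM wikidb.page
ORDER BY page_namespace, page_title;
```

If the second query still shows repeated (page_namespace, page_title) pairs, then my guess is wrong and something else is going on, possibly with my import.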

As for the list of redirects, I hope someone can clarify whether the
way to do this is to go through the redirect table and, for each
rd_from, look up the matching page ID in the page table and take the
title from that row. This can be done using a join, but I have not
implemented it yet. Is this the correct way to go?
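
The join I have in mind would look roughly like the sketch below. It is untested, and apart from rd_from the column names (page_id, page_namespace, rd_namespace, rd_title) are my reading of the schema diagram, so please treat them as assumptions.

```sql
-- For each row in the redirect table, find the page it redirects from
-- and show the target it points to.
-- Assumptions: redirect has rd_from, rd_namespace, rd_title;
-- page has page_id, page_namespace, page_title.
SELECT p.page_namespace AS from_namespace,
       p.page_title     AS from_title,
       r.rd_namespace   AS to_namespace,
       r.rd_title       AS to_title
FROM wikidb.redirect AS r
JOIN wikidb.page AS p
  ON p.page_id = r.rd_from;
```

As I understand it, this would give the source title from the page table and the target title straight from the redirect table, without a second lookup in the page table.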

Thanks again to all you guys,
O. O.

