On 10/1/06, Dean Buckner <[EMAIL PROTECTED]> wrote: > Darren Duncan writes > >>>>>> > Hm, it occurs to me that sometime there should be an analysis of how > much entertainment related articles are in Wikipedia vs articles that > are not entertainment related. > > There are Wikipedia 'categories' that allow you to do this, but the > problem is that the high-level, general categories are not on the > articles themselves. There is an internal hierarchy that guarantees > that 'London philosophers' belongs to 'English philosophers', and that > this belongs to 'Philosophy'. So in theory a call to Wikipedia's > database should allow you to get a list of all philosophy-related > articles within the hierarchy. > > But I don't know how to do this. Maybe it is not possible. I know for > sure it is NOT possible to do logical searches like all British > philosophers except London philosophers, because a lot of Wikipedia > editors wanted this facility, and the developers said it was not > possible without enhancements > > That is a major obstacle to finding out what is actually IN Wikipedia. > > Edward Buckner
The database software Wikipedia uses (MySQL) does not provide such searches (DB people call them intersections) rapidly. Debate is rather brisk on the wikitech mailing list. When we import dumps, we can import them into other databases with better indexing features such as Postgresql or commercial ones like MSSQL. We can run queries against the dumps once imported into temporary databases - just provide a criteria and some of us can give you the analysis you seek. -jtp _______________________________________________ Citizendium-l mailing list [email protected] https://lists.purdue.edu/mailman/listinfo/citizendium-l
