2009/10/23 Jona Christopher Sahnwaldt <jcsahnwa...@gmail.com>:
> Because of result count restrictions, these queries don't
> return all ISO language codes extracted by DBpedia,
> but I think they give a good impression of the data quality
> and coverage (or sometimes lack thereof):
>
> http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}
>
> http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}
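
[Editor's sketch, not part of the original message.] The SPARQL in the links above is hard to read in its URL-encoded form. A minimal Python sketch for recovering the query text from the first link follows; the helper name `sparql_from_url` is hypothetical, and nothing is sent over the network.

```python
from urllib.parse import urlsplit, parse_qs

# First link quoted above, copied verbatim and split across lines for readability.
URL = (
    "http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where{"
    "%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+."
    "%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+"
    "%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}"
)


def sparql_from_url(url: str) -> str:
    """Return the decoded value of the `query` parameter (hypothetical helper)."""
    query_string = urlsplit(url).query
    # parse_qs turns '+' into spaces and decodes %XX escapes.
    params = parse_qs(query_string)
    return params["query"][0]


if __name__ == "__main__":
    print(sparql_from_url(URL))
```

Decoded, the query selects every subject that has a `http://dbpedia.org/property/iso` value and uses the `Template:infobox_language` template.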

This is really amazing and I think I'm going to be learning the query
language and possibly spending some time with dbpedia. Have you thought
about doing the same for Wiktionary?

Andrew Dunbar (hippietrail)

> On Fri, Oct 23, 2009 at 18:20, Andrew Dunbar <hippytr...@gmail.com> wrote:
>> 2009/10/23 Aryeh Gregor <simetrical+wikil...@gmail.com>:
>>> On Fri, Oct 23, 2009 at 8:27 AM, Andrew Dunbar <hippytr...@gmail.com> wrote:
>>>> Yes I found how to get it through the API now. It was actually just
>>>> the Toolserver database that was intractably slow.
>>>
>>> There's nothing slow about the TS database here:
>>>
>>> mysql> pager true
>>> PAGER set to 'true'
>>> mysql> SELECT tl_from FROM templatelinks WHERE tl_namespace=10 AND
>>> tl_title IN ('Infobox_Language', 'Infobox_language');
>>> 3144 rows in set (0.12 sec)
>>>
>>> Your query might have been what was slow.
>>
>> Yes I didn't specify tl_namespace and when I check for which columns
>> have keys I could see none:
>> mysql> describe templatelinks;
>> +--------------+-----------------+------+-----+---------+-------+
>> | Field        | Type            | Null | Key | Default | Extra |
>> +--------------+-----------------+------+-----+---------+-------+
>> | tl_from      | int(8) unsigned | NO   |     | 0       |       |
>> | tl_namespace | int(11)         | NO   |     | 0       |       |
>> | tl_title     | varchar(255)    | NO   |     |         |       |
>> +--------------+-----------------+------+-----+---------+-------+
>> 3 rows in set (0.01 sec)
>>
>> But I don't know much about databases and SQL...
>>
>> I have reached an important milestone of extracting all the name value
>> pairs for language infobox ISO 639 language codes and language family
>> string by the way.
>>
>> But the values still need some work before I can try to match them
>> against ISO 639-5 language family codes which is my ultimate goal.
>>
>> Thanks for all the tips.
>>
>> Andrew Dunbar (hippietrail)

--
http://wiktionarydev.leuksman.com http://linguaphile.sf.net

_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
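
[Editor's sketch, not part of the original thread.] The quoted exchange suggests that the templatelinks query was fast once `tl_namespace` was included in the WHERE clause. One plausible reason, stated here as an assumption and not confirmed anywhere in the thread, is a composite index whose leading column is `tl_namespace`. The Python sketch below illustrates the general effect using SQLite as a stand-in for MySQL; the index name `tl_namespace_idx` and the helper names are hypothetical, and the empty in-memory table only demonstrates the query planner's choice, not real timings.

```python
import sqlite3

# The query from the thread, and the same query without the namespace filter.
WITH_NAMESPACE = (
    "SELECT tl_from FROM templatelinks WHERE tl_namespace = 10 "
    "AND tl_title IN ('Infobox_Language', 'Infobox_language')"
)
WITHOUT_NAMESPACE = (
    "SELECT tl_from FROM templatelinks "
    "WHERE tl_title IN ('Infobox_Language', 'Infobox_language')"
)


def build_demo_db() -> sqlite3.Connection:
    """Create an empty in-memory table with the three columns shown in the thread."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE templatelinks ("
        "tl_from INTEGER NOT NULL, "
        "tl_namespace INTEGER NOT NULL, "
        "tl_title TEXT NOT NULL)"
    )
    # Assumed composite index; the leading column is tl_namespace.
    conn.execute(
        "CREATE INDEX tl_namespace_idx "
        "ON templatelinks (tl_namespace, tl_title, tl_from)"
    )
    return conn


def query_plan(conn: sqlite3.Connection, sql: str) -> str:
    """Return SQLite's plan description for a statement as one string."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    # The human-readable detail is the fourth column of each plan row.
    return " | ".join(row[3] for row in rows)


if __name__ == "__main__":
    conn = build_demo_db()
    print("with tl_namespace:   ", query_plan(conn, WITH_NAMESPACE))
    print("without tl_namespace:", query_plan(conn, WITHOUT_NAMESPACE))
```

With the namespace filter the planner can seek directly into the index; without it, the leading index column is unconstrained and the planner falls back to scanning every entry. MySQL's planner differs in detail, so treat this only as an illustration of the leading-column rule.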