2009/10/23 Jona Christopher Sahnwaldt <jcsahnwa...@gmail.com>:
> Because of result count restrictions, these queries don't
> return all ISO language codes extracted by DBpedia,
> but I think they give a good impression of the data quality
> and coverage (or sometimes lack thereof):
>
> http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}
>
> http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&query=select+distinct+%3Fs%2C+%3Fo+where{%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+.%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}
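
The percent-encoded URLs quoted above are hard to read as they stand. As a minimal sketch (the helper name `sparql_from_url` is made up for illustration and is not part of DBpedia or any library), the SPARQL text can be recovered from the first URL with Python's standard `urllib.parse`, without contacting the endpoint:

```python
from urllib.parse import parse_qs, urlsplit

# First query URL from the quoted message, split across lines for readability.
QUERY_URL = (
    "http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where{"
    "%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2Fiso%3E+%3Fo+."
    "%3Fs+%3Chttp%3A%2F%2Fdbpedia.org%2Fproperty%2FwikiPageUsesTemplate%3E+"
    "%3Chttp%3A%2F%2Fdbpedia.org%2Fresource%2FTemplate%3Ainfobox_language%3E+.}"
)


def sparql_from_url(url: str) -> str:
    """Return the decoded SPARQL text carried in the URL's `query` parameter."""
    # parse_qs turns '+' into spaces and undoes the %XX escapes.
    params = parse_qs(urlsplit(url).query)
    return params["query"][0]


decoded = sparql_from_url(QUERY_URL)
print(decoded)
```

Decoded, the query asks for every subject that has a `http://dbpedia.org/property/iso` value and also uses the `Template:infobox_language` template.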

This is really amazing, and I think I'm going to learn the query
language and possibly spend some time with DBpedia. Have you
thought about doing the same for Wiktionary?

Andrew Dunbar (hippietrail)

> On Fri, Oct 23, 2009 at 18:20, Andrew Dunbar <hippytr...@gmail.com> wrote:
>> 2009/10/23 Aryeh Gregor <simetrical+wikil...@gmail.com>:
>>> On Fri, Oct 23, 2009 at 8:27 AM, Andrew Dunbar <hippytr...@gmail.com> wrote:
>>>> Yes I found how to get it through the API now. It was actually just
>>>> the Toolserver database that was intractably slow.
>>>
>>> There's nothing slow about the TS database here:
>>>
>>> mysql> pager true
>>> PAGER set to 'true'
>>> mysql> SELECT tl_from FROM templatelinks WHERE tl_namespace=10 AND
>>> tl_title IN ('Infobox_Language', 'Infobox_language');
>>> 3144 rows in set (0.12 sec)
>>>
>>> Your query might have been what was slow.
>>
>> Yes, I didn't specify tl_namespace, and when I checked which columns
>> have keys I could see none:
>> mysql> describe templatelinks;
>> +--------------+-----------------+------+-----+---------+-------+
>> | Field        | Type            | Null | Key | Default | Extra |
>> +--------------+-----------------+------+-----+---------+-------+
>> | tl_from      | int(8) unsigned | NO   |     | 0       |       |
>> | tl_namespace | int(11)         | NO   |     | 0       |       |
>> | tl_title     | varchar(255)    | NO   |     |         |       |
>> +--------------+-----------------+------+-----+---------+-------+
>> 3 rows in set (0.01 sec)
>>
>> But I don't know much about databases and SQL...
>>
>> By the way, I have reached an important milestone: extracting all the
>> name-value pairs for the ISO 639 language codes and language family
>> strings from the language infoboxes.
>>
>> But the values still need some work before I can try to match them
>> against ISO 639-5 language family codes, which is my ultimate goal.
>>
>> Thanks for all the tips.
>>
>> Andrew Dunbar (hippietrail)
>>
>>> _______________________________________________
>>> Wikitech-l mailing list
>>> Wikitech-l@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/wikitech-l
>>>
>>
>>
>>
>> --
>> http://wiktionarydev.leuksman.com http://linguaphile.sf.net
>>
>
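
The quoted exchange suggests the slow query was the one that left out tl_namespace. A rough sketch of why that can matter, using SQLite from Python's standard library as a stand-in for MySQL: it assumes the table has a composite index whose leading column is tl_namespace. That index and its name `idx_ns_title` are assumptions for illustration only; the DESCRIBE output quoted above lists no keys, and the column types are simplified.

```python
import sqlite3

conn = sqlite3.connect(":memory:")

# Simplified copy of the three columns shown in the DESCRIBE output.
conn.execute(
    "CREATE TABLE templatelinks ("
    " tl_from INTEGER NOT NULL DEFAULT 0,"
    " tl_namespace INTEGER NOT NULL DEFAULT 0,"
    " tl_title TEXT NOT NULL DEFAULT '')"
)

# Hypothetical composite index; tl_namespace is its leading column.
conn.execute(
    "CREATE INDEX idx_ns_title"
    " ON templatelinks (tl_namespace, tl_title, tl_from)"
)


def plan(sql: str) -> str:
    """Return SQLite's query-plan description for the given statement."""
    rows = conn.execute("EXPLAIN QUERY PLAN " + sql).fetchall()
    # The human-readable detail is the last column of each plan row.
    return " | ".join(row[-1] for row in rows)


TITLES = "('Infobox_Language', 'Infobox_language')"

# Same shape as the fast query in the thread: namespace and title given.
with_ns = plan(
    "SELECT tl_from FROM templatelinks"
    " WHERE tl_namespace = 10 AND tl_title IN " + TITLES
)

# Same query without the tl_namespace condition.
without_ns = plan(
    "SELECT tl_from FROM templatelinks WHERE tl_title IN " + TITLES
)

print("with tl_namespace:   ", with_ns)
print("without tl_namespace:", without_ns)
```

With the namespace condition the planner can search the index on its leading column; without it, it has to walk every entry. MySQL's optimizer differs in detail, so this only illustrates the general leading-column behaviour, not the Toolserver's actual plan.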



-- 
http://wiktionarydev.leuksman.com http://linguaphile.sf.net

