2009/10/23 Jona Christopher Sahnwaldt jcsahnwaldt@gmail.com:
Because of result count restrictions, these queries don't return all ISO language codes extracted by DBpedia, but I think they give a good impression of the data quality and coverage (or sometimes lack thereof):
http://dbpedia.org/sparql?query=select+distinct+%3Fs%2C+%3Fo+where%7B%3Fs+%3...
http://dbpedia.org/sparql?default-graph-uri=http%3A%2F%2Fdbpedia.org&que...
This is really amazing and I think I'm going to be learning the query language and possibly spending some time with dbpedia. Have you thought about doing the same for Wiktionary?
Andrew Dunbar (hippietrail)
On Fri, Oct 23, 2009 at 18:20, Andrew Dunbar hippytrail@gmail.com wrote:
2009/10/23 Aryeh Gregor Simetrical+wikilist@gmail.com:
On Fri, Oct 23, 2009 at 8:27 AM, Andrew Dunbar hippytrail@gmail.com wrote:
Yes I found how to get it through the API now. It was actually just the Toolserver database that was intractably slow.
There's nothing slow about the TS database here:
mysql> pager true PAGER set to 'true' mysql> SELECT tl_from FROM templatelinks WHERE tl_namespace=10 AND tl_title IN ('Infobox_Language', 'Infobox_language'); 3144 rows in set (0.12 sec)
Your query might have been what was slow.
Yes I didn't specify tl_namespace and when I check for which columns have keys I could see none: mysql> describe templatelinks; +--------------+-----------------+------+-----+---------+-------+ | Field | Type | Null | Key | Default | Extra | +--------------+-----------------+------+-----+---------+-------+ | tl_from | int(8) unsigned | NO | | 0 | | | tl_namespace | int(11) | NO | | 0 | | | tl_title | varchar(255) | NO | | | | +--------------+-----------------+------+-----+---------+-------+ 3 rows in set (0.01 sec)
But I don't know much about databases and SQL...
I have reached an important milestone of extracting all the name value pairs for language infobox ISO 639 language codes and language family string by the way.
But the values still need some work before I can try to match them against ISO 639-5 language family codes which is my ultimate goal.
Thanks for all the tips.
Andrew Dunbar (hippietrail)
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-- http://wiktionarydev.leuksman.com http://linguaphile.sf.net
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l