Hi Johannes,
Johannes Kroll wrote on 15-12-2013 16:27:
>> I would love to have some sort of dump or (even better) a central
>> service I can query. It should contain for all Wikimedia projects:
>> * Page links (page A links to page B)
> This should be in the pagelinks table in the database replica.
Maybe I wasn't clear. I know how MediaWiki works and which tables to
query [1], but the database isn't designed for recursive queries or for
crawling the links as a directed graph. Doing that against the replica
really kills performance and doesn't scale at all. You need a custom
setup for that.
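
To illustrate the kind of custom setup I mean, here is a minimal sketch in Python. It assumes the page links have already been exported as (source, target) pairs; the names build_graph and reachable_within and the sample edges are made up for this example and are not part of MediaWiki. The idea is to load the edges into an in-memory adjacency list once, so that each hop is a dictionary lookup rather than another round of queries against the database.

```python
from collections import defaultdict, deque


def build_graph(edges):
    """Build an adjacency list from (source, target) link pairs."""
    graph = defaultdict(set)
    for source, target in edges:
        graph[source].add(target)
    return graph


def reachable_within(graph, start, max_depth):
    """Breadth-first search: map each page reachable from `start`
    in at most `max_depth` hops to its hop distance."""
    depths = {start: 0}
    queue = deque([start])
    while queue:
        page = queue.popleft()
        if depths[page] == max_depth:
            continue  # do not expand beyond the depth limit
        for target in graph.get(page, ()):
            if target not in depths:
                depths[target] = depths[page] + 1
                queue.append(target)
    return depths


# Hypothetical sample data: A -> B -> C -> A (a cycle), and C -> D.
edges = [("A", "B"), ("B", "C"), ("C", "A"), ("C", "D")]
graph = build_graph(edges)
print(reachable_within(graph, "A", 2))
```

This toy version only works when the whole edge list fits in memory on one machine; for all Wikimedia projects together you would need something more compact or a dedicated graph store, which is exactly why I am asking for a dump or a service.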
Maarten
[1] https://commons.wikimedia.org/wiki/File:MediaWiki_database_schema_latest.svg