Anthony Ventresque (Dr) wrote:
Hi,
I am trying to build an offline version of the Wikipedia categorisation tree. As usual with Wikipedia projects, I have downloaded the dumps (the relevant one here is pages-articles.xml). I found that none of the dumps contains the relation between "Category:1960_works" and "Category:1960", which is present on the web page. The same holds for many other categories I tried: many links are missing from the dump but present on the web. Any idea why that is?
Thanks for your help, Anthony
Using page.sql.gz and categorylinks.sql.gz would be more efficient for your task.
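
A minimal sketch of how the two tables could be combined, in Python. It assumes the rows have already been extracted from the SQL dumps into tuples (the parsing step is not shown), and that the columns follow the commonly documented MediaWiki layout: page_id, page_namespace and page_title in page, and cl_from (the member page's id) and cl_to (the parent category's title, without the "Category:" prefix) in categorylinks. Newer dumps may use a different layout, so check the schema of your dump first. The function name build_category_tree and the sample rows are made up for illustration.

```python
from collections import defaultdict

CATEGORY_NS = 14  # namespace number MediaWiki uses for "Category:" pages


def build_category_tree(page_rows, categorylinks_rows):
    """Map each parent category title to the set of its subcategory titles.

    page_rows: iterable of (page_id, page_namespace, page_title)
    categorylinks_rows: iterable of (cl_from, cl_to)
    Both layouts are assumptions; adapt them to the dump you actually have.
    """
    # page_id -> title, restricted to pages that are themselves categories
    category_title = {
        page_id: title
        for page_id, namespace, title in page_rows
        if namespace == CATEGORY_NS
    }

    children = defaultdict(set)
    for cl_from, cl_to in categorylinks_rows:
        child = category_title.get(cl_from)
        if child is not None:  # skip articles, files, etc.
            children[cl_to].add(child)
    return dict(children)


# Toy rows for illustration only (ids and the article title are invented)
pages = [
    (1, 14, "1960"),
    (2, 14, "1960_works"),
    (3, 0, "Example_article"),
]
links = [
    (2, "1960"),        # Category:1960_works belongs to Category:1960
    (3, "1960_works"),  # an article, so it is ignored in the category tree
]

tree = build_category_tree(pages, links)
```

With the toy rows above, tree maps "1960" to a set containing "1960_works". Keeping only the rows whose member page is in namespace 14 yields the category-to-subcategory edges and leaves out the much larger set of article memberships.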