Azu wrote:
Hello. There are many different archives available for download from Wikipedia, so I am not sure which one to get.
Please tell me which one has a list of page titles and their categories.
For example:
Sit-Verb, Cat-Noun-Animal, Him-Pronoun, Dog-Noun-Animal, Happy-Adjective, Washington-Noun-Place, Metallica-Noun-Band
What is the URL to the archive that I need to download to get this kind of information?
I don't care what format it is in (XML, CSV, SQL), as long as it's parseable.
I don't know the URL off the top of my head, but you'll probably want to download the database dump (which is in SQL format). It contains a 'categorylinks' table, which lists all category associations.
Roan Kattouw (Catrope)
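
As a rough illustration of how the 'categorylinks' table could be turned into a title-to-category list, here is a minimal sketch. It assumes the table has been loaded alongside the 'page' table, and that the columns follow the classic MediaWiki layout (page_id, page_namespace, page_title, cl_from, cl_to); newer schema versions may differ, so check the dump you actually download. The sample rows and the helper names (build_sample_db, titles_with_categories) are made up for the example, and SQLite stands in for whatever database you import the dump into.

```python
import sqlite3


def build_sample_db():
    """Create an in-memory database with hypothetical sample rows.

    The column names are assumed from the classic MediaWiki schema;
    a real dump would be imported instead of these hand-written rows.
    """
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE page (
            page_id INTEGER PRIMARY KEY,
            page_namespace INTEGER,
            page_title TEXT
        );
        CREATE TABLE categorylinks (
            cl_from INTEGER,  -- page_id of the page that is in the category
            cl_to TEXT        -- category title
        );
        """
    )
    conn.executemany(
        "INSERT INTO page VALUES (?, ?, ?)",
        [(1, 0, "Dog"), (2, 0, "Metallica"), (3, 14, "Animals")],
    )
    conn.executemany(
        "INSERT INTO categorylinks VALUES (?, ?)",
        [(1, "Animals"), (1, "Pets"), (2, "Bands")],
    )
    return conn


def titles_with_categories(conn):
    """Return {page_title: [category, ...]} for article pages (namespace 0)."""
    rows = conn.execute(
        "SELECT p.page_title, c.cl_to "
        "FROM page AS p "
        "JOIN categorylinks AS c ON c.cl_from = p.page_id "
        "WHERE p.page_namespace = 0 "
        "ORDER BY p.page_title, c.cl_to"
    )
    result = {}
    for title, category in rows:
        result.setdefault(title, []).append(category)
    return result


if __name__ == "__main__":
    for title, categories in titles_with_categories(build_sample_db()).items():
        print(title + "-" + "-".join(categories))
```

Note that Wikipedia categories are topical (for example "Bands" or "Animals"), so a join like this would not by itself give part-of-speech labels such as Verb or Pronoun.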