Some of these images may be located on a remote wiki, In this case it would be Commons, so they won't have a local file description page in most cases. You will need to grab a copy of the commons dumps for these.
On Sat, Apr 6, 2013 at 7:21 AM, Keith Schacht krschacht@gmail.com wrote:
Hi, I've downloaded the latest set of wikimedia dumps. I'm trying to understand where to find images within these dumps. I've studied the database schema and it seems to make sense, but then I take a single example such as:
http://en.wikipedia.org/wiki/File:Carrizo_2a.JPG
And I grep the dumps 'image', 'imagelinks', and 'page' looking for 'Carrizo_2a.JPG' and it's not found. I tried this on both the SQL and XML dumps.
Are these dumps not complete? Am I misunderstanding the structure?
Thanks in advance, Keith
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l