I have a question on image table from wikipedia DB. Currently this is what is available from wiki dumps: dumps.wikimedia.org/enwiki/latest/enwiki-latest-image.sql.gz regarding populating image table content. When imported it only populates 800000 * records which is clearly not the info for all en wikipedia images. Is this possible to get metadata for all the images for English wikipedia?
On 28/02/13 15:25, Ewa Szwed wrote:
I have a question on image table from wikipedia DB. Currently this is what is available from wiki dumps: dumps.wikimedia.org/enwiki/latest/enwiki-latest-image.sql.gz http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-image.sql.gz regarding populating image table content. When imported it only populates 800000 * records which is clearly not the info for all en wikipedia images. Is this possible to get metadata for all the images for English wikipedia?
They are the enwikipedia files: http://en.wikipedia.org/wiki/Special:Statistics → Uploaded files 803,985
What you are probably missing are the images uploaded on Wikimedia Commons, which can be automatically used on all WMF projects.
So you may want to complement the enwiki specific file list with: http://dumps.wikimedia.org/commonswiki/latest/commonswiki-latest-image.sql.g...
(Warning: it's a large file, 16M entries)
mediawiki-api@lists.wikimedia.org