DBpedia has a file [1] which contains all the data you need, albeit wrapped in Scala code. This file is generated by another Scala class [2] that calls the Wikipedia API URLs that John pointed out. You could modify that class to produce any format you like. If you don't know Scala, you might be better off writing the code from scratch though.
JC
[1] https://github.com/dbpedia/extraction-framework/blob/dump/core/src/main/scal... [2] https://github.com/dbpedia/extraction-framework/blob/dump/core/src/main/scal...
On 3 July 2013 00:18, Byrial Jensen byrial@vip.cybercity.dk wrote:
At 02-07-2013 21:49, John wrote:
http://en.wikipedia.org/w/api.php?action=query&meta=siteinfo&siprop=...
should be what you need.
Thank you. If I change to a nonformatted format like xml or json, and repeat for all 286 Wikipedias (first, later I may also need other projects) it will do with some work. It is good I have curl to help here ...
Regards,
- Byrial
Xmldatadumps-l mailing list Xmldatadumps-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l