Di (rut) wrote:
Dear All,
I am a PhD student at the University of Copenhagen, studying Distributed Cognition in the Wiki(Pedia) Article. I need to be able to get down some data but I was told (and seen) that the english dump has been broken for a long time.
So, here are few questions for the beloved tech-people:
how does one get a dump of en.wikipedia?
is there a place with historical dumps? Every year or so?
how difficult would it be to make an easy way to download one article's
*whole* history? (does it have to be necessarily 100 edits at a time? even if I only need one article, once?)
Big thanks in advance, rut jesus
I have a copy of enwiki-20070716-pages-meta-history.xml.7z the latest working enwiki history dump. The file is 3,22 Gb i could send it to you or maybe a subset of articles...