Nemo is referring to dumpgenerator.py being broken on MediaWiki
versions above 1.20; it should not affect older MediaWiki versions.
You can safely continue with your grab. :)
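
On how to tell whether you are getting them all: one rough check is to count the downloaded titles per namespace and see whether any namespace stops at exactly 500. Below is a minimal sketch, not part of dumpgenerator.py. It assumes a titles list with one title per line and an "--END--" marker on the last line. The namespace prefixes and the function names are made up for illustration, so adjust them to the wiki you are grabbing.

```python
from collections import Counter

# Hypothetical namespace prefixes; every wiki defines its own set.
KNOWN_NAMESPACES = ("Talk", "User", "User talk", "File",
                    "Template", "Category", "Help")


def count_titles_per_namespace(titles, namespaces=KNOWN_NAMESPACES):
    """Count titles per namespace prefix; unprefixed titles go to '(main)'."""
    counts = Counter()
    for title in titles:
        title = title.strip()
        # Skip blank lines and the assumed end-of-file marker.
        if not title or title == "--END--":
            continue
        prefix, sep, _rest = title.partition(":")
        if sep and prefix in namespaces:
            counts[prefix] += 1
        else:
            # A colon in a main-namespace title is not a namespace prefix.
            counts["(main)"] += 1
    return counts


def suspicious_namespaces(counts, cap=500):
    """Return namespaces whose count equals the cap exactly."""
    return sorted(ns for ns, n in counts.items() if n == cap)
```

A namespace that lands on exactly 500 titles is only a hint that the list was cut off, not proof; comparing against the page count the wiki reports in Special:Statistics is a better cross-check.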
On Sat, Nov 10, 2012 at 12:45 PM, Scott Boyd <scottdb56(a)gmail.com> wrote:
At this link:
the bottom, there is an entry by project member nemowiki that states:
Comment 7 <https://code.google.com/p/wikiteam/issues/detail?id=56#c7> by
nemowiki <https://code.google.com/u/101255742639286016490/>, Today (9
Fixed by emijrp in r806
So does that mean this problem that "It's completely broken" is now fixed?
I'm running a huge download of 64K+ page titles, and am now using the
"r806" version of dumpgenerator.py. (The first 35K+ page titles were
downloaded with an older version.) Both versions sure seem to be
downloading MORE than 500 pages per namespace, but I'm not sure, since I
don't know how you can tell if you are getting them all...
So is it fixed or not?
On Fri, Nov 9, 2012 at 4:27 AM, Federico Leva (Nemo) <nemowiki(a)gmail.com> wrote:
It's completely broken:
It will download only a fraction of the wiki, 500 pages at most per
namespace.