Thinking that perhaps it's the revisions causing the problem, I have returned to Special:Export for "Diabetes_mellitus" and this time ticked:
Include only the current revision, not the full history Include templates Save as file
The output file is dramatically smaller, 280kb (due to not including revisions), however I'm still getting a similar error:
"3 pages (127.413/sec), 33 revs (127.413/sec)
ERROR 1062 (23000) at line 31: Duplicate entry '264148315' for key 1"
Dawson
On 15 Jan 2009, at 12:22, Daniel Kinzler wrote:
Dawson schrieb:
Hello,
I have used Special:Export at en.wikipedia to export "Diabetes_mellitus" and ticked the box "include templates" (I'm only really after the templates).
The resulting XML file is 40.1mb so I decided to go with mwdumper.js rather than Special:Import.
I'm working on a fresh build of mediawiki on my local system. When running the command:
java -jar mwdumper.jar --format=sql:1.5 Wikipedia-20090113203939.xml | mysql -u root -p wiki
It is returning the following error:
1 pages (0.102/sec), 1,000 revs (102.062/sec) ERROR 1062 (23000) at line 99: Duplicate entry '45970' for key 1
This happens when the XML dump contains the same page twice (or was it the same revision, even?). Which shouldn't happen. And if it happens, mwdumper shouldn't crash and burn.
I don't know a goos way around this, really, sorry. The question is: *why* does the dump include the same page twice? Is that legal in terms of the dump format? If yes, why can't mwdumper cope with it?
-- daniel
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l