Thinking that perhaps it's the revisions causing the problem, I have
returned to Special:Export for "Diabetes_mellitus" and this time ticked:
  Include only the current revision, not the full history
  Include templates
  Save as file
The output file is dramatically smaller, 280 KB (because it no longer
includes old revisions), but I'm still getting a similar error:
"3 pages (127.413/sec), 33 revs (127.413/sec)
ERROR 1062 (23000) at line 31: Duplicate entry '264148315' for key 1"
Dawson
On 15 Jan 2009, at 12:22, Daniel Kinzler wrote:
Dawson schrieb:
Hello,
I have used Special:Export at en.wikipedia to export
"Diabetes_mellitus" and ticked the box "include templates" (I'm only
really after the templates).
The resulting XML file is 40.1 MB, so I decided to go with mwdumper.jar
rather than Special:Import.
I'm working on a fresh build of MediaWiki on my local system. When I
run the command:
  java -jar mwdumper.jar --format=sql:1.5 Wikipedia-20090113203939.xml | mysql -u root -p wiki
it returns the following error:
1 pages (0.102/sec), 1,000 revs (102.062/sec)
ERROR 1062 (23000) at line 99: Duplicate entry '45970' for key 1
This happens when the XML dump contains the same page twice (or was it
the same revision, even?). That shouldn't happen, and if it does,
mwdumper shouldn't crash and burn.
I don't know a good way around this, really, sorry. The question is:
*why* does the dump include the same page twice? Is that legal in terms
of the dump format? If yes, why can't mwdumper cope with it?
-- daniel
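
One way to test the "same page or revision twice" theory before importing
is to scan the export file for repeated IDs. The following is only a
minimal sketch in Python: the function names (find_duplicate_ids, local)
are made up for illustration and are not part of mwdumper or MediaWiki,
and it assumes the usual export layout in which each <page> and each
<revision> carries its own <id> as a direct child.

```python
import sys
import xml.etree.ElementTree as ET
from collections import Counter


def local(tag):
    """Strip the XML namespace, e.g. '{http://...}page' -> 'page'."""
    return tag.rsplit("}", 1)[-1]


def find_duplicate_ids(source):
    """Return (duplicate page ids, duplicate revision ids) in a dump.

    `source` is a filename or a binary file object. Hypothetical helper,
    assuming <id> is a direct child of <page> and of <revision>.
    """
    page_ids, rev_ids = Counter(), Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local(elem.tag)
        if name not in ("page", "revision"):
            continue
        counter = page_ids if name == "page" else rev_ids
        # Look at direct children only, so the <id> nested inside
        # <contributor> (a user id) is not counted.
        for child in elem:
            if local(child.tag) == "id":
                counter[child.text] += 1
        elem.clear()  # discard revision text so large dumps stay cheap

    def repeated(counter):
        return sorted(key for key, count in counter.items() if count > 1)

    return repeated(page_ids), repeated(rev_ids)


if __name__ == "__main__" and len(sys.argv) > 1:
    pages, revs = find_duplicate_ids(sys.argv[1])
    print("duplicate page ids:", pages or "none")
    print("duplicate revision ids:", revs or "none")
```

Saved under any name and run with the XML file as its only argument, this
lists the IDs that occur more than once. If both lists come back empty,
the dump itself is probably not the source of the duplicate key, and the
clash is more likely with rows already present in the target database.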
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l