After running mwdumper to strip the NS_MEDIAWIKI namespace entries from the 20070206 SQL dumps, the filtered XML file that mwdumper writes causes two problems, including SQL syntax errors on re-import:
1. The output files cannot be used by mwimport, because mwdumper modifies the text labels for XML types, etc., to the extent that mwimport can no longer read the dump.
2. If you take the XML file output by mwdumper and attempt to re-import it into an empty database with mwdumper, it produces corrupted SQL statements and fails. It does process about 720,000 articles before failing, however.

Output from the MySQL error log follows.
ERROR 1062 (23000) at line 15327: Duplicate entry '70473566' for key 1
ERROR 1064 (42000) at line 15328: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near ''== Greatest Common Factor / Least Common Multiple Problem ==\n\nI recently went' at line 1
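One way to narrow down the first error is to check whether the filtered XML file itself contains repeated revision ids. The following is only a diagnostic sketch in Python, not part of mwdumper or mwimport. It assumes that the duplicate key '70473566' corresponds to a revision <id> element in the dump, which the error log does not confirm. The function name find_duplicate_revision_ids is made up for this example.

```python
import sys
import xml.etree.ElementTree as ET
from collections import Counter


def local_name(tag):
    # Drop the "{namespace-uri}" prefix that ElementTree adds to tag names.
    return tag.rsplit("}", 1)[-1]


def find_duplicate_revision_ids(source):
    """Return revision ids that occur more than once in a MediaWiki XML dump.

    `source` is a file name or a binary file object. Assumption: the id of
    interest is the <id> that is a direct child of <revision>; ids nested
    inside <contributor> are deliberately ignored.
    """
    counts = Counter()
    for _event, elem in ET.iterparse(source, events=("end",)):
        name = local_name(elem.tag)
        if name == "revision":
            for child in elem:  # direct children only
                if local_name(child.tag) == "id":
                    counts[(child.text or "").strip()] += 1
                    break
            elem.clear()  # free the revision text
        elif name == "page":
            elem.clear()
    return sorted(rid for rid, n in counts.items() if n > 1)


if __name__ == "__main__" and len(sys.argv) > 1:
    for rid in find_duplicate_revision_ids(sys.argv[1]):
        print(rid)
```

If this prints nothing for the filtered file, the duplicate is more likely introduced when the SQL is generated than present in the XML.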
Jeff