Hi everyone,
I am trying to import the page table from the 20080312 dump and I am getting a duplicate entry error. Does anyone know if there is a problem with this dump, and if not could someone help me figure out what I'm doing wrong?
I am using the following command to import the file:
mysql -u wiki -p wiki20080312 < enwiki-20080312-page.sql
And received the following error:
ERROR 1062 (23000) at line 338: Duplicate entry '0-' for key 2
This seems to be the page_namespace + page_title key which is broken, so it looks like there are multiple pages with no title? Any ideas on how to get around this?
Mark
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mark Ferguson wrote:
ERROR 1062 (23000) at line 338: Duplicate entry '0-' for key 2
Check your encoding setup.
- -- brion vibber (brion @ wikimedia.org)
The character set on the table is UTF8, and I reran the command using the option --default-character-set=utf8 to be sure, but I'm still getting the same problem.
Mark
On Tue, Apr 15, 2008 at 12:31 PM, Brion Vibber brion@wikimedia.org wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mark Ferguson wrote:
ERROR 1062 (23000) at line 338: Duplicate entry '0-' for key 2
Check your encoding setup.
- -- brion vibber (brion @ wikimedia.org)
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.8 (Darwin) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkgE9HgACgkQwRnhpk1wk45p9wCbB1rkVCKImOuVyuf4GYagRxQU sAEAn3/MSoF53ErPho/FIziNkDyVVQlO =xQ+O -----END PGP SIGNATURE-----
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mark Ferguson wrote:
The character set on the table is UTF8, and I reran the command using the option --default-character-set=utf8 to be sure, but I'm still getting the same problem.
This might be due to 4-byte characters, which MySQL's UTF-8 mode doesn't support properly.
Try setting up the tables with the binary schema, which may help...
- -- brion
Hi Brion,
It worked to use the binary character set, thanks very much for the help!
Mark
On Tue, Apr 15, 2008 at 3:32 PM, Brion Vibber brion@wikimedia.org wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mark Ferguson wrote:
The character set on the table is UTF8, and I reran the command using
the
option --default-character-set=utf8 to be sure, but I'm still getting
the
same problem.
This might be due to 4-byte characters, which MySQL's UTF-8 mode doesn't support properly.
Try setting up the tables with the binary schema, which may help...
- -- brion
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.8 (Darwin) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkgFHu8ACgkQwRnhpk1wk45MwACdGapdCmeHj1DlksmhTjc8fbaU c8sAni7rnVecT6ZgFlG8oGcdbiubTngn =EpEL -----END PGP SIGNATURE-----
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
So... I'm guessing that the solution to some other encoding problems I'm seeing is to change all the tables on all my wikis from whatever default collation was used when the dbs were created to binary? I'm guessing that may be interesting...
Sometimes I think I should have stayed with mySQL4...
Jim
On Apr 15, 2008, at 4:32 PM, Brion Vibber wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mark Ferguson wrote:
The character set on the table is UTF8, and I reran the command using the option --default-character-set=utf8 to be sure, but I'm still getting the same problem.
This might be due to 4-byte characters, which MySQL's UTF-8 mode doesn't support properly.
Try setting up the tables with the binary schema, which may help...
- -- brion
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.8 (Darwin) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkgFHu8ACgkQwRnhpk1wk45MwACdGapdCmeHj1DlksmhTjc8fbaU c8sAni7rnVecT6ZgFlG8oGcdbiubTngn =EpEL -----END PGP SIGNATURE-----
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
===================================== Jim Hu Associate Professor Dept. of Biochemistry and Biophysics 2128 TAMU Texas A&M Univ. College Station, TX 77843-2128 979-862-4054
wikitech-l@lists.wikimedia.org