Thanks a lot for your. It worked!!! But the problem is the process is very very slow :( I
started to run it two days ago and it's still running… Do you know why?
El 23/07/2013, a les 1:21, Felipe Ortega <glimmer_phoenix(a)yahoo.es> va escriure:
De: A B <m311man(a)gmail.com>
Enviado: Lunes 22 de julio de 2013 21:44
Asunto: [Xmldatadumps-l] Import logging
I'm trying to import the enwiki-pages-logging.xml into a MySQL database and I'm
having a lot of troubles converting de XML into a SQL statements. I'm using
importDump.php to do this conversion, but I'm getting an error when the script tries
to import a register with the next data:
It seems an encoding problem, but I think I have everything correct. Does anybody have a
I'm not sure what is the problem with importDump.php, but it does seem like an
encoding issue. Can you provide the full error dump message? A first sanity check I can
think of is to verify the character-set of your MySQL server. I always use (in the MySQL
character_set_server = 'utf8'
As an alternative, I have been consistently importing pages-logging.xml dumps for
different Wikipedia languages with the script "pages_logging.py" in my WikiDAT
The only (additional) dependencies is to have Python and the Python MySQLdb module
installed (you have not say which is your operating system). First, you need to have a
MySQL database already created, as well as the logging table according to this schema:
(logging table is at the end of the file).
Usage is (from command line):
$ python pages_logging.py db_name db_user db_passw dump_file log_file
$ python pages_logging.py enwiki foouser foopassw enwiki-pages-logging.xml.gz
It may create a bit more info than you strictly need, but you should be able to import
the dump file without issues. If you find any problems, just let me know and I can try to
help solve them.
> Thanks in advance.
> Xmldatadumps-l mailing list