Hello, Emmanuel, and everyone.
I'm still trying to create a ZIM file of the Hebrew Wikipedia dump. I've made some progress -- the dump is complete, but the buildZimFileFromDirectory.pl script is still failing:
The command line I used was:
asaf@abartov-deb:~/dev/kiwix/dumping_tools/scripts$ *./buildZimFileFromDirectory.pl --htmlPath=/home/asaf/dev/hewiki_dump --welcomePage=./index.html*
The output is:
NOTICE: CREATE TABLE will create implicit sequence "article_aid_seq" for serial column "article.aid" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "article_pkey" for table "article" NOTICE: CREATE TABLE will create implicit sequence "category_cid_seq" for serial column "category.cid" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "category_pkey" for table "category" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "categoryarticles_pkey" for table "categoryarticles" NOTICE: CREATE TABLE will create implicit sequence "zimfile_zid_seq" for serial column "zimfile.zid" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "zimfile_pkey" for table "zimfile" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "zimdata_pkey" for table "zimdata" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "zimarticles_pkey" for table "zimarticles" NOTICE: CREATE TABLE will create implicit sequence "indexarticle_xid_seq" for serial column "indexarticle.xid" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "indexarticle_pkey" for table "indexarticle" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "words_pkey" for table "words" NOTICE: CREATE TABLE / PRIMARY KEY will create implicit index "trivialwords_pkey" for table "trivialwords" Use of uninitialized value in concatenation (.) or string at ../classes//Kiwix/ZimIndexer.pm line 805. Use of uninitialized value $welcomePage in concatenation (.) or string at ../classes//Kiwix/ZimIndexer.pm line 746. DBD::Pg::db do failed: ERROR: invalid input syntax for integer: "" at ../classes//Kiwix/ZimIndexer.pm line 385. ERROR: invalid input syntax for integer: ""
The last few lines in all.log are:
[2009-06-30 07:05:43,997] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/�×/� /22/Portal~�×� _�_����¢�×__79ba.html [2009-06-30 07:05:44,010] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/�×/� /22/Portal~�×� _�_����¢�×__79ba.html [2009-06-30 07:05:44,243] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/�/�¨/� /��¨� ���_���.html [2009-06-30 07:05:44,305] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/�/�¨/� /��¨� ���_���.html [2009-06-30 07:05:44,547] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/a/c/h/�§���¥~Achdut.JPG_bd38.html [2009-06-30 07:05:44,567] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/a/c/h/�§���¥~Achdut.JPG_bd38.html [2009-06-30 07:05:44,797] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/�/�/�/�×�� ��×~������¤26����.html [2009-06-30 07:05:44,812] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/�/�/�/�×�� ��×~������¤26����.html [2009-06-30 07:05:45,007] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/t/r/i/�§���¥~Triumph_of_the_Will_-_Congress_Hall.jpg_2aad.html [2009-06-30 07:05:45,024] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/t/r/i/�§���¥~Triumph_of_the_Will_-_Congress_Hall.jpg_2aad.html [2009-06-30 07:05:45,249] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/�×/�/�×/�×��×��×_������×.html [2009-06-30 07:05:45,509] builZimFileFromDirectory.pl - Rewriting url in /home/asaf/dev/hewiki_dump/articles/�/�/�/������ [2009-06-30 07:05:45,578] builZimFileFromDirectory.pl - Adding to DB /home/asaf/dev/hewiki_dump/articles/�/�/�/������
Any idea what is wrong? Any more diagnostics I can provide?
Thanks in advance,
Asaf
On Mon, Apr 20, 2009 at 11:11 PM, Emmanuel Engelhart <emmanuel@engelhart.org
wrote:
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Asaf Bartov a écrit :
So far, I've been able to compile and run all the tools, but am having
some
trouble creating ZIM files: I've dumped a locally-loaded Mediawiki installation to HTML using these instructions <
http://openzim.org/Wiki2html%3E,
but when I try to run the builZimFileFromDirectory.pl script, I get silly postgresql errors about failing to connect using "kiwix" user.
I have commited to Kiwix svn a new version of builZimFileFromDirectory.pl allowing to specify the dbUser and dbPassword on the command line.
Emmanuel -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org
iEYEARECAAYFAkns1vEACgkQn3IpJRpNWtNliwCeKyaPwXWdCr5rclkuYvqgCa2m wdMAn2F+sgUByhQT5g4o6XoD7WJeIeIL =USFE -----END PGP SIGNATURE-----