Unsubscribe
Send Xmldatadumps-l mailing list submissions to
xmldatadumps-l@lists.wikimedia.org
To subscribe or unsubscribe via the World Wide Web, visit
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
or, via email, send a message with subject or body 'help' to
xmldatadumps-l-request@lists.wikimedia.org
You can reach the person managing the list at
xmldatadumps-l-owner@lists.wikimedia.org
When replying, please edit your Subject line so it is more specific
than "Re: Contents of Xmldatadumps-l digest..."
Today's Topics:
1. Having trouble with November 2016 enwiki (Phil Hunt)
----------------------------------------------------------------------
Message: 1
Date: Mon, 2 Jan 2017 03:43:04 +0000
From: Phil Hunt <cabalamat@gmail.com>
To: xmldatadumps-l@lists.wikimedia.org
Subject: [Xmldatadumps-l] Having trouble with November 2016 enwiki
Message-ID:
<CANrmF702ZtJ+MU4yTcbnGAXr4mJdXx8hrUsFs+5CJg+6ffnqQQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"
Hi,
I have a local mediawiki installation and I'm trying to import the Nov 2016
en-wikipedia into it
(enwiki-20161101-pages-articles-multistream.xml.bz)
I'm using importDump.php, but the commands I try cause it to immediately
fall over with an error message. When I do:
$ php importDump.php <
~/data/wikifork/enwiki-20161101-pages-articles-multistream.xml.bz2
I get:
PHP Warning: XMLReader::read():
uploadsource://405216067937ad302ddd05631e3ee446:1: parser error : Document
is empty in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538
PHP Warning: XMLReader::read(): BZh91AY&SY귟C in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538
PHP Warning: XMLReader::read(): ^ in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538
[651d62f6f956c1552494dab7] [no req] MWException from line 542 of
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php:
Expected <mediawiki> tag, got
Backtrace:
#0 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(318):
WikiImporter->doImport()
#1 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(277):
BackupReader->importFromHandle(resource)
#2 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(108):
BackupReader->importFromStdin()
#3 /home/phil/sproj/wikifork/mediawiki/maintenance/doMaintenance.php(111):
BackupReader->execute()
#4 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(323):
require_once(string)
#5 {main}
And when I do:
$ php importDump.php --conf ../LocalSettings.php
~/data/wikifork/enwiki-20161101-pages-articles-multistream.xml.bz2 my_wiki
I get:
PHP Warning: XMLReader::read():
uploadsource://cd1f7162d36eda06a29d64becaf0b825:45: parser error : Extra
content at the end of the document in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512
PHP Warning: XMLReader::read(): </siteinfo> in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512
PHP Warning: XMLReader::read(): ^ in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512
PHP Warning: XMLReader::read(): Load Data before trying to read in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 603
PHP Warning: XMLReader::read(): Load Data before trying to read in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 578
Done!
You might want to run rebuildrecentchanges.php to regenerate RecentChanges
I'm puzzled that I'm getting different error messages here.
Am I going down the wrong track entirely and need to use mwdumper?
--
Phil Hunt, <cabalamat@gmail.com>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.wikimedia.org/pipermail/xmldatadumps-l/attachments/20170102/1249f5a4/attachment-0001.html>
------------------------------
Subject: Digest Footer
_______________________________________________
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
------------------------------
End of Xmldatadumps-l Digest, Vol 81, Issue 1
*********************************************