Unsubscribe
On Mon, Jan 2, 2017 at 4:00 AM <xmldatadumps-l-request@lists.wikimedia.org> wrote:
Send Xmldatadumps-l mailing list submissions to
        xmldatadumps-l@lists.wikimedia.org

To subscribe or unsubscribe via the World Wide Web, visit
        https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l
or, via email, send a message with subject or body 'help' to
        xmldatadumps-l-request@lists.wikimedia.org

You can reach the person managing the list at
        xmldatadumps-l-owner@lists.wikimedia.org

When replying, please edit your Subject line so it is more specific
than "Re: Contents of Xmldatadumps-l digest..."


Today's Topics:

   1. Having trouble with November 2016 enwiki (Phil Hunt)


----------------------------------------------------------------------

Message: 1
Date: Mon, 2 Jan 2017 03:43:04 +0000
From: Phil Hunt <cabalamat@gmail.com>
To: xmldatadumps-l@lists.wikimedia.org
Subject: [Xmldatadumps-l] Having trouble with November 2016 enwiki
Message-ID:
        <CANrmF702ZtJ+MU4yTcbnGAXr4mJdXx8hrUsFs+5CJg+6ffnqQQ@mail.gmail.com>
Content-Type: text/plain; charset="utf-8"

Hi,

I have a local mediawiki installation and I'm trying to import the Nov 2016
en-wikipedia into it
(enwiki-20161101-pages-articles-multistream.xml.bz)


I'm using importDump.php, but the commands I try cause it to immediately
fall over with an error message. When I do:

$ php importDump.php <
~/data/wikifork/enwiki-20161101-pages-articles-multistream.xml.bz2
I get:

PHP Warning: XMLReader::read():
uploadsource://405216067937ad302ddd05631e3ee446:1: parser error : Document
is empty in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538

PHP Warning: XMLReader::read(): BZh91AY&SY귟C in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538

PHP Warning: XMLReader::read(): ^ in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 538

[651d62f6f956c1552494dab7] [no req] MWException from line 542 of
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php:
Expected <mediawiki> tag, got

Backtrace:

#0 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(318):
WikiImporter->doImport()

#1 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(277):
BackupReader->importFromHandle(resource)

#2 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(108):
BackupReader->importFromStdin()

#3 /home/phil/sproj/wikifork/mediawiki/maintenance/doMaintenance.php(111):
BackupReader->execute()

#4 /home/phil/sproj/wikifork/mediawiki/maintenance/importDump.php(323):
require_once(string)

#5 {main}

And when I do:

$ php importDump.php --conf ../LocalSettings.php
~/data/wikifork/enwiki-20161101-pages-articles-multistream.xml.bz2 my_wiki

I get:


PHP Warning: XMLReader::read():
uploadsource://cd1f7162d36eda06a29d64becaf0b825:45: parser error : Extra
content at the end of the document in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512

PHP Warning: XMLReader::read(): </siteinfo> in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512

PHP Warning: XMLReader::read(): ^ in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 512

PHP Warning: XMLReader::read(): Load Data before trying to read in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 603

PHP Warning: XMLReader::read(): Load Data before trying to read in
/home/phil/sproj/wikifork/mediawiki/includes/import/WikiImporter.php on
line 578

Done!

You might want to run rebuildrecentchanges.php to regenerate RecentChanges


I'm puzzled that I'm getting different error messages here.

Am I going down the wrong track entirely and need to use mwdumper?


--
Phil Hunt, <cabalamat@gmail.com>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.wikimedia.org/pipermail/xmldatadumps-l/attachments/20170102/1249f5a4/attachment-0001.html>

------------------------------

Subject: Digest Footer

_______________________________________________
Xmldatadumps-l mailing list
Xmldatadumps-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/xmldatadumps-l


------------------------------

End of Xmldatadumps-l Digest, Vol 81, Issue 1
*********************************************
--
~ Cam