Hi, I attempted to import the English Wikipedia into MediaWiki by first downloading pages-articles.xml.bz2, uncompressing it, splitting it using
xml2sql enwiki-20081008-pages-articles.xml
and finally importing the results using
mysqlimport -u root -p --local wikidb ./{page,revision,text}.txt
I also imported all of the SQL files at http://download.wikimedia.org/enwiki/20081008/.
The problem I am now facing is that the rendered HTML is wrong in places, mostly at the beginning of the page text. For example, at the beginning of the United_Kingdom article I get:
</tr><tr> <th colspan="2">Calling code</th> <td>+44 </td> </tr></table>
After this I get the normal article text, i.e. “The United Kingdom of …” etc.
The result is that the rest of the article is not formatted correctly. For example, in IE the first paragraph is shifted into a column on the right. In both IE and Mozilla I do not get the “navigation”, “search”, “interaction”, “toolbox” and “languages” boxes, or the MediaWiki sunflower picture in the top-left corner. (I do get these elements on other pages; I just wanted to illustrate the problems that the bad HTML causes.)
Another problem is that at the top of each page I get “Error: image is invalid or non-existent”. Is there a way to disable this error message? I know that I don't have the images, and that is not a problem for me. I would just prefer not to have this error message in red at the top of the article.
Any ideas on what I might be doing wrong here?
Thanks, O. O.
It's likely that you aren't doing anything wrong, but that the wikisyntax in the article itself is wrong. Try enabling Tidy to get the "corrected" HTML.
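For reference, on MediaWiki releases of this era (1.14 or thereabouts) Tidy is switched on in LocalSettings.php. A minimal sketch, assuming the tidy binary is installed at /usr/bin/tidy (that path is an assumption; adjust it for your system):

```php
// LocalSettings.php (fragment, appended to the existing file)

// Pass rendered page HTML through Tidy so that unbalanced tags
// (such as a stray </tr> or </table>) are cleaned up.
$wgUseTidy = true;

// Assumed location of the tidy executable; change to match your install.
$wgTidyBin = '/usr/bin/tidy';
```

Pages that were already rendered may still be served from the parser cache, so a page may need to be purged before the change is visible.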
Hi,
On Fri, Mar 13, 2009 at 8:37 PM, O. O. olson_ot@yahoo.com wrote:
> Hi, I attempted to import the English Wikipedia into MediaWiki by first
> ...
> The problem that I am now facing is that the HTML Rendered is wrong in places. Mostly this happens at the beginning of the text on the Page. For example in the beginning the United_Kingdom article I get:
> </tr><tr> <th colspan="2">Calling code</th> <td>+44 </td> </tr></table>
This is immediately preceded by <ref> tags, implemented by the "Cite.php" extension. Did you install that? That may be your problem. Check that you have all the extensions that the 'pedia routinely uses.
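If it is missing, extensions of this vintage are loaded from LocalSettings.php. A minimal sketch, assuming the extension has been unpacked into extensions/Cite under the wiki root (that location is an assumption):

```php
// LocalSettings.php (fragment, appended to the existing file)

// Load the Cite extension, which provides the <ref> and <references> tags.
// $IP is the wiki's installation path, already defined by MediaWiki.
require_once( "$IP/extensions/Cite/Cite.php" );
```

The Special:Version page on your wiki lists the extensions that are actually loaded, and the same page on the English Wikipedia shows what it runs, so the two can be compared.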
> Another problem I am having is at the top of each page I get “Error: image is invalid or non-existent” – Is there a way to disable this error message. I know that I don’t have the images – and is not a problem for me. I would only prefer not to have this error message in red at the top of the Article.
A simple trick: this is a MediaWiki message that can be modified. Edit MediaWiki:imagemap_invalid_image and change it to a non-empty placeholder (not to blank, as that just gets you the default message). (Why this message starts with "imagemap" I don't know, but it doesn't seem to appear anywhere else. Try it at least.) You'll still get a blank line etc. in some cases.
Robert
Thanks Robert.
Robert Ullmann wrote:
> Hi,
> This is immediately preceded by <ref> tags, implemented by the "Cite.php" extension. Did you install that? May be your problem. Check that you have all the extensions that the 'pedia routinely uses.
Yes, this was a good guess. I actually could not get the latest version of the Cite extension (http://www.mediawiki.org/wiki/Extension:Cite) to work with the latest version of MediaWiki, i.e. version 1.14.0. The 1.14.0 version of the extension gives me this error on the Main_Page:
Fatal error: Call to undefined method ParserOptions::getIsSectionPreview() in /var/www/wiki2/extensions/Cite/Cite_body.php on line 699
So instead I am using the 1.13.0 version of the Cite extension.
> A simple trick: this is a mediawiki message that can be modified. Edit Mediawiki:imagemap_invalid_image and change it to (not to blank, as that just gets you the default message). (Why this message starts with "imagemap" I don't know, but it doesn't seem to appear anywhere else? Try it at least.) You'll still get a blank line etc in some cases.
I did not understand where I have to edit “imagemap_invalid_image”. I am new here, so I am not sure which file I have to edit.
Thanks again, O. O.
O. O. wrote:
> I did not understand where I have to edit “imagemap_invalid_image” I am new here, so I am not sure which file I have to edit.
You have to edit [[MediaWiki:imagemap_invalid_image]], which is a wiki page, not a file.
Roan Kattouw (Catrope)
Roan Kattouw wrote:
> O. O. wrote:
>> I did not understand where I have to edit “imagemap_invalid_image” I am new here, so I am not sure which file I have to edit.
> You have to edit [[MediaWiki:imagemap_invalid_image]], which is a wiki page, not a file.
> Roan Kattouw (Catrope)
Thanks Roan for this tip. For others who come this way: you can only edit this page with a wiki administrator account, i.e. WikiSysop by default.
I think I am going to make a new thread regarding the Cite Extension.
Thanks, O. O.
wikitech-l@lists.wikimedia.org