Hi, I attempted to import the English Wikipedia into MediaWiki by first downloading the pages-articles.xml.bz2, uncompressing it, splitting it using
xml2sql enwiki-20081008-pages-articles.xml
and finally imported the results using
mysqlimport -u root -p --local wikidb ./{page,revision,text}.txt
I also imported all of the SQL files on http://download.wikimedia.org/enwiki/20081008/
The problem that I am now facing is that the HTML Rendered is wrong in places. Mostly this happens at the beginning of the text on the Page. For example in the beginning the United_Kingdom article I get:
</tr><tr> <th colspan="2">Calling code</th> <td>+44 </td> </tr></table>
After this I get the normal article text i.e. “The United Kingdom of … “ etc.
The result of this is that the rest of the article is not formatted correctly. For example in IE the first paragraph is shifted into a column on the Right. In both IE and Mozilla I do not get the “navigation”, “search”, “interaction”, “toolbox”, languages” and the Sunflower MediaWiki Picture on the Top-Left Corner. (I get these elements in other pages though. I just wanted to illustrate the problems that bad HTML causes.) Another problem I am having is at the top of each page I get “Error: image is invalid or non-existent” – Is there a way to disable this error message. I know that I don’t have the images – and is not a problem for me. I would only prefer not to have this error message in red at the top of the Article.
Any ideas on what I might be doing wrong here?
Thanks, O. O.