Hello Everyone,
I'm working on converting an old Ruby driven wiki that uses textile markup to mediawiki. I'm getting hung up on the process of importing the data into the database for mediawiki.
Right now I have about 300 .html files that need to become wiki pages. The wiki page name would be the name of the html file. Can anyone shed some light on where to look for either a script that does something similer, or even some assistance on where and how the text within the mediawiki db is stored.
The perl module HTML::WikiConverter works great for the translation, as long as the HTML isn't too nasty (say, the HTML output of Microsoft Office). Try it here:
http://diberri.dyndns.org/wikipedia/html2wiki/
DanB
-----Original Message----- Right now I have about 300 .html files that need to become wiki pages. The wiki page name would be the name of the html file.
Dan,
That looks like it would work for a page or two. I'm looking for a way to atomate the process. The main issue we are experiancing isn't altering the syntax to wiki markup but actually importing it into the db. We are trying to streamline the process so it doens't require days of pasting content.
Thanks for the link !
I think you might misunderstand what the software does. It's a general Perl module for doing the conversion. The web page I mentioned is merely somebody's user interface slapped on top of the module. It should be easy for any Perl programmer to write a script that processes your 300 HTML files using the module, and produces 300 Wikitext files, which you can then import into the wiki with a standard MediaWiki maintenance script. An intermediate-level Perl programmer should be able to do this in a few hours max.
DanB ________________________________________ That looks like it would work for a page or two. I'm looking for a way to atomate the process. The main issue we are experiancing isn't altering the syntax to wiki markup but actually importing it into the db. We are trying to streamline the process so it doens't require days of pasting content.
We have converted all of the files to wikitext , now we just need to find the simple maintenance script to import them. Sorry about the misunderstanding, and thank you for your advice. The converter worked great.
Jon Welters wrote:
We have converted all of the files to wikitext , now we just need to find the simple maintenance script to import them. Sorry about the misunderstanding, and thank you for your advice. The converter worked great.
maintenance/importTextFile.php
mediawiki-l@lists.wikimedia.org