On Apr 10, 2020, at 5:55 PM, Francis Franck
<francis.franck(a)gmail.com> wrote:
> How do I launch Parsoid's conversion of my website?
> Apparently, I succeeded in doing it, because I found an HTML copy of my site in a
> directory called localhost. (I'm running Ubuntu under Windows 10.)
> Parsoid seems to be properly installed. The command "curl -L
> http://localhost:8142/localhost/v3/page/html/Main_Page/" gives the expected result
> and my website "http://localhost/mediawiki/" uses "VisualEditor".
> My first question: how do I instruct Parsoid to start converting my whole site?
Sorry, can you clarify what you're trying to achieve?
Traditionally, Parsoid has been run as a service on the side that clients, like VE, can
use to parse and serialize wikitext.
It sounds like you're trying to produce a static copy of your site for backup or
offline use?
If so, maybe see the mwoffliner project:
https://github.com/openzim/mwoffliner
or this previous discussion:
https://lists.wikimedia.org/pipermail/wikitext-l/2020-February/000994.html
> And secondly: how can I teach Parsoid to add
> ".html" to the links it creates? At present it makes links like "<a
> href="/mediawiki/index.php/Dossier:Drebbels_Thermometer"
> title="Dossier:Drebbels Thermometer">"
We should probably settle the first question first, because whether this is worth
doing depends on the answer.
Parsoid doesn't offer a configuration option for this, so doing it inside Parsoid
would take some code modification.
Alternatively, you can take the output that Parsoid produces and write a script that
parses the HTML, finds all the links, and adds the extension, using a generic DOM
manipulation library.
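
A minimal sketch of that post-processing idea, in case it helps. To avoid extra
dependencies it uses Python's standard-library html.parser (an event-based parser,
not a real DOM library) and re-emits the page with only the <a> start tags rebuilt;
a DOM library would do the same job. The names add_html_extension, LinkRewriter and
rewrite_links are made up for this example, and the "/mediawiki/index.php/" prefix
is simply taken from the link quoted above, so adjust it to your setup.

```python
from html import escape
from html.parser import HTMLParser


def add_html_extension(href, prefix):
    """Append ".html" to the path of hrefs that start with `prefix`."""
    if not href.startswith(prefix):
        return href  # external links, "#anchor" links, etc. are left alone
    # Keep any "?query" and "#fragment" after the new extension.
    path, hash_sep, fragment = href.partition("#")
    path, query_sep, query = path.partition("?")
    if path == prefix or path.endswith(".html"):
        return href
    return path + ".html" + query_sep + query + hash_sep + fragment


class LinkRewriter(HTMLParser):
    """Re-emit the input HTML, rewriting only the href of <a> start tags."""

    def __init__(self, prefix):
        # convert_charrefs=False so "&amp;" and friends pass through unchanged.
        super().__init__(convert_charrefs=False)
        self.prefix = prefix
        self.out = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            self.out.append(self.get_starttag_text())  # copy verbatim
            return
        parts = [tag]
        for name, value in attrs:
            if value is None:  # attribute without a value
                parts.append(name)
                continue
            if name == "href":
                value = add_html_extension(value, self.prefix)
            # The parser unescapes attribute values, so escape them again.
            parts.append(f'{name}="{escape(value, quote=True)}"')
        self.out.append("<" + " ".join(parts) + ">")

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> are copied verbatim.
        self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        self.out.append(f"</{tag}>")

    def handle_data(self, data):
        self.out.append(data)

    def handle_entityref(self, name):
        self.out.append(f"&{name};")

    def handle_charref(self, name):
        self.out.append(f"&#{name};")

    def handle_comment(self, data):
        self.out.append(f"<!--{data}-->")

    def handle_decl(self, decl):
        self.out.append(f"<!{decl}>")

    def handle_pi(self, data):
        self.out.append(f"<?{data}>")


def rewrite_links(html_text, prefix):
    """Return `html_text` with ".html" added to links under `prefix`."""
    parser = LinkRewriter(prefix)
    parser.feed(html_text)
    parser.close()
    return "".join(parser.out)


if __name__ == "__main__":
    page = '<a href="/mediawiki/index.php/Dossier:Drebbels_Thermometer">x</a>'
    print(rewrite_links(page, "/mediawiki/index.php/"))
```

You would run rewrite_links over each saved page and write the result back out.
Links that don't start with the prefix are left untouched, and any "?query" or
"#fragment" part stays after the added extension.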