Amir, Gerard:
The easiest way to test locally at the moment is to use the standalone
'mw-ocg-bundler' and 'mw-ocg-latexer' node packages. There are good
installation instructions in the READMEs; see:
https://npmjs.org/package/mw-ocg-bundler
https://npmjs.org/package/mw-ocg-latexer
and let me know if I need to document anything better.
This will let you pull individual articles from an arbitrary wiki, and
then typeset them with xelatex.
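
Since articles are pulled one at a time, a test URL has to be split into
its wiki host and article title before it is any use on the command
line. Here is a rough Python sketch of that split. The function name
split_article_url is made up for illustration, and it assumes the usual
/wiki/ path layout; how the bundler actually wants the wiki and title
passed is described in its README, not here.

```python
from urllib.parse import urlsplit, unquote


def split_article_url(url):
    """Split a wiki article URL into (host, title).

    Hypothetical helper: assumes the article lives under /wiki/.
    """
    parts = urlsplit(url)
    prefix = "/wiki/"
    if not parts.path.startswith(prefix):
        raise ValueError("not a /wiki/ article URL: " + url)
    # Decode any percent-escapes, then turn underscores into spaces,
    # since MediaWiki treats the two as equivalent in titles.
    title = unquote(parts.path[len(prefix):]).replace("_", " ")
    return parts.hostname, title


if __name__ == "__main__":
    print(split_article_url("http://en.wikipedia.org/wiki/Durian"))
```

Titles in non-Latin scripts pass through unchanged, so the same helper
should work for every URL in the test list.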
There is currently good support for quite a number of languages. My
standard set of test articles contains:
http://ar.wikipedia.org/wiki/ليونيل_ميسي
http://ar.wikipedia.org/wiki/بشير_الثاني_الشهابي
http://ar.wikipedia.org/wiki/حمزة_بن_عبد_المطلب
http://ar.wikipedia.org/wiki/إسطنبول
http://ar.wikipedia.org/wiki/الحرب_الإنجليزية_الزنجبارية
http://de.wikipedia.org/wiki/Papier
http://en.wikipedia.org/wiki/Durian
http://es.wikipedia.org/wiki/Latas_de_sopa_Campbell
http://fa.wikipedia.org/wiki/کعبه_زرتشت
http://fr.wikipedia.org/wiki/Trachylepis_atlantica
http://he.wikipedia.org/wiki/ספרטה
http://hi.wikipedia.org/wiki/रामायण
http://it.wikipedia.org/wiki/La_vita_è_meravigliosa
http://ja.wikipedia.org/wiki/熊野三山本願所
http://ja.wikipedia.org/wiki/金星の日面通過
http://ko.wikipedia.org/wiki/조화진동자
http://ml.wikipedia.org/wiki/മലയാളം
http://pl.wikipedia.org/wiki/Efekt_potwierdzenia
http://pt.wikipedia.org/wiki/Scaphyglottis
http://ru.wikipedia.org/wiki/Битва_при_Платеях
http://simple.wikipedia.org/wiki/Taoism
http://vi.wikipedia.org/wiki/Vệ_tinh_tự_nhiên_của_Sao_Thiên_Vương
http://zh.wikipedia.org/wiki/納粹德國海軍
and a few other English articles. That said, I don't read most of
these languages, so I've mostly been trying to ensure that our output
matches the HTML displayed by the wiki. It is quite possible I've
chosen bad-looking fonts, or that there are other details that could
be improved. (For example, the way that Vietnamese stacked accents
were rendered was bad for a while; I've fixed that now.) Comments
eagerly requested!
--scott
P.S. There are a number of minor issues with citations in RTL
languages, even in our standard HTML rendering on the wikis; it
appears that our citation templates should be more aggressive about
adding <bdi> tags or lang attributes to ensure that citations of LTR
sources in an RTL article are displayed as nicely as possible. If
these fixes are made to the source, the LaTeX output should inherit
them.
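
To make the <bdi> idea concrete, here is a small Python sketch of the
kind of check a citation template could perform. It is only an
illustration, not how the existing templates work: the function names
base_direction and isolate_citation are invented, and guessing direction
from the first strongly directional character is a simplification of
the full Unicode bidi algorithm.

```python
import html
import unicodedata

# Bidi classes for strongly right-to-left characters
# (Hebrew is "R", Arabic and related scripts are "AL").
RTL_CLASSES = {"R", "AL"}


def base_direction(text):
    """Guess 'rtl' or 'ltr' from the first strong character, else None."""
    for ch in text:
        bidi = unicodedata.bidirectional(ch)
        if bidi in RTL_CLASSES:
            return "rtl"
        if bidi == "L":
            return "ltr"
    return None


def isolate_citation(citation, article_dir):
    """Wrap the citation in <bdi> when its direction differs from the article's."""
    escaped = html.escape(citation)
    direction = base_direction(citation)
    if direction is None or direction == article_dir:
        return escaped
    return "<bdi>" + escaped + "</bdi>"


if __name__ == "__main__":
    print(isolate_citation("Smith, J. (2010). Durian.", "rtl"))
```

A citation whose direction matches the article is returned unwrapped, so
only the mixed-direction cases pick up extra markup.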