Hi Scott,
I tried to run it.
If I understand the docs correctly, then the command node bin/mw-ocg-bundler -o bundle.zip --prefix he --title "ASCII" is supposed to create a bundle from the Hebrew Wikipedia article "ASCII". (It's title is in written in Latin letter.)
When I try to run it, I get the following error: Either the -m option or a page title is required.
Also, the help screen says this: --title <title> Set the title of the generated collection
So is the --title option for the collection title of for the input article title?
Thanks!
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore
2014/1/21 C. Scott Ananian cananian@wikimedia.org
Amir, Gerard: The easiest way to test locally at the moment is to use the standalone 'mw-ocg-bundler' and 'mw-ocg-latexer' node packages. There are good installation instructions in the READMEs, see:
https://npmjs.org/package/mw-ocg-bundler https://npmjs.org/package/mw-ocg-latexer
and let me know if I need to document anything better.
This will let you pull individual articles from an arbitrary wiki, and then typeset them with xelatex.
There is currently good support for quite a number of languages. My standard test case contains: http://ar.wikipedia.org/wiki/%D9%84%D9%8A%D9%88%D9%86%D9%8A%D9%84_%D9%85%D9%... http://ar.wikipedia.org/wiki/%D8%A8%D8%B4%D9%8A%D8%B1_%D8%A7%D9%84%D8%AB%D8%... http://ar.wikipedia.org/wiki/%D8%AD%D9%85%D8%B2%D8%A9_%D8%A8%D9%86_%D8%B9%D8... http://ar.wikipedia.org/wiki/%D8%A5%D8%B3%D8%B7%D9%86%D8%A8%D9%88%D9%84 http://ar.wikipedia.org/wiki/%D8%A7%D9%84%D8%AD%D8%B1%D8%A8_%D8%A7%D9%84%D8%... http://de.wikipedia.org/wiki/Papier http://en.wikipedia.org/wiki/Durian http://es.wikipedia.org/wiki/Latas_de_sopa_Campbell http://fa.wikipedia.org/wiki/%DA%A9%D8%B9%D8%A8%D9%87_%D8%B2%D8%B1%D8%AA%D8%... http://fr.wikipedia.org/wiki/Trachylepis_atlantica http://he.wikipedia.org/wiki/%D7%A1%D7%A4%D7%A8%D7%98%D7%94 http://hi.wikipedia.org/wiki/%E0%A4%B0%E0%A4%BE%E0%A4%AE%E0%A4%BE%E0%A4%AF%E... http://it.wikipedia.org/wiki/La_vita_%C3%A8_meravigliosa http://ja.wikipedia.org/wiki/%E7%86%8A%E9%87%8E%E4%B8%89%E5%B1%B1%E6%9C%AC%E... http://ja.wikipedia.org/wiki/%E9%87%91%E6%98%9F%E3%81%AE%E6%97%A5%E9%9D%A2%E... http://ko.wikipedia.org/wiki/%EC%A1%B0%ED%99%94%EC%A7%84%EB%8F%99%EC%9E%90 http://ml.wikipedia.org/wiki/%E0%B4%AE%E0%B4%B2%E0%B4%AF%E0%B4%BE%E0%B4%B3%E... http://pl.wikipedia.org/wiki/Efekt_potwierdzenia http://pt.wikipedia.org/wiki/Scaphyglottis http://ru.wikipedia.org/wiki/%D0%91%D0%B8%D1%82%D0%B2%D0%B0_%D0%BF%D1%80%D0%... http://simple.wikipedia.org/wiki/Taoism http://vi.wikipedia.org/wiki/V%E1%BB%87_tinh_t%E1%BB%B1_nhi%C3%AAn_c%E1%BB%A... http://zh.wikipedia.org/wiki/%E7%B4%8D%E7%B2%B9%E5%BE%B7%E5%9C%8B%E6%B5%B7%E...
and a few other English articles. That said, I don't read most of these languages, so I've mostly been trying to ensure that our output matches the HTML displayed by the wiki. It is quite possible I've chosen bad-looking fonts, or that there are other details that could be improved. (For example, the way that Vietnamese stacked accents was bad for a while; I've fixed that now.) Comments eagerly requested! --scott
ps. there are a number of minor issues with citations in RTL languages, even in our standard HTML rendering on the wikis; it appears that our citation templates should be more aggressive about adding <bdi> tags or lang attributes to ensure that citations of LTR sources in an RTL article are displayed as nicely as possible. If these fixes are made to the source, the latex output should inherit them.
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l