On 22/03/13 01:16, Jiang BIAN wrote:
Thanks for detailed instructions. A few minor things
still not clear to
me, inline:
On Thu, Mar 21, 2013 at 1:02 PM, Richard Farmbrough
<richard(a)farmbrough.co.uk <mailto:richard@farmbrough.co.uk>> wrote:
The only fifth exception is Wikidata:
So you need
1. Mediawiki + the same extensions zh:Wikipedia uses
How can I know what extensions is used on zh:Wikipedia? and what's its
config?
The configuration of the wikipedias is at
http://noc.wikimedia.org/conf/
(although not very "clean"...)
For the list of extensions used, I recommend looking at
http://zh.wikipedia.org/wiki/Special:Version
2. The dumps (including cats and templates)
Looks to me zhwiki-20130315-pages-articles-multistream.xml.bz2
<http://dumps.wikimedia.org/zhwiki/20130315/zhwiki-20130315-pages-articles-multistream.xml.bz2>
is
the one I want, right?
Yes, either zhwiki-20130330-pages-articles.xml.bz2 or
zhwiki-20130330-pages-articles.xml.bz2 should be enough (it's the same
content), as you probably only need the articles.
3. Instacommons (hadn't heard of this before,
sounds cool)
Sounds like this is a config setting, right?
Right.
4. The config setup of zh:Wikipedia, most of which
is public
(including all the bits you need probably)
5. Some way to talk to Wikidata
All the best, Richard.