On Thu, Sep 3, 2009 at 3:27 PM, James Richard<james.richard050(a)gmail.com> wrote:
> Thanks for the detailed answer. I will use the dumps. Out of curiosity,
> though, can you tell me where that explicit live mirror prohibition is
> stated? I couldn't find any controlling documents on the subject.
http://meta.wikimedia.org/wiki/Live_mirrors
I don't know how formal or authoritative that is. You might want to
ask someone like Brion. I think the answer in practice is that
nobody's going to waste time blocking you if you don't cause
noticeable load, but I don't know if there's an official statement
anywhere. I vaguely recall that some sites might pay Wikimedia a fee
to do commercial live mirroring, but I'm not sure on that.
> Again, I'm referring to fetching the wiktionary mark-up source document
> through the API, not the rendered page.
An API request costs the servers roughly the same as fetching the
rendered page, or perhaps more (due to worse caching). The API just
returns the content in a more bot-friendly format.
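
For reference, here is a rough sketch of what fetching the mark-up through the API might look like. This is only an illustration, not an endorsed client: the endpoint URL, the User-Agent string, the one-second delay, and the helper names build_wikitext_url and fetch_wikitext are all assumptions of mine. The query parameters (action=query, prop=revisions, rvprop=content) are the standard ones for retrieving a page's latest wikitext.

```python
import json
import time
import urllib.parse
import urllib.request

# Assumed endpoint; adjust for the wiki you actually want to query.
API_URL = "https://en.wiktionary.org/w/api.php"


def build_wikitext_url(title, api_url=API_URL):
    """Return an API URL asking for the latest revision's wikitext of one page."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "titles": title,
        "format": "json",
    }
    return api_url + "?" + urllib.parse.urlencode(params)


def fetch_wikitext(title, delay_seconds=1.0):
    """Fetch one page's wikitext, sleeping first to keep the request rate low.

    Hypothetical helper: the delay and the User-Agent value are placeholders.
    """
    time.sleep(delay_seconds)
    request = urllib.request.Request(
        build_wikitext_url(title),
        headers={"User-Agent": "example-bot/0.1 (contact address here)"},
    )
    with urllib.request.urlopen(request) as response:
        data = json.load(response)
    # The classic JSON layout keys pages by page id and stores the text under "*".
    # A missing page has no "revisions" entry, so this raises KeyError for it.
    page = next(iter(data["query"]["pages"].values()))
    return page["revisions"][0]["*"]
```

Each call still hits the servers about as hard as a page view, so for anything beyond occasional lookups the dumps remain the better choice.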