On Thu, Sep 3, 2009 at 3:27 PM, James Richard <james.richard050@gmail.com> wrote:
Thanks for the detailed answer. I will use the dumps. Out of curiosity, though, can you tell me where that explicit live mirror prohibition is stated? I couldn't find any controlling documents on the subject.
http://meta.wikimedia.org/wiki/Live_mirrors
I don't know how formal or authoritative that is. You might want to ask someone like Brion. I think the answer in practice is that nobody's going to waste time blocking you if you don't cause noticeable load, but I don't know if there's an official statement anywhere. I vaguely recall that some sites might pay Wikimedia a fee to do commercial live mirroring, but I'm not sure about that.
Again, I'm referring to fetching the wiktionary mark-up source document through the API, not the rendered page.
An API request costs the servers roughly the same as a rendered page view, or perhaps more (because API responses are cached less effectively). The API just returns the content in a more bot-friendly format.
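
For reference, here is a rough sketch of what fetching the mark-up source of a single page through the API might look like. It is only an illustration, not an endorsed client: the endpoint URL, the User-Agent string, the helper names, and the response layout handled in extract_wikitext (the formatversion=2 shape with rvslots=main) are all assumptions to check against the current API documentation. The maxlag parameter asks the servers to reject the request when they are lagging, which is one way to keep a low-volume bot from adding noticeable load.

```python
import json
import urllib.parse
import urllib.request

# Assumed endpoint for English Wiktionary; adjust for other projects.
API_URL = "https://en.wiktionary.org/w/api.php"


def build_query_url(title, api_url=API_URL):
    """Build a query URL asking for the latest revision's wikitext."""
    params = {
        "action": "query",
        "prop": "revisions",
        "rvprop": "content",
        "rvslots": "main",
        "titles": title,
        "format": "json",
        "formatversion": "2",
        "maxlag": "5",  # back off when the servers report replication lag
    }
    return api_url + "?" + urllib.parse.urlencode(params)


def extract_wikitext(response):
    """Pull the wikitext out of a decoded response, or None if absent.

    Assumes the formatversion=2 layout, where "pages" is a list.
    """
    pages = response.get("query", {}).get("pages", [])
    if not pages or "revisions" not in pages[0]:
        return None
    return pages[0]["revisions"][0]["slots"]["main"]["content"]


def fetch_wikitext(title):
    """Fetch one page's wikitext over the network (one request per call)."""
    request = urllib.request.Request(
        build_query_url(title),
        # Hypothetical identifier; replace with your own bot name and contact.
        headers={"User-Agent": "example-bot/0.1 (contact: you@example.org)"},
    )
    with urllib.request.urlopen(request, timeout=30) as reply:
        return extract_wikitext(json.load(reply))
```

Calling fetch_wikitext("dictionary") would issue a single request. For anything beyond occasional lookups, the dumps remain the better choice, since each API call costs the servers about as much as a page view.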