On 2/23/09 3:08 AM, Marco Schuster wrote:
Even if you had the dumps, you have another problem: They're incredibly big and so a bit difficult to parse. So, a small suggestion if the dumps will ever be workin' again: Split the history and current db stuff by alphabet, please.
Define alphabet -- how should Chinese and Japanese texts be broken up?
We're much more likely to break them up simply by page ID.
PS: Are there any measurements what traffic is generated by ppl who download the dumps?
Not currently.
Have there been any attempts to distribute them via BitTorrent?
By third parties, with AFAIK very little usage.
-- brion