Jeremy Dunck wrote:
On 10/17/05, Jakob Voss jakob.voss@nichtich.de wrote:
Hi!
Once there was the size of all Wikipedia database dumps at download.wikipedia.org.
I was just pondering this yesterday. Samuel is master with 6x73 GB= 438 GB. Of course, that's not in dump form.
Jakob, this ties in with the earlier request for wikipedia-by-mail. I was thinking of doing a Fundable.org drive for an array so that I could serve those requests, but perhaps using the Tool server makes more sense...
Samuel's InnoDB data files are about 290 GB, but it's likely most of that is free space. There's also about 100 GB distributed across our external storage DBs; the hypothesized free space in samuel is because we moved a lot of the text out of it and into external storage. It's all compressed with gzip.
The current total size of all the pages_full.xml.bz2 files from the latest dump is 14 GB. In total, the wikipedia directory on the download server is using 236 GB, thanks mostly to image tarballs and poorly compressed copies of the text.
-- Tim Starling