On 1/13/07, Jon Noring <jon(a)noring.name> wrote:
I'm curious as to the size (in bytes) of the
current Wikipedia.
That is, if one took a snapshot of the Wikipedia in web form (including
markup, images, multimedia, etc.), how large would it be? If the web
documents were compressed, then how large would it be? (This would not
include edit history information which I assume is substantial -- only
interested in a snapshot of the current pages.)
You can go to
http://download.wikipedia.org/ and look at the static
downloads. The current html dowload for the English wikipedia is about
5.5 GB, compressed with 7-zip.
In the tables at
http://stats.wikimedia.org/EN/TablesWikipediaEN.htm
the number of "binaries" (images, audio etc) was 620k last October. I
don't see the actual size in bytes anywhere.
Alfio