[Xmldatadumps-l] Fwd: Number of pages on Wikipedia
Thomas Dalton
thomas.dalton at gmail.com
Tue Jun 29 00:20:48 UTC 2010
This went off-list for some reason...
---------- Forwarded message ----------
From: Thomas Dalton <thomas.dalton at gmail.com>
Date: 29 June 2010 01:18
Subject: Re: [Xmldatadumps-l] Number of pages on Wikipedia
To: "Chrisil J. Arackaparambil" <chrisil at lanl.gov>
On 29 June 2010 01:06, Chrisil J. Arackaparambil <chrisil at lanl.gov> wrote:
> Hello everybody,
>
> I was doing a bit of analysis of the dump
> enwiki-20100130-pages-meta-history.xml.7z. What I found to my surprise
> is that there are (at least) 7 million pages in the main namespace. I
> got this figure by grepping for page titles that do not contain a ":"
> character. Is this really the case or am I missing something? I'd seen
> some Wikimedia stats that said the number of articles currently is about
> 3.2 million, so I'm not sure why I'm seeing so many pages in the dump.
The 3.2 million figure does not include redirects.
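As a rough sketch of how the two figures could be reconciled, the Python below streams a decompressed dump, counts the colon-free titles (the same heuristic as the grep), and then counts how many of those look like redirects. The function name `count_pages`, the tiny sample XML, and the redirect test (a page-level `<redirect>` element, or a last revision whose text starts with `#REDIRECT`) are assumptions for illustration only. It also assumes revisions are listed oldest first within each page. The colon test remains a heuristic, since some main-namespace titles contain a colon.

```python
import io
import xml.etree.ElementTree as ET


def local_name(tag):
    """Drop any '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.rsplit("}", 1)[-1]


def count_pages(stream):
    """Return (colon_free_pages, redirects_among_them) for a dump stream.

    Assumptions (not confirmed by the original thread): a page is treated
    as a redirect if it has a <redirect> child element, or if the text of
    its last listed revision starts with '#REDIRECT'.
    """
    colon_free = 0
    redirects = 0
    title, redirect_flag, last_text_redirect = "", False, False

    for _event, elem in ET.iterparse(stream, events=("end",)):
        name = local_name(elem.tag)
        if name == "title":
            title = elem.text or ""
        elif name == "redirect":
            redirect_flag = True
        elif name == "text":
            body = (elem.text or "").lstrip().upper()
            # Overwritten by each later revision, so the last one wins.
            last_text_redirect = body.startswith("#REDIRECT")
        elif name == "revision":
            elem.clear()  # free revision text; history dumps are huge
        elif name == "page":
            if ":" not in title:  # same heuristic as the grep
                colon_free += 1
                if redirect_flag or last_text_redirect:
                    redirects += 1
            title, redirect_flag, last_text_redirect = "", False, False
            elem.clear()

    return colon_free, redirects


# Hypothetical miniature dump, only to exercise the function.
SAMPLE = b"""<mediawiki>
  <page><title>Foo</title>
    <revision><text>Some article text</text></revision></page>
  <page><title>Bar</title>
    <revision><text>An old stub</text></revision>
    <revision><text>#REDIRECT [[Foo]]</text></revision></page>
  <page><title>Talk:Foo</title>
    <revision><text>Discussion</text></revision></page>
</mediawiki>"""

if __name__ == "__main__":
    total, redirs = count_pages(io.BytesIO(SAMPLE))
    print("colon-free pages:", total)
    print("of which redirects:", redirs)
    print("remaining:", total - redirs)
```

For the real dump, the decompressed XML could be piped in and `sys.stdin.buffer` passed as the stream. The remaining count would still not match the official article figure exactly, since this sketch only accounts for redirects.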