[Xmldatadumps-l] Fwd: Number of pages on Wikipedia

Thomas Dalton thomas.dalton at gmail.com
Tue Jun 29 00:20:48 UTC 2010


This went off-list for some reason...


---------- Forwarded message ----------
From: Thomas Dalton <thomas.dalton at gmail.com>
Date: 29 June 2010 01:18
Subject: Re: [Xmldatadumps-l] Number of pages on Wikipedia
To: "Chrisil J. Arackaparambil" <chrisil at lanl.gov>


On 29 June 2010 01:06, Chrisil J. Arackaparambil <chrisil at lanl.gov> wrote:
> Hello everybody,
>
> I was doing a bit of analysis of the dump
> enwiki-20100130-pages-meta-history.xml.7z.  What I found to my surprise
> is that there are (at least) 7 million pages in the main namespace.  I
> got this figure by grepping for page titles that do not contain a ":"
> character.  Is this really the case or am I missing something?  I'd seen
> some Wikimedia stats that said the number of articles currently is about
> 3.2 million, so I'm not sure why I'm seeing so many pages in the dump.

The 3.2 million figure does not include redirects.



More information about the Xmldatadumps-l mailing list