whilst playing around with the db I got myself a bit confused.
when we say "100,000 articles on x.wikipedia.org" are we simply counting the number of entries in the cur table, or just those where is_cur_redirect=0 ?
http://en.wikipedia.org/wiki/Wikipedia:What_is_an_article
suggests that the figure on
http://en.wikipedia.org/wiki/Special:Statistics
does not include redirects, but that didn't seem to match with what I saw from (an admitadly 2 month old) db dump. Hence confusion.
While I'm hear, is any kind soul tracking things like # of stub articles, # of featured articles, etc? I realise this can be extrapolated from a DB dump, I'm more wondering if anyone is already doing so and publishing the stats somewhere?
Minty wrote in news.gmane.science.linguistics.wikipedia.technical:
when we say "100,000 articles on x.wikipedia.org" are we simply counting the number of entries in the cur table, or just those where is_cur_redirect=0 ?
my understanding is that this is the number of _articles_ in the cur table, where "article" is defined as a non-redirect page in namespace 0 with either 1 comma or 1 link (depending on how $wgUseCommaCount is configured).
does this make sense? i don't have the numbers at hand to check...
kate.
Actually, it's:
* A page in the main namespace * That is not a redirect * Contains one comma if $wgUseCommaCount evaluates to TRUE or alternatively contains "[[" (without the quotes).
wikitech-l@lists.wikimedia.org