On Monday 02 September 2002 07:15 am, David Levinson wrote:
Each article, as it ages, presumably tends to get longer as people add content. New articles start small. People add facts, they get larger. They spawn incomplete links[?] and new articles are created, but start small.
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
-- Daniel Mayer (aka mav)
--- Daniel Mayer maveric149@yahoo.com wrote:
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
Demographics by age? How will you determine the age of an article, particularly for articles that have been renamed, and given that the history only goes back to the automatic conversion script? Or demographics by size? Either would be extremely cool.
Peace, -Karl
__________________________________________________ Do You Yahoo!? Yahoo! Finance - Get real-time stock quotes http://finance.yahoo.com
Karl Juhnke wrote:
--- Daniel Mayer maveric149@yahoo.com wrote:
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
Demographics by age? How will you determine the age of an article, particularly for articles that have been renamed, and given that the history only goes back to the automatic conversion script?
It would be incomplete data, but at least partial stats on raw age or number of edits might be interesting.
Or demographics by size? Either would be extremely cool.
Scroll down to the bottom of Wikipedia:Statistics.
-- brion vibber (brion @ pobox.com)
Daniel Mayer wrote:
On Monday 02 September 2002 07:15 am, David Levinson wrote:
Each article, as it ages, presumably tends to get longer as people add content. New articles start small. People add facts, they get larger. They spawn incomplete links[?] and new articles are created, but start small.
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
This is an interesting approach. The article set could be divided into deciles (or some other suitable sized sample). I would guess that the median article size that we currently have is likely fairly constant. The markers for the deciles are likely similarly constant. This is, of course, subject to minor sampling variation. People who use population pyramids let them be indicators of broad changes in populations. Statistically significant variations from key markers would suggest differences in the patterns of participation.
Perhaps one of our more mathematically could do an analysis on "The fractal geometry of Wikipedia participation"
Eclecticology
Ray Saintonge wrote:
Daniel Mayer wrote:
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
This is an interesting approach. The article set could be divided into deciles (or some other suitable sized sample). I would guess that the median article size that we currently have is likely fairly constant. The markers for the deciles are likely similarly constant. This is, of course, subject to minor sampling variation. People who use population pyramids let them be indicators of broad changes in populations. Statistically significant variations from key markers would suggest differences in the patterns of participation.
Perhaps one of our more mathematically could do an analysis on "The fractal geometry of Wikipedia participation"
Eclecticology
I'd expect something like a [[Zipf distribution]], if there is a scaling principle operating.
Neil Harris
Neil Harris wrote:
Ray Saintonge wrote:
Daniel Mayer wrote:
You just gave me a great idea -- How about we have a population pyramid on the statistics page? See http://www.wikipedia.org/wiki/Netherlands
This is an interesting approach. The article set could be divided into deciles (or some other suitable sized sample). I would guess that the median article size that we currently have is likely fairly constant. The markers for the deciles are likely similarly constant. This is, of course, subject to minor sampling variation. People who use population pyramids let them be indicators of broad changes in populations. Statistically significant variations from key markers would suggest differences in the patterns of participation.
Perhaps one of our more mathematically could do an analysis on "The fractal geometry of Wikipedia participation"
Eclecticology
I'd expect something like a [[Zipf distribution]], if there is a scaling principle operating.
We may both be right. My interpretation of a Zipfian distribution is that it is a one tail curve. For articles less than median size it seems as though the curve for the distribution of article size would be a steeply rising one.
Ec
This is an interesting approach. The article set could be divided into deciles (or some other suitable sized sample). I would guess that the median article size that we currently have is likely fairly constant. The markers for the deciles are likely similarly constant. This is, of course, subject to minor sampling variation. People who use population pyramids let them be indicators of broad changes in populations. Statistically significant variations from key markers would suggest differences in the patterns of participation.
Perhaps one of our more mathematically could do an analysis on "The fractal geometry of Wikipedia participation"
My first guess would be that the article sizes are (from a minimum size that would be the size of an average mini-stub upward) distributed according to a Pareto distribution.
Andre Engels
wikipedia-l@lists.wikimedia.org