[Wikimedia-l] Thanking anonymous users
delirium at hackish.org
Fri Jan 17 22:07:08 UTC 2014
On 1/17/14, 3:55 AM, Erik Zachte wrote:
> Here are some charts which breakdown edits into several categories, reverts are counted separately. Of course edits is not editors, but it could be indicative of changed behavior patterns/policies. In the ongoing reassesment of metric definitions one thing discussed is whether we should count productive editors separately (I think we do), and if so on what basis (e.g. x edits per week/month which survived y days of not being reverted).
Thanks, that's interesting information!
Edits-that-survived is one baseline definition of productivity, though
I'd be interested in data with a higher threshold for productive edit,
if there's some way to distinguish them. For example what I really want
to know is what the trends are for article-writing activity, e.g. how
many editors we have who regularly expand articles with new information.
Spot-checking some articles, these edits are by far in the minority, so
in edit counts they are typically *completely* swamped by other kinds of
editing. For example, picking one random article with a short edit
history, here is a manual classification of its 12 edits:
- 2 substantial content additions (original author, plus 1 subsequent
author who added a paragraph)
- 7 bookkeeping/formatting edits (category sorting, maintenance tags,
infobox, bot replacing a template, etc.)
- 3 copyedit edits
Those are all valuable in various ways (and I do all three kinds of
editing myself), but a minority of the edits (2/12) have produced the
vast majority of the article content (and all of its references). What I
fear with the aggregate edit counts is that minor changes in bookeeping
edits swamp even significant changes in the content edits, because the
content edits are a relatively small percentage of the total count. So
I'd be interested in seeing data on those categories separated out,
though it's admittedly difficult to collect.
One mechanically countable statistic might be a count of edits that
include a reference. Would miss references not added with a <ref> tag or
other easily recognizable pattern, but might produce interesting trends
More information about the Wikimedia-l