Again, thank you for your responses.
As I stated previously, our research is more on the Jeremy Foote's line: We want to study the "wiki survival" or "wiki mortality" in highly diverse wiki ecosystem as it is Wikia and we want to focus more in *the death of wikis as a community*, so probably not so related to how much useful or outdated is the knowledge posted.
The first step to approach this is to define "Death". Indeed, It can correspond somehow with the activity of the wiki, as the activity represents the aliveness of a community. But definitely bots, changes made globally to all wikis by Wikia and, maybe, small format corrections should be kept apart. Thank you all for pointing those out!
There is a major question that maybe is the key to unlock the definition of these concepts:
*Is inactivity equal to death community or is inactivity just an indicator of possible death?*
So, in the first definition I brought out, they used the term "dormant", leaving some room for the hope that the wiki can be retaken or awaken at some later point. So I believe that, what we are looking actually forward is for a definition closer to what Leila said about dropping-out users editors (Thank you very much for the reference!), but applied to wikis, so something like "dropped-out wikis" could be what we are looking for in order to have a proxy for wiki death or wiki abandonment. @Leila: Do you think that we could do some equivalent study to evaluate when a wiki shifts from "inactive" to "abandon" ? Is this what you meant when saying "inactive may or may not be equal to dead in your definition."? Or what did you mean by saying so?
I want to add, and I think it's worth noting, that apart from activity data, we also have data regarding the visits to the wiki. This give us an idea on how much attention or usefulness the wiki is still providing to visitors. But I find this data more related to the interest or, even, "activity" of the wiki rather than the aliveness of the community behind.
El mar., 6 nov. 2018 a las 14:03, Leila Zia (leila@wikimedia.org) escribió:
Hi Abel,
Some years back, Erik Zachte and I came up with a proposal related to your question. (inactive may or may not be equal to dead in your definition.)
A project is considered active if it satisfies the following conditions:
- The wiki should be hosted by the Wikimedia Foundation
- Open (not closed/locked or private)
- The wiki should have less than a very permissive threshold of
activity: At least three active editors with 5+ edits/month/editor needed. This threshold is perhaps too permissive but it already cuts away a large number of almost inactive projects.
We haven't put the above anywhere public (we totally should).
And since you may run into this other question soon, too: If you want to see how many months of inactivity for a username means the user has dropped out of the project, please check section 1B in https://meta.wikimedia.org/wiki/Research:Understanding_thanks#Discussion.
Best, Leila
On Mon, Nov 5, 2018 at 2:05 PM Kerry Raymond kerry.raymond@gmail.com wrote:
While perhaps not relevant to the original enquiry ...
For Wikipedia, do we need to know (or care) about the "death" of:
- a WikiProject
- a Portal
- a category
- an article
While there are articles on historical topics which might, once written,
arguably not need further updating, there are many articles, categories, portals, projects which do as they involve topics that are current in the real world. As a simple example, for a town, we report on population and temperatures. For an electorate, there are elections and new people elected. Sports results etc. If readers visit articles with out-of-date information, they may see less value in the article and therefore Wikipedia as a whole. Have we ever thought of having some system of tagging articles of this nature, either as a whole or in sections or in infobox fields that indicates when the information could be considered out of date. E.g. we have the Australian census every 5 years (the last being 2016). It takes them about a year to release the data to the public (so 2017 we had first access to the 2016 data), so we might think it desirable that all Australian places with census data have been updated by 2018 and certainly we would not think it acceptable if there were any with 2006 (or earlier) census data (except as historical information). Yet of course there are many such out-of-date Australian articles, but probably we don't have an easy way to know which ones. (Before anyone rushes to tell me about Wikidata solutions, I would point out that the average Australian Wikipedia editor neither knows nor cares about Wikidata and our attempt to add 2016 census data from Wikidata more-or-less collapsed from lack of community support). I'm thinking here about solutions that Wikipedians might understand, such as templates which have a tracking category that is activated when the article misses an update deadline based on some template in the article.
Of course, on Wikipedia, many articles have the illusion of being
actively maintained in the sense of edits occurring, thanks to vandalism and reverts of vandalism, endless re-categorisation, automated changes of a trivial nature (e.g. dash length), the Internet Archive Bot and other bots, copyedits etc. As someone who does her watchlist diligently, I am seeing increasing activity over articles (my daily watchlist seems to be growing faster than the number of entries on my watchlist) which suggests we are more active, but, when I look at the edits, relatively few of them are updates to the information content. So activity should not be equated to information currency. Note, as anyone who deals with visible metrics soon learns, people game them and our edit counts are a classic example. I sometimes wonder what would happen if we suppressed that information. Or better still, counted something that we value more than "number of times clicked Save". What if we only counted the number of citations added (or counted it in addition to plain old edit count)? Would that drive behavioural change from less information-productive activities towards more information-productive activities?
But if we can have some measure of information-activity/inactivity for
an article, then I presume we can aggregate this across any natural groupings of articles (e.g. Category trees, Portals, WikiProjects) to discover where we are stagnating and then let humans decide if that topic space is one that can stagnate (because it is historic) or one that must be updated periodically to be considered useful and whether the correct frequency of updates seems to be occurring, either macroscopically or (ideally) microscopically around particular time-sensitive factoids.
Can we measure "information growth"?
Kerry
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l