Hi,
we have a winner. Because we did everything manually, vote counting has been a bit of work, but now it's done and even with minor errors, the results are fairly clear. The new article count system is going to be based on two factors:
1) An article is counted if, trimmed of all trailing whitespace (blanks, newlines etc.), it is longer than zero bytes (non-empty) AND 2) it contains at least one link.
Redirects, talk pages, user pages and Wikipedia: pages are not counted.
The vote consisted of two parts, the main vote on article size and a vote on further restrictions.
The results for the main vote were, ranked lowest (better) first:
1) zero bytes: 39 votes, averaging 3.1795 2) 100 bytes: 37 votes, averaging 3.4595 3) 20 bytes: 38 votes, averaging 3.8158 4) 250 bytes: 37 votes, averaging 4.027 5) 5 bytes: 36 votes, averaging 4.4167 6) language-dependent: 33 votes, averaging 4.5758 7) 500 bytes: 37 votes, averaging 4.7027 8) dynamic (e.g. depending on stub size): 35 votes, averaging 5.1142
The votes for further restrictions, of which only one is to be picked, were:
1) at least one link: 36 votes, averaging 3.1667 2) independent system for each wikipedia: 31 votes, averaging 4.4516 3) stub flag (stubs excluded from count): 34 votes, averaging 4.5882 4) minimum number of contributors: 35 votes, averaging 4.6857 5) language-dependent punctuation (comma etc): 36 votes, averaging 4.7778 6) two paragraphs minimum: 33 votes, averaging 5 7) <article> tag 33 votes, averaging 5.1515 8) no further restriction 32 votes, averaging 5.1875 minimum number of edits 32 votes, averaging 5.1875 9) divide database size by byte size: 30 votes, averaging 5.4333 10) existing comma requirement 38 votes, averaging 5.5789
I have done my best to avoid errors, and for the first stage have compared with Tomos' count, but I cannot be certain. While an error is unlikely to affect the result, pedants may want to doublecheck just in case. Please note that votes added after yesterday's deadline should not be counted. I also did not count the anonymous vote (6 against dynamic).
Analysis of results ===================
The opinion regarding the main size restriction can be divided into two camps: One group thought simply counting non-blank articles would be enough, the other felt that excluding very short articles would also be necessary. Consensus between these two groups was unlikely. The non-blank camp won by a relatively narrow margin, but those who wanted more restrictions got an important victory in the second stage of the vote: Only articles including at least one link are counted, which excludes most newbie experiments.
This, in my opinion, is an almost perfect result that everyone should be able to live with. It demonstrates well that voting systems can arrive at compromises not just as well, but even better than simple discussions. How long would we have needed to talk to agree on this solution and to determine agreement? My guess is that we would not have arrived at it, ever.
But it is a good solution. There can simply be no valid article in the Wikipedia system without links -- the whole wiki concept depends on high interconnectedness. Material copied from somewhere else is not "wikified", however -- nor are newbie experiments. Such "articles" are now excluded from the count, as they should be. At the same time, we do not choose an arbitrary byte size limit that would always remain arguable. And I do not foresee users adding <!--[[]]--> HTML comments to an article just to have it counted, as has happened with the comma.
Yet, even with this solution, there are people who feel strongly that it is a bad one: 8 people have voted that counting only articles with at least one link is a "very bad" idea (11 people thought it is a very good idea). Once again, it is unlikely that there would have been consensus between these groups, further disproving the consensus model. With groups of 30 and more people, there is simply never going to be "near unanimous consensus" about anything but the most obvious questions.
But some results came as no surprise: Keeping the comma count was almost universally rejected, and it is debatable whether we should have included that option in the first place. Second place in the restrictions stage, with considerable distance, was the option to let each Wikipedia decide on its own. While this option clearly lost, its rank highlights that many Wikipedians want "their" Wikipedia to have room for independent decisions. A similar option should thus be included in future polls.
Finally, I would like to point out that the process has led to a remarkable number of ideas -- some of them awful, sure, but some of them, like the link idea, have never been mentioned on the mailing list. This, too, demonstrates the advantages of a formalized brainstorming process.
Analysis of methodology =======================
What have we learned from the process we used here?
1) The participation rate was very high. Concerns that an option might win because of neglect were unjustified. HOWEVER, options which were added too late tended to gather significantly fewer votes. Options which were apparently written off as unlikely to win also tended to gather fewer votes.
2) Votes tended toward the extremes, i.e. 6 or 1. For most options, however, the entire spectrum was used.
3) The system used therefore allowed us to gather a very large amount of information about the opinions held. It would be difficult, if not impossible, to gather as much information through a non-formalized process.
4) Deadlines and limits on options must be even more strictly enforced. The deadline for proposing options should be longer, but options should be discussed more carefully.
5) The combination of options and the possible requirement to split up voting into stages need to be discussed to avoid ambiguity (e.g. "can more than one option win?").
6) The voting system used takes some effort to handle manually, but would be relatively easy to implement in code. Until we have such a software-based solution, using this system would probably be overkill for small decisions. In any case, we should try consensus finding first.
These are my thoughts for now -- please add yours.
Regards,
Erik
Erik Moeller wrote:
Finally, I would like to point out that the process has led to a remarkable number of ideas -- some of them awful, sure, but some of them, like the link idea, have never been mentioned on the mailing list. This, too, demonstrate the advantages of a formalized brainstorming process.
And some mentioned on the mailling list never get in the voting process. But we got a result, thats the important point.
I hope we get an automatic voting next time, which don't show who's voting what. It will reduce possible influences. Nothing against publishing the result with names, but not during the voting process.
Smurf
Thomas Corell wrote:
I hope we get an automatic voting next time, which don't show who's voting what. It will reduce possible influences. Nothing against publishing the result with names, but not during the voting process.
Oh, I think publishing the names as we go is a very good thing. I think possible influences are good. If people I respect are voting differently from me, it may give me pause. If people I don't respect are voting the same as me, it may give me pause.
Voting isn't supposed to be an alternative to open discussion and consensus building, but a methodology to formalize it.
--Jimbo
Thomas Corell wrote:
I hope we get an automatic voting next time, which don't show who's voting what. It will reduce possible influences. Nothing against publishing the result with names, but not during the voting process.
Oh, I think publishing the names as we go is a very good thing. I think possible influences are good. If people I respect are voting differently from me, it may give me pause. If people I don't respect are voting the same as me, it may give me pause.
Voting isn't supposed to be an alternative to open discussion and consensus building, but a methodology to formalize it.
I agree with Jimbo here, for the reasons I have already explained on Talk:Article count reform. Here's a copy:
Transparency works both ways -- it may encourage groupthink, but it also lets you use strong votes to express dissent with options you dislike that seem to be winning, esp. in a preferential system. In my experience, hiding votes sounds like a good idea at first, but works badly in practice because what is often snobbishly called groupthink is really valuable information that you don't want to do without once you lose it. Should I bother reading this option if all people I trust have rejected it? Should I maybe give this a closer look if people I don't trust have buried it in negative votes? I also find it funny that the argument "groupthink" should be used against open voting, whereas we try to strive for consensus without voting whenever possible, a process which is much more likely to encourage groupthink ("Gee, I don't want to stop the consensus!").
Secrecy is necessary for real world votes, but here we can afford some openness. It's the wiki way.
Regards,
Erik
Erik Moeller wrote:
Secrecy is necessary for real world votes, but here we can
afford some openness. It's the wiki way.
One reason secrecy is important in real world political voting is the possibility of backlash against those who vote on the losing side. That's not really possible here. The winners can't round up the losers into concentration camps.
--Jimbo
On Tue, 18 Mar 2003, Jimmy Wales wrote:
Erik Moeller wrote:
Secrecy is necessary for real world votes, but here we can
afford some openness. It's the wiki way.
One reason secrecy is important in real world political voting is the possibility of backlash against those who vote on the losing side. That's not really possible here. The winners can't round up the losers into concentration camps.
It depends... What about the next vote being to ban the losers from all Wikipedias except the Russian one?
Andre Engels
On Tue, 18 Mar 2003, Jimmy Wales wrote:
Erik Moeller wrote:
Secrecy is necessary for real world votes, but here we can
afford some openness. It's the wiki way.
One reason secrecy is important in real world political voting is the possibility of backlash against those who vote on the losing side. That's not really possible here. The winners can't round up the losers into concentration camps.
It depends... What about the next vote being to ban the losers from all Wikipedias except the Russian one?
Well, what about a "nearly unanimous consensus" to do that (decided by Jimbo)?
On 3/18/03 11:00 AM, "Erik Moeller" erik_moeller@gmx.de wrote:
Secrecy is necessary for real world votes, but here we can afford some openness. It's the wiki way.
Yeah, but voting *isn't* the wiki way.
On 3/18/03 11:00 AM, "Erik Moeller" erik_moeller@gmx.de wrote:
Secrecy is necessary for real world votes, but here we can afford some openness. It's the wiki way.
Yeah, but voting *isn't* the wiki way.
Obviously we have different definitions.
Regards,
Erik
(Erik Moeller erik_moeller@gmx.de):
On 3/18/03 11:00 AM, "Erik Moeller" erik_moeller@gmx.de wrote:
Secrecy is necessary for real world votes, but here we can afford some openness. It's the wiki way.
Yeah, but voting *isn't* the wiki way.
Obviously we have different definitions.
I'm with Cunc on that one: the Wiki way is jump first, ask questions later. I'm pretty down on voting as a way to accomplish anything useful, but the openly-signed voting process is at least better. It's more like a record of discussion than a vote. The Wiki way would then be for the person actually writing the code to look at the discussion and let it influence him--but not 100%. If he thought the second most popular option was the one he wanted to implement, he should do that.
Secrecy is necessary for real world votes, but here we can afford some openness. It's the wiki way.
Yeah, but voting *isn't* the wiki way.
I agree that the openness is doable... just disagree with the method... anyone could have set up a form and a server script to compile the results, and published those results daily so that we could see the innards... I find arguments against increasing efficiency, without corrupting collegiality, as counterproductive... and certainly will bite WP in the ass if this philosophy carries over into streamlining heavy bandwidth/editing issues, if this project ever *really takes off... ;) -SV
Jimmy Wales wrote:
Oh, I think publishing the names as we go is a very good thing. I think possible influences are good. If people I respect are voting differently from me, it may give me pause. If people I don't respect are voting the same as me, it may give me pause.
The programmer inside me cries "deadlock";) You wait until some/all of the respected and some/all of the others made their vote, to know what you will vote? If this will be done by everyone, voting won't work.
I think time for influences is in the discussion before the voting and not during voting. But this topic isn't realy important in case of wikipedia desissions.
For example: one important thing (IMHO) for Wikipedia is the speed of a database request. In the voting and in the discussion about the Article count nobody calculated the complexity of some of the options. I think some of the admins will refuse methods for this count which will need a lot of performance. And I would respect such a decision.
Smurf
On Tue, 2003-03-18 at 08:19, Thomas Corell wrote:
For example: one important thing (IMHO) for Wikipedia is the speed of a database request. In the voting and in the discussion about the Article count nobody calculated the complexity of some of the options. I think some of the admins will refuse methods for this count which will need a lot of performance. And I would respect such a decision.
The live article count is maintained on an action-by-action basis. That is, there's a stats table in the database that keeps track of a couple of numbers, including the article count. When an article is created, deleted, or modified, the software compares the previous text (if any) with the new text (if any) and either increments or decrements the counter if appropriate. Displaying the counter is simply a matter of fetching one number from a table.
So there's really not much database impact; some of the ideas about number of revisions or number of people editing would require a check of the database for the article's history, but only individually as part of the process of changing an article (and it'd be a drop in the bucket between updating link tables and whatnot!)
-- brion vibber (brion @ pobox.com)
(Stevertigo stevertigo@attbi.com):
I endorse this result.
This sounds kinda like the CIA putting its stamp of approval on a third world election... ;)
...except that Jimbo probably didn't engineer the result he's endorsing.
On Tue, 18 Mar 2003 14:12:11 -0600, Lee Daniel Crocker lee@piclab.com wrote:
(Stevertigo stevertigo@attbi.com):
I endorse this result.
This sounds kinda like the CIA putting its stamp of approval on a third world election... ;)
...except that Jimbo probably didn't engineer the result he's endorsing.
If the US was a third world country it's last presidential election would have been re-run with UN supervisors! :-)
I didn't get a chance to vote, BTW - I remembered about the poll twice - once I spent 10 minutes trying to find the page, by which time Wikipedia became unresponsive, and the second time happened to be during last night's outage. (reminder messages with the URL would ahve been a very good idea - in fact, a mailing to every registered user with the URL of the page would have been even better.
On Wed, 19 Mar 2003, Richard Grevers wrote:
I didn't get a chance to vote, BTW - I remembered about the poll twice - once I spent 10 minutes trying to find the page, by which time Wikipedia became unresponsive, and the second time happened to be during last night's outage. (reminder messages with the URL would ahve been a very good idea - in fact, a mailing to every registered user with the URL of the page would have been even better.
We can do like in Florida, count the absentee ballots postmarked after the election. :)
-- Daniel
On 18 Mar 2003, Erik Moeller wrote:
These are my thoughts for now -- please add yours.
In my opinion, the voting period should be lengthened to AT LEAST 1 week. I wanted to put a message on WikipediaNL, notifying people of the vote, but I forgot it at first, and then it wasn't necessary anymore as the voting period had already ended.
Andre Engels
Erik Moeller wrote:
- An article is counted if, trimmed of all trailing whitespace (blanks,
newlines etc.), it is longer than zero bytes (non-empty) AND 2) it contains at least one link.
This is almost exactly the result that I wanted. The only thing that I'd prefer is to let each wiki decide for itself, but that takes a lot more coding.
I have done my best to avoid errors, and for the first stage have compared with Tomos' count, but I cannot be certain. While an error is unlikely to affect the result, pedants may want to doublecheck just in case. Please note that votes added after yesterday's deadline should not be counted. I also did not count the anonymous vote (6 against dynamic).
And since Jimbo endorsed the result, you didn't count my vote, right? (See my comment on the talk page.)
Finally, I would like to point out that the process has led to a remarkable number of ideas -- some of them awful, sure, but some of them, like the link idea, have never been mentioned on the mailing list. This, too, demonstrates the advantages of a formalized brainstorming process.
A formalized brainstorming process? Certainly a good idea. Since the first of your voting pages that I saw, I've thought that they were a good way to get people to come out and talk. It's insisting that the results be valid that is problematic.
- The system used therefore allowed us to gather a very large amount of
information about the opinions held. It would be difficult, if not impossible, to gather as much information through a non-formalized process.
Same comment as above.
- The combination of options and the possible requirement to split up voting
into stages need to be discussed to avoid ambiguity (e.g. "can more than one option win?").
In this case, a maximum of 1 restriction other than size was allowed. Indeed, several things like this were decided by the voting administrator, not the voters.
These are my thoughts for now -- please add yours.
Oh, and I like approval voting better than this method. ^_^
-- Toby
--- Erik Moeller erik_moeller@gmx.de wrote: 4) Deadlines and limits on options must be even more strictly enforced. The deadline for proposing options should be longer, but options should be discussed more carefully.
In any case, we should try consensus finding first. These are my thoughts for now -- please add yours. Regards, Erik
Less than 48 hours to set options, discuss options, formalize options, in 40 languages, is not enough. Especially when the server is basically down during most wake hours. In the end, unless a lot of bantering noise is done, only a couple of people from each language participate, and these latter don't necessarily represent the widest view on their wikipedia.
PS : I am glad at least the [[]] was voted as a restriction.
__________________________________________________ Do you Yahoo!? Yahoo! Platinum - Watch CBS' NCAA March Madness, live on your desktop! http://platinum.yahoo.com
wikipedia-l@lists.wikimedia.org