Hi Andreea,

I have processed most of the RfA data of the german wikipedia with a imperfect workflow of

1. a perl script that fetches the data (http://en.wikipedia.org/wiki/Perl)

2. a manual correction of exceptions and

3. a validation and import by an GNU R script. (www.r-project.org/)

Because of exceptions the process can as far as I see not be fully automated. The perl script is just a quick hack and I had to change it during the process. So it is still a lot of manual work as long as there is no better implementation of votes in the wiki engine.

Contact if you want me to send it to you.

----

(Besides many questions that I wanted to answer but wasn't able to answer (-; ) I am interested in the simple question: What dertermines vote decisions of individuals in WP?

I expect "pro"-votes to be

1. a mixture of expression of TRUST in (a) the candidate, (b) the nominating user or (c) users that have voted before.

2. a bounded rational (especially affective distorted) decision about the expected benefit of a candidate

(a) according to OWN political and organisation-political INTERESTS and

(b) according to the perceived benefit to ORGANISATIONAL INTERESTS.

I am about to test the effect of different user variables (aggregated of logged user actions like revisions and user relations that can be accessed from within the https://wiki.toolserver.org/view/Main_Page) on vote decisions.

As far as I see that is complementary to surveying the voters what Ben is about to do. From a sociological point of view what participants think and say musn't have to be exactly what they are doing.

Best wishes from Germany

Marc