Hi Andreea,
I have processed most of the RfA data of the german wikipedia with a imperfect workflow of
2. a manual correction of exceptions and
3. a validation and import by an GNU R script. (www.r-project.org/)
Because of exceptions the process can as far as I see not be fully automated. The perl script is just a quick hack and I had to change it during the process. So it is still a lot of manual work as long as there is no better implementation of votes in the wiki engine.
Contact if you want me to send it to you.
----
(Besides many questions that I wanted to answer but wasn't able to answer (-; ) I am interested in the simple question: What dertermines vote decisions of individuals in WP?
I expect "pro"-votes to be
1. a mixture of expression of TRUST in (a) the candidate, (b) the nominating user or (c) users that have voted before.
2. a bounded rational (especially affective distorted) decision about the expected benefit of a candidate
(a) according to OWN political and organisation-political INTERESTS and
(b) according to the perceived benefit to ORGANISATIONAL INTERESTS.
I am about to test the effect of different user variables (aggregated of logged user actions like revisions and user relations that can be accessed from within the
https://wiki.toolserver.org/view/Main_Page) on vote decisions.
As far as I see that is complementary to surveying the voters what Ben is about to do. From a sociological point of view what participants think and say musn't have to be exactly what they are doing.
Best wishes from Germany
Marc