If anybody is interested, I've made a list of 'most reverted pages' in the
english wikipedia based on the analysis of the enwiki-20100130 dump. Here is
the list:
http://wpcvn.com/enwiki-20100130.most.reverted.tar.bzhttp://wpcvn.com/enwiki-20100130.most.reverted.txt
This list was calculated using the following sampling criteria:
* All pages from the enwiki-20100130 dump;
** Filtered pages with more than 1000 revisions;
** Filtered pages with revert ratios > 0.3;
* Sorted in descending revert ratios.
Page revision is considered to be a revert if there is a previous revision
with a matching MD5 checksum;
BTW, if anybody needs it, the python code that identifies reverts, revert
wars, self-reverts, etc is available (LGPL).
-- Regards, Dmitry
Dear All,
we have put together APIs that make available for each revision:
- A 'quality score' for the revision, which is 1 minus our estimated
probability that the revision is vandalism. That is, this notion of quality
only captures "absence of vandalism".
- A bunch of raw scores, which include the reputation of the author,
information on how much the text has been revised by a mix of different,
trusted authors, size of the change from the previous revision to this one,
and more.
Details of the API calls are at http://www.wikitrust.net/vandalism-api .
The data should be available for all revisions. Occasionally, we might lack
information for a recent revision.
Ideally, if a revision is missing, we would put it in a queue of revisions
to be processed; I am not sure whether this mechanism is in place already
though; but if not, we will provide it soon.
I hope this is of interest! We are looking to add more APIs, but this is a
start.
Luca
Hi all,
I'm Steven Walling, a longtime Wikimedia volunteer. Damian Finol (also a
longtime volunteer) and I are working on the beginnings of a new research
project in cooperation with the Foundation's Community Department.
Everyone knows that editors are publicly listed based on edit count, and
some other details are visible related to the type of contributions an
individual user makes.
The goal of this project is to try and highlight highly active volunteers
who may not participate in tasks that produce a high edit count. By creating
a detailed taxonomy of sorts for all the different roles users can take in a
project, we hope to get a better picture of who the most active contributors
are and what they are doing.
If anyone has done similar roles-based investigations into volunteer
participation or has any suggestions at all, please feel free to contact us
at feedback(a)wikimedia.org.ve or via the project's page on Meta at
http://meta.wikimedia.org/wiki/Contribution_Taxonomy_Project
Thanks,
Steven Walling
http://enwp.org/User:Steven_Walling
[[Wikipedia:Ein kritischer Standpunkt]]
September 25-26, 2010
University Library Leipzig, Germany
On 25th and 26th of September 2010 the German speaking conference [[Wikipedia:Ein kritischer Standpunkt]] ([[Wikipedia:Critical Point of View]]) will take place at the University Library in Leipzig, Germany. The conference will gather Wikipedia researchers, critics as well as community-members from the German-speaking world for an interdisciplinary debate. In particular the significance of Wikipedia for education, politics, culture and society will be discussed.
Wikipedia is one of the largest, if not the largest, self-contained general knowledge reference of our time. It offers critical insights into the contemporary status of knowledge, its organizing principles, function, impact, production styles, mechanisms for conflict resolution, and relation to power (re-)constitution. New strategic and tactical operations of knowledge and power are clearly at work through Wikipedia. Of specific interest is the concept of 'the open', which is ambiguous within the social formation(s) constituted by Wikipedia, serving as both a rallying concept of digital democracy enthusiasts and as an ideoglical nodal point masking new agonistic encounters.
In both material and perceptional ways, every new technology modifies the conditions of possibility for knowledge. The logic of technologies bleeds into the very structures and organizing principles of knowledge, and today both medium and message may reflect the ideas of the (organized) network, multitude, or the Deleuzian machine. It is through a selected mix of technological and normative conditions – the distributed architecture of the net, the Wiki software platform, commons-based property licenses and the FLOSS zeitgeist – that Wikipedia as the encyclopedia of the information age emerges, both continuing and transforming the Enlightenment encyclopedic impulse or will to know.
The main topics of the conference are Wikipedia & The Politics of Open Knowledge, Digital Governance, and Wikipedia & Education. These topics derive from the significance of the online encyclopedia in the reconfiguration of knowledge (re-)production and its consequences for the public, architectures of participation, and political education in a media democracy. Alongside presentations of established scholars like Christian Stegbauer, Peter Haber, Rainer Hammwöhner, Ramón Reichert, and Ulrich Johannes Schneider, the programme of the conference will consist of a panel discussion of Wikipedia community-members and critics, as well as Wikipedia-workshops and a research network meeting.
The research network meeting addresses Wikipedia researchers to discuss their current research and draft new research projects. Especially aimed for young academics, the research network meeting is planned as open space, allowing its participants to actively engage in the event as questions and topics are shaped and discussed among the group. To participate, we ask for a registration by email not later than August 31, 2010 to info(a)cpov.de. Please include a description of your research interest or abstract of your research on one page and tell us, if you are interested to make a short presentation.
The Leipzig conference continues the series of international conferences of the Wikipedia Research Initiative Critical Point of View from January and March 2010 in Bangalore (India) and Amsterdam (Netherlands). It is hosted by cultiv – Gesellschaft für internationale Kulturprojekte e.V. in cooperation with the Research Initiative Critical Point of View and funded by the Bundeszentrale für politische Bildung.
The conference will be open to the public. There will be no participation fee. Conference language is German.
For further information please visit the conference website: www.cpov.de
Deadline for the Registration for the network meeting: August 31, 2010
Concept and Editorial board: Geert Lovink, Johanna Niesyto and Andreas Möllenkamp
Contact
cultiv
Gesellschaft für internationale Kulturprojekte e.V.
Bernhard-Göring-Str. 65
D-04107 Leipzig
Tel. +49-341-2228893
Email: info(a)cpov.de
www.cpov.de
Hi Piotr,
Stats on the number of new editors, and other metrics like that, are
held on the stats site: http://stats.wikimedia.org. Particularly,
on http://stats.wikimedia.org/reportcard/ you will find (under the
"community" section" the specifics about that measure.
Philippe
____________________
Philippe Beaudette
Head of Reader Relations
Wikimedia Foundation
philippe(a)wikimedia.org
Imagine a world in which every human being can freely share in
the sum of all knowledge. Help us make it a reality!
http://wikimediafoundation.org/wiki/Donate
>
> ---------- Forwarded message ----------
> From: Piotr Konieczny <piokon(a)post.pl>
> Date: Tue, Aug 10, 2010 at 1:21 PM
> Subject: Re: [Wiki-research-l] WMF Staff Introductions.
> To: Research into Wikimedia content and communities <wiki-research-l(a)lists.wikimedia.org
> >
>
>
> Welcome!
>
> I have to say that
> http://strategy.wikimedia.org/wiki/
> Former_Contributors_Survey_Results of
> which I've just learned from your post is an excellent piece of
> research, one that was needed for a very long time.
>
> One question comes to mind: we now, roughly, how many editors we are
> gaining per months. Are there any estimates on how many we are losing
> (per month, year, total)? I cannot find such numbers in that survey.
>
> --
> Piotr Konieczny
>
> Parul Vora wrote:
> > Hello everyone,
> >
> > We (most of the current staff at the Wikimedia Foundation currently
> > engaging in research) had a chance to meet some of you at Wikisym
> and
> > Wikimania this year and thought it would be nice to introduce
> ourselves
> > and say hi to all of you! All of us have joined WMF in the past two
> > years and are working on projects or research questions that may be
> > relevant or of interest to all of you. Also, as far as I know, we
> are
> > all new to this list and will hopefully be talking and collaborating
> > with you more in the future - both here and on the Meta Research
> page.
> >
> > So, in no particular order, some introductions from all of us:
> >
> > From Nimish <http://wikimediafoundation.org/wiki/
> User:Nimish_Gautam>
> > (ngautam(a)wikimedia.org): Hi, I'm Nimish Gautam. I started with the
> > foundation in 2009 doing development for the Usability Initiative,
> which
> > focused on new editors. I like analyzing user behavior to figure
> out how
> > people use the tools we give them (turning templates into a
> programming
> > language, who would've guessed?) and spotting trends so we can
> improve
> > those tools to help people accomplish what it is they're trying to
> do.
> > Currently I'm doing qualitative and quantitative research on user
> > behavior for the foundation and its various projects, and very
> > interested in finding ways of chunking all this information
> together to
> > make pretty, compelling, informative resources so people know what's
> > going on in the wikiverse and hopefully want to be a part of it.
> >
> > From Howie
> > <http://wikimediafoundation.org/wiki/Staff#User_Experience_Programs>
> > (hfung(a)wikimedia.org): Hello! I'm Howie and I'm a Senior Product
> Manager
> > at the Wikimedia Foundation. As a product person, I'd like to work
> with
> > the community towards more data-driven decision making. One area I'm
> > particularly interested in is getting a better understanding of our
> > user's lifecycle with our projects -- how they come to the
> projects, how
> > they start contributing, their experiences as a contributor, why
> they
> > leave, and why they return. I like to use both quantitative and
> > qualitative methods to obtain as complete a picture as we need to
> guide
> > our decisions. On the quantitative side, I'm working on getting
> better
> > web analytics for our projects. I'm also interested in any data
> mining
> > projects along these lines (e.g., contribution behavior, user
> lifecycle
> > patterns, etc.). On the qualitative side, I worked on the "Why
> Editors
> > Leave Wikipedia" survey and would be interested in other qualitative
> > measurements (e.g., interviews, surveys, focus groups). If you're
> > interested in any of the above topics, please drop me a line.
> >
> > From Amy
> > <http://wikimediafoundation.org/wiki/Staff#Public_Policy_Initiative>
> > (aroth(a)wikimedia.org): Hi, I'm Amy, the Research Analyst for the
> Public
> > Policy Initiative
> > <http://outreach.wikimedia.org/wiki/Public_Policy_Initiative>. My
> task
> > is to assess the project's impact on: U.S. public policy article
> > quality, public policy categorization, new articles, and new
> > contributors. Through the project I have focused on article quality
> > assessment, and worked with the community to add a quantitative
> value to
> > the current article assessment
> > <http://en.wikipedia.org/wiki/Wikipedia:WikiProject_United_States_Public_Pol…
> >.
> > As a data analyst, I am interested in improving data accessibility
> from
> > Wikipedia. In my dreams, I envision data from the assessment tools
> that
> > exist within Wikipedia are captured in a real-time database, so
> that we
> > can observe what is currently happening in Wikipedia and how it is
> > evolving in the present, rather than having to use data dumps to get
> > snapshots of the state of Wikipedia. I have experience analyzing and
> > designing surveys and would like to use that experience to take a
> more
> > in depth look at contributor demographics and motivations. I am
> excited
> > to be a part of this huge collaborative project with a mission to
> make
> > knowledge accessible.
> >
> > >From me, Parul Vora
> > <http://wikimediafoundation.org/wiki/User:Parulvora>
> > (pvora(a)wikimedia.org): Hi Everyone! I'm a researcher and designer
> with a
> > focus on participatory and collaborative spaces. I started at the
> > Wikimedia Foundation in 2009 and moving forward have interest in:
> > creating new forms of participation (beyond editing) on the projects
> > that better engage a wider audience with the content and each other;
> > assessing, evaluating and addressing the demographic and cultural
> biases
> > in our projects; and exploring location, culture and language as
> they
> > affect the development patterns of different language Wikipedias
> in an
> > effort to identify potential for experimentation and catalysis in
> > younger projects. I'm currently exploring the potential effect
> feedback
> > systems (article ratings, expert reviews, visualizations of an
> article's
> > history or a user's contributions) can have on the engagement of
> > readers, actions of editors, and the quality of content over time. I
> > like infovis, ux research, and unresearched innovation and I am
> > interested in learning more about research with wikipedia on
> motivation,
> > behavioral economic modeling and/or game theory, using geolocative
> data,
> > mobile experiences, and profiling and trend
> visualizations......and your
> > work too!
> >
> > Let us know if you're interested in learning more, participating
> in, or
> > contributing to our efforts. And drop any of us a line if we could
> learn
> > from or contribute to what you've been working on.......
> >
>
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
>
Hello all,
The Wikimedia Foundation is looking for volunteers who would like to
support the management of relationships between Wikimedia communities
and the broader communities of researchers who study Wikimedia
projects. We hope to create a committee with volunteers from both
groups with a rich combination of skills and backgrounds.
Here are some areas of work that this new Wikimedia Research
Committee, with help from the Wikimedia Foundation staff, is intended
to explore:
* developing policy around researcher permissions for non-public data
* supporting the development of subject recruitment processes
* reviewing research projects when conflicts-of-interest arise
* articulating and channeling requests for data and technical resources
* helping to formulate the key strategic research objectives of the
Wikimedia movement (see strategy.wikimedia.org)
* helping to formulate small tactical experiments related to
Wikimedia's strategic goals
* developing an open access policy as a requirement for significant
support from the Wikimedia Foundation
* helping create a "starter kit" for researchers to avoid duplication of effort
To date, many of these issues have been handled by the Wikimedia
Foundation on a case-by-case basis, with various points of contact and
unclear responsibilities. The research committee will act as a first
point of contact for many, if not most, of these requests, and help us
to formulate processes and policy beyond the boundaries of projects
and languages.
We are estimating the minimum time commitment for membership in this
committee to be 2-4 hours per week.
To be clear, we are also hiring additional research and engineering
staff. This committee is intended to work in close partnership with
Wikimedia Foundation staff.
If you are interested in joining this committee, please write an
off-list e-mail response to <erik at wikimedia dot org> with the
following information:
1) Your name and background;
2) Explanation of your role and interest in research projects and/or community;
3) Key areas of activity you'd like to focus on as a member of such a committee.
This will help us seed the committee with a diverse group of
individuals. The committee can later be expanded beyond the initial
membership, once internal processes for doing so are established
(similar to the chapters committee).
We look forward to hearing from you. :-) Please disseminate this
message further where relevant. We will aim to respond to applications
within two weeks.
All best,
Erik
--
Erik Möller
Deputy Director, Wikimedia Foundation
Support Free Knowledge: http://wikimediafoundation.org/wiki/Donate