As part of its product development program, the Wikimedia Foundation's Tech Department
will be releasing regular data dumps for all the features that are currently being
implemented. The first weekly dumps from the Article Feedback Tool – an experimental
feature to engage readers to interact with Wikipedia's contents via a quality rating
system [1] – are available since this afternoon [2]. The latest datasets contain raw
ratings data collected each week from a random sample of 100K articles of the English
Wikipedia. More datasets will be released in the coming weeks as we deploy new features.
Over the summer a new series of datasets produced by the participants in the Wikimedia
Summer of Research [3] will be released and an open data repository will be announced to
host and permanently identify these datasets. Further details on this program and
WMF's open data policy will follow on the Foundation's blog and on this list.
Dario
[1]
http://www.mediawiki.org/wiki/Article_feedback
[2]
http://www.mediawiki.org/wiki/Article_feedback/Data
[3]
http://meta.wikimedia.org/wiki/Research:Wikimedia_Summer_of_Research_2011
--
Dario Taraborelli, PhD
Senior Research Analyst
Wikimedia Foundation
http://wikimediafoundation.org
http://nitens.org/taraborelli