Hi everyone,

As I mentioned in an e-mail last week, we are in the final stages of a large literature review of scholarly research on Wikipedia. We have extracted and organized most of the data and have published it to a Semantic MediaWiki wiki at http://wikilit.referata.com. [Thanks to emijpr for inspiring the structure through WikiPapers.] As I indicated, this is sort of an incubator site for finishing up our data in preparation for publication, after which we intend to export the data permanently to other sites like AcaWiki and WikiPapers.

Thanks for your responses to my inquiries; we have included abstracts, and the data is dual-licensed as CC-BY-SA and ODC-ODbL (http://opendatacommons.org/licenses/odbl/summary/) [thanks, Dario, for the links!], except for copyrighted abstracts. We have submitted a related presentation proposal for Wikimania 2012 at http://wikimania2012.wikimedia.org/wiki/Submissions/Wikilit:_Ten_years_of_Wikipedia_research.

We are asking the Wikipedia research community to please help us verify the accuracy of our data extraction so far. Practically, if you could at least take a look at your own publications and the publications you know well, that would be great. It's an open wiki, so please make any corrections directly, even anonymously. (However, if you want us to acknowledge your contributions, please create a user account and identify yourself on your user page.) In particular, please help us with the following:
* Please correct any inaccuracies you see, or e-mail us at wikilit@okoli.org to notify us of them.
* Please point out any peer-reviewed journal articles or PhD dissertations we have missed that were published before July 2011; we will certainly add these. (After that, the Wikimedia Research Newsletter began.)
* Please point out any other scholarly studies (especially conference articles and significant non-peer-reviewed work) that you feel should definitely be analyzed in detail. Although we have listed 1,500 conference papers (http://wikilit.referata.com/wiki/List_of_conference_papers), our limited time and resources permit us to analyze only a fraction of them in detail. So, please help us highlight the most important ones that we have not analyzed in detail, with a brief explanation of why they are particularly important.
* Please add any published scholarly studies about Wikipedia that we have left out, regardless of peer review or publication type! Please add your own work! Our restrictions in what we include are purely pragmatic due to time and resource limitations. However, if you add a new article, please be sure to *complete as many input fields as possible*, since we will generally exclude any article with incomplete data in our final analysis.
* Please suggest any data analysis or visualizations you would like to see as we synthesize the data.
* Please give any other feedback or suggestion that can help us make this dataset more useful to researchers! Send comments to wikilit@okoli.org.

The data is publicly available, but this is a beta release and there are probably a lot of errors. We hope to have a stable and very clean dataset within a couple of months, both from community help and from our own internal quality control processes; we'll make another announcement when we feel the dataset has reached "featured" quality. In particular, please wait until the data is in a cleaner state before exporting it to other research collection websites and wikis; by then, we'll help make it available in as many export formats as practical.

Regards,
Chitu
For the WikiLit project team: Arto Lanamäki, Mohamad Mehdi, Mostafa Mesgari, Finn Årup Nielsen, Chitu Okoli