Phoebe Ayers and I are leading a workshop at WikiSym this year,
"WikiLit: Collecting the Wiki and Wikipedia Literature". We would love
to have your participation!
This workshop has three key goals. First, we will examine existing and
proposed systems for collecting and analyzing the research literature
about wikis. Second, we will discuss the challenges in building such a
system and will engage participants to design a sustainable
collaborative system to achieve this goal. Finally, we will provide a
forum to build upon ongoing wiki community discussions about problems
and opportunities in finding and sharing the wiki research literature.
For more details, please see:
Please do not hesitate to ask questions, either by replying here on the
list or by contacting me or Phoebe (psayers(a)ucdavis.edu) directly.
Looking forward to seeing you at WikiSym!
I've been looking to experiment with node.js lately and created a
little toy webapp that displays updates from the major language
Wikipedias in real time:
Perhaps like you, I've often tried to convey to folks in the GLAM
sector (Galleries, Libraries, Archives and Museums) just how much
Wikipedia is actively edited. GLAM institutions are increasingly
interested in "digital curation" and I've sometimes displayed the IRC
activity at workshops to demonstrate the sheer number of people (and
bots) actively engaged in improving the content there, with the hope
of making the Wikipedia platform part of their curation.
Anyhow, I'd be interested in any feedback you might have about wikistream.
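Under the hood, apps like this typically read the live recent-changes feed on irc.wikimedia.org and parse each message before rendering it. As a rough illustration of what that parsing involves, here is a Python sketch; the field layout (and the mIRC control codes used to delimit fields) is an assumption about the feed format, not taken from wikistream's source:

```python
import re

# Strip mIRC formatting: color codes (\x03 plus optional digits) and
# bold/reset/etc. control characters.
CONTROL = re.compile(r"\x03(?:\d{1,2}(?:,\d{1,2})?)?|[\x02\x0f\x16\x1f]")

# Assumed layout of a recent-changes message after stripping controls:
#   [[Title]] flags url * user * (+123) comment
LINE = re.compile(
    r"\[\[(?P<title>.+?)\]\]\s+(?P<flags>\S*)\s+(?P<url>\S+)\s+"
    r"\*\s+(?P<user>.+?)\s+\*\s+\((?P<delta>[+-]\d+)\)\s*(?P<comment>.*)"
)

def parse_rc_line(raw):
    """Parse one recent-changes IRC message into a dict, or None."""
    m = LINE.match(CONTROL.sub("", raw))
    if not m:
        return None
    fields = m.groupdict()
    fields["delta"] = int(fields["delta"])  # size change in bytes
    return fields
```

A real client would feed lines from an IRC connection into `parse_rc_line` and push the resulting records to the browser (e.g. over a websocket, as a node.js app would).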
Over the last few weeks, Yusuke Matsubara, Shawn Walker, Aaron Halfaker and
Fabian Kaelin (who are all Summer of Research fellows) have worked hard
on a customized stream-based InputFormatReader that allows parsing of both
bz2-compressed and uncompressed files of the full Wikipedia dump (the dump
files with the complete edit histories) using Hadoop. Prior to WikiHadoop and
the accompanying InputFormatReader, it was not possible to use Hadoop to
analyze the full Wikipedia dump files (see the detailed tutorial / background
for an explanation of why that was not possible).
1) We can now harness Hadoop's distributed computing capabilities in
analyzing the full dump files.
2) You can send either one or two revisions to a single mapper, so it's
possible to diff two revisions and see what content has been added or
removed.
3) You can exclude namespaces by supplying a regular expression.
4) We are using Hadoop's Streaming interface, which means people can use
this InputFormatReader from different languages such as Java, Python, and
Ruby.
The source code is available at: https://github.com/whym/wikihadoop
A more detailed tutorial and installation guide is available at:
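Because the revisions arrive through Hadoop's Streaming interface, a mapper can be an ordinary script that reads records from standard input. Here is a minimal Python sketch, assuming each record is one serialized <revision> element in the standard dump schema (how revisions are actually delimited and paired per mapper depends on the WikiHadoop configuration):

```python
import sys
import xml.etree.ElementTree as ET

def revision_stats(revision_xml):
    """Extract (contributor, text length) from one <revision> element
    following the standard Wikipedia dump schema."""
    rev = ET.fromstring(revision_xml)
    contrib = rev.find("contributor")
    user = None
    if contrib is not None:
        # Registered editors carry <username>, anonymous ones <ip>.
        for tag in ("username", "ip"):
            node = contrib.find(tag)
            if node is not None:
                user = node.text
                break
    text = rev.find("text")
    length = len(text.text) if text is not None and text.text else 0
    return user, length

if __name__ == "__main__" and not sys.stdin.isatty():
    # Hadoop Streaming feeds records on stdin; emit tab-separated
    # key/value pairs for the reducer.
    for record in sys.stdin:
        user, length = revision_stats(record)
        print("%s\t%d" % (user, length))
```

A reducer could then aggregate revision counts or byte totals per contributor in the usual Streaming fashion.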
(Apologies for cross-posting to wikitech-l and wiki-research-l)
We are glad to announce the inaugural issue of the Wikimedia Research Newsletter, a new monthly survey of recent scholarly research about Wikimedia projects.
This is a joint project of the Signpost and the Wikimedia Research Committee and follows the publication of two research updates in the Signpost; see also last month's announcement on this list.
The first issue (which is simultaneously posted as a section of the Signpost and as a stand-alone article in the Wikimedia Research Index) includes five in-depth reviews of papers published over the last few
months and a number of shorter notes, for a total of 15 publications, covering both peer-reviewed research and results published in research blogs. It also includes a report from the Wikipedia research workshop
at OKCon 2011 and highlights from the Wikimedia Summer of Research program.
The following is the TOC of issue #1:
• 1 Edit wars and conflict metrics
• 2 The anatomy of a Wikipedia talk page
• 3 Wikipedians as "Janitors of Knowledge"
• 4 Use of Wikipedia among law students: a survey
• 5 Miscellaneous
• 6 Wikipedia research at OKCon 2011
• 7 Wikimedia Summer of Research
• 7.1 How New English Wikipedians Ask for Help
• 7.2 Who Edits Trending Articles on the English Wikipedia
• 7.3 The Workload of New Page Patrollers & Vandalfighters
• 8 References
We are planning to make the newsletter easy to syndicate and subscribe to. If you wish your research to be featured, a CFP or an event you organized to be highlighted, or to join the team of contributors, head over to this page to find out how:  We hope to make this newsletter a favorite read for our research community, and we look forward to your feedback and contributions.
Dario Taraborelli, Tilman Bayer (HaeB)
on behalf of the WRN contributors
Dario Taraborelli, PhD
Senior Research Analyst
(* apologies for cross-posting *)
The second issue of the monthly Wikimedia Research Newsletter is out:
In this issue:
• Effective collaboration leads to earlier article promotion
• Deleted revisions in the English Wikipedia
• Wikipedia and open-access repositories
• Quality of featured articles doesn't always impress readers
• In swine flu outbreak, Wikipedia reading preceded blogging and newspaper writing
• Extensive analysis of gender gap in Wikipedia to be presented at WikiSym 2011
• "Bandwagon effect" spurs wiki adoption among Chinese-speaking users
• In brief
You can post suggestions and contributions for the next issue at:
Dario Taraborelli, PhD
Senior Research Analyst
WikiSym 2011, the International Symposium on Wikis and Open
Collaboration, taking place October 3-5, 2011 in Mountain View,
California, has early-bird registration that ends August 29.
Finn Årup Nielsen, DTU Informatics
http://www.imm.dtu.dk/~fn/ +45 45 25 39 21.
Wikimedia UK is pleased to announce that we are offering two full scholarships to enable UK researchers to attend WikiSym this year. You can find the full information, and details of how to apply, at:
Please let me know if you have any questions, and please feel free to pass this on to anyone you think might be interested.
I would like to take a sample from the English Wikipedia based on page
ratings. This requires extracting all page ratings and then picking the
best according to specific feedback labels. There does not seem to be an
API call to filter pages directly by their rating, so my approach would
be to get all page ratings and then apply my filter criteria. The API
calls to access feedback only seem to allow one page per call, which
results in a lot of calls ;-). Example:
Does anybody know a more polite way to get this information? I have
checked the dumps but could not find a suitable archive (at least judging
by the names).
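In the absence of a rating-based filter in the API, the filtering has to happen client-side once all per-page ratings have been collected. A minimal Python sketch of that second step; the rating-record shape (page id mapped to per-label averages) is hypothetical, not the actual API response format:

```python
def top_rated(ratings, label, n=10):
    """Given per-page rating averages in a hypothetical shape, e.g.
        {25041: {"Trustworthy": 4.2, "Well-written": 3.9}, ...}
    return the n page ids with the highest average for `label`.
    Pages lacking that label are skipped."""
    scored = [(pid, r[label]) for pid, r in ratings.items() if label in r]
    # Sort by average rating, highest first.
    scored.sort(key=lambda item: item[1], reverse=True)
    return [pid for pid, _ in scored[:n]]
```

The expensive part remains collecting `ratings` one API call per page; throttling those requests (and caching responses) would be the polite approach until a bulk dump or a filtered API call becomes available.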
> I'm doing some analysis on the wikipedia image metadata and seeing some
> missing image rows in the sql dumps.
> I downloaded
> enwiki-latest-image.sql, enwiki-latest-imagelinks.sql,
> and enwiki-latest-oldimage.sql from
> I picked a page, 25041,
> I get 39 links from
> "select il_to from imagelinks where il_from = 25041"
> When I query the image table for these, only 8 of the 39 appear.
> Some of the missing files are 050218-F-1234P-076.jpg, 020930-O-9999G-017.jpg
> I grepped the original mysql file for these and get nothing.
> I can see the original file here though:
> I did a select count and got a total of 849,801 rows. That seems low for
> the total number of Wikipedia images.
> Any ideas why I'm getting missing data?
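The discrepancy check described above can be reproduced on toy data; here is a minimal sqlite3 sketch (table and column names follow the MediaWiki schema used in the quoted queries, but the rows are made up for illustration):

```python
import sqlite3

# Toy reconstruction of the check: find imagelinks targets for a page
# that have no matching row in the local image table.
con = sqlite3.connect(":memory:")
con.executescript("""
    CREATE TABLE imagelinks (il_from INTEGER, il_to TEXT);
    CREATE TABLE image (img_name TEXT);
    INSERT INTO imagelinks VALUES
        (25041, 'Local.jpg'),
        (25041, '050218-F-1234P-076.jpg');
    INSERT INTO image VALUES ('Local.jpg');
""")

missing = [row[0] for row in con.execute("""
    SELECT il_to FROM imagelinks
    WHERE il_from = 25041
      AND il_to NOT IN (SELECT img_name FROM image)
""")]
print(missing)  # files linked from the page but absent from image
```

One common reason for such gaps is that files hosted on Wikimedia Commons appear in a wiki's imagelinks table but not in its local image table, which would also keep the local row count well below the total number of images used on Wikipedia.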