Thank you for these showcases, they are great. I'm a fan of using session data as a baseline metric; kudos to Oliver for this work.
Is there a catalog of all data that could possibly be available (for instance, the mw.session cookie), along with where it is logged, for how long, and where in various toolchains it gets stripped out?
Related lists could be useful for planning: * Limitations our privacy policies place on data gathering (handy when reviewing those policies) * Studies that are easy and hard given the types of data we gather * Wishlists (from external researchers, and from internal staff) of data-sets that would be useful but aren't currently available. Along with a sense of priority, complexity, cost.
On Tue, Feb 25, 2014 at 9:06 PM, Dario Taraborelli dtaraborelli@wikimedia.org wrote:
Starting tomorrow (February 26), we will be broadcasting the monthly showcase of the Wikimedia Research and Data team.
The showcase is an opportunity to present and discuss recent work researchers at the Foundation have been conducting. The showcase will start at 11.30 Pacific Time and we will post a link to the stream a few minutes before it starts. You can also join the conversation on the #wikimedia-office IRC channel on freenode (we'll be sticking around after the end of the showcase to answer any question).
This month, we'll be talking about Wikipedia mobile readers and article creation trends:
Oliver Keyes Mobile session times A prerequisite to many pieces of interesting reader research is being able to accurately identify the length of users' 'sessions'. I will explain one potential way of doing it, how I've applied it to mobile readers, and what research this opens up. (20 mins) https://meta.wikimedia.org/wiki/Research:Mobile_sessions
Aaron Halfaker Wikipedia article creation research I'll present research examining trends in newcomer article creation across 10 languages with a focus on English and German Wikipedias. I'll show that, in wikis where anonymous users can create articles, their articles are less likely to be deleted than articles created by newly registered editors. I'll also show the results of an in-depth analysis of Articles for Creation (AfC) which suggest that while AfC's process seems to result in the publication of high quality articles, it also dramatically reduces the rate at which good new articles are published. (30 mins) https://meta.wikimedia.org/wiki/Research:Wikipedia_article_creation
Looking forward to seeing you all tomorrow!
Dario
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l