Thanks for forwarding this Sumana, just a quick note that the DataHub seems to have
problems with file uploads at the moment, so expect the full non-enwiki data to be
available in the coming days.
On Oct 12, 2012, at 2:23 PM, Sumana Harihareswara <sumanah(a)wikimedia.org> wrote:
Thought the researchers on this list might enjoy
having another dataset
to play with. :-)
Engineering Community Manager
-------- Original Message --------
Subject: Re: [Wikitech-l] Prefs removal
Date: Fri, 12 Oct 2012 14:18:37 -0700
From: Erik Moeller <erik(a)wikimedia.org>
Reply-To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
To: Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
On Mon, Oct 8, 2012 at 11:12 PM, Erik Moeller <erik(a)wikimedia.org> wrote:
I've pinged analytics to see if they can get
us a better prefs report.
Dario Taraborelli has kindly made a detailed dataset of preferences
available. He can chime in with details if needed.
The dataset can be found here:
These datasets capture the following:
- user_properties set to '' from the default -> active_prefs_0
- user_properties set to 1 from the default -> active_prefs_1
-> NOTE: This can be very misleading for non-boolean prefs
- user_properties set to !='' from the default -> active_prefs_all
This is based on a dataset of _active_ users which is included.
Unchanged prefs aren't included, specific non-boolean settings aren't
included, and prefs with < 5 users aren't included. This is en.wp,
other languages to follow.
Note there's all kinds of funkiness with how prefs may be serialized
to the DB - prefs names and defaults may have changed, prefs may have
been removed, and some stuff is serialized when it doesn't need to be
(suggesting that prefs have been changed, when in fact the user has
just performed a pref save). But for a lot of the more obscure prefs
this should be good initial guidance.
Will post some initial observations to
VP of Engineering and Product Development, Wikimedia Foundation
Support Free Knowledge: https://wikimediafoundation.org/wiki/Donate
Wikitech-l mailing list
Wiki-research-l mailing list