On Mon, Oct 8, 2012 at 11:12 PM, Erik Moeller erik@wikimedia.org wrote:
I've pinged analytics to see if they can get us a better prefs report.
Dario Taraborelli has kindly made a detailed dataset of preferences available. He can chime in with details if needed.
The dataset can be found here:
http://thedatahub.org/dataset/wikipedia-user-preferences
These datasets capture the following:
- user_properties set to '' from the default -> active_prefs_0 - user_properties set to 1 from the default -> active_prefs_1 -> NOTE: This can be very misleading for non-boolean prefs - user_properties set to !='' from the default -> active_prefs_all
This is based on a dataset of _active_ users which is included. Unchanged prefs aren't included, specific non-boolean settings aren't included, and prefs with < 5 users aren't included. This is en.wp, other languages to follow.
Note there's all kinds of funkiness with how prefs may be serialized to the DB - prefs names and defaults may have changed, prefs may have been removed, and some stuff is serialized when it doesn't need to be (suggesting that prefs have been changed, when in fact the user has just performed a pref save). But for a lot of the more obscure prefs this should be good initial guidance.
Will post some initial observations to https://www.mediawiki.org/wiki/Requests_for_comment/Core_user_preferences