Thought the researchers on this list might enjoy having another dataset to play with. :-)
-Sumana Harihareswara Engineering Community Manager Wikimedia Foundation
-------- Original Message -------- Subject: Re: [Wikitech-l] Prefs removal Date: Fri, 12 Oct 2012 14:18:37 -0700 From: Erik Moeller erik@wikimedia.org Reply-To: Wikimedia developers wikitech-l@lists.wikimedia.org To: Wikimedia developers wikitech-l@lists.wikimedia.org
On Mon, Oct 8, 2012 at 11:12 PM, Erik Moeller erik@wikimedia.org wrote:
I've pinged analytics to see if they can get us a better prefs report.
Dario Taraborelli has kindly made a detailed dataset of preferences available. He can chime in with details if needed.
The dataset can be found here:
http://thedatahub.org/dataset/wikipedia-user-preferences
These datasets capture the following:
- user_properties set to '' from the default -> active_prefs_0 - user_properties set to 1 from the default -> active_prefs_1 -> NOTE: This can be very misleading for non-boolean prefs - user_properties set to !='' from the default -> active_prefs_all
This is based on a dataset of _active_ users which is included. Unchanged prefs aren't included, specific non-boolean settings aren't included, and prefs with < 5 users aren't included. This is en.wp, other languages to follow.
Note there's all kinds of funkiness with how prefs may be serialized to the DB - prefs names and defaults may have changed, prefs may have been removed, and some stuff is serialized when it doesn't need to be (suggesting that prefs have been changed, when in fact the user has just performed a pref save). But for a lot of the more obscure prefs this should be good initial guidance.
Will post some initial observations to https://www.mediawiki.org/wiki/Requests_for_comment/Core_user_preferences
Thanks for forwarding this Sumana, just a quick note that the DataHub seems to have problems with file uploads at the moment, so expect the full non-enwiki data to be available in the coming days.
On Oct 12, 2012, at 2:23 PM, Sumana Harihareswara sumanah@wikimedia.org wrote:
Thought the researchers on this list might enjoy having another dataset to play with. :-)
-Sumana Harihareswara Engineering Community Manager Wikimedia Foundation
-------- Original Message -------- Subject: Re: [Wikitech-l] Prefs removal Date: Fri, 12 Oct 2012 14:18:37 -0700 From: Erik Moeller erik@wikimedia.org Reply-To: Wikimedia developers wikitech-l@lists.wikimedia.org To: Wikimedia developers wikitech-l@lists.wikimedia.org
On Mon, Oct 8, 2012 at 11:12 PM, Erik Moeller erik@wikimedia.org wrote:
I've pinged analytics to see if they can get us a better prefs report.
Dario Taraborelli has kindly made a detailed dataset of preferences available. He can chime in with details if needed.
The dataset can be found here:
http://thedatahub.org/dataset/wikipedia-user-preferences
These datasets capture the following:
- user_properties set to '' from the default -> active_prefs_0
- user_properties set to 1 from the default -> active_prefs_1
-> NOTE: This can be very misleading for non-boolean prefs
- user_properties set to !='' from the default -> active_prefs_all
This is based on a dataset of _active_ users which is included. Unchanged prefs aren't included, specific non-boolean settings aren't included, and prefs with < 5 users aren't included. This is en.wp, other languages to follow.
Note there's all kinds of funkiness with how prefs may be serialized to the DB - prefs names and defaults may have changed, prefs may have been removed, and some stuff is serialized when it doesn't need to be (suggesting that prefs have been changed, when in fact the user has just performed a pref save). But for a lot of the more obscure prefs this should be good initial guidance.
Will post some initial observations to https://www.mediawiki.org/wiki/Requests_for_comment/Core_user_preferences
-- Erik Möller VP of Engineering and Product Development, Wikimedia Foundation
Support Free Knowledge: https://wikimediafoundation.org/wiki/Donate
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
wiki-research-l@lists.wikimedia.org