Hi all,
I ran a few quick updates on Max's numbers today. As of 9/6/14:
* WIkidata has ~2080k items marked as people * Of these, ~1893k have a "gender" property (91%)
(Magnus's games are doing an amazing job at filling out these numbers, by the way - http://magnusmanske.de/wordpress/?p=213 )
Very quick and dirty statistics follow - note that since we have 9% undefined, the stats may change a bit as time goes on :-)
* The gender breakdown across all these people is approximately 1603k male, 290k female - 84.7% male and 15.3% female.
* enwiki is 15.5% female; arwiki 14.2%; dewiki 14.9% female; frwiki 15.2%; eswiki 15.9%; jawiki 18.2%; hiwiki 18.7%; zhwiki 20.1%
* It's interesting to note that these numbers mostly seem a point or two better than the numbers Max got a month ago, which probably represents better data-logging rather than change in the underlying content
* There are still very few items with a gender property other than "male" or "female" - perhaps 100-200 overall - but I suspect this number will significantly increase as we deal with the remaining items.
Andrew.
On 22 May 2014 18:16, Maximilian Klein isalix@gmail.com wrote:
Hi Everyone,
I just conducted some new research I though you might be intrigued by.
It compares the "sex or gender" labels in use by Wikidata today - 13 in total. The percentage of articles about "female"s by language.
The best are Serbian Wikipedia, or Urdu Wikipedia, depending on the size you count.
The Wiki's that have become most sexist in 2014 - English Wikpedia. And the Data Richness per sex value. - 6.2 Wikidata Statement per male, 6.0 per female.
See the full blog here, and please ask me questions and suggestions -
http://notconfusing.com/sex-ratios-in-wikidata-part-iii/
Max Klein ‽ http://notconfusing.com/
Gendergap mailing list Gendergap@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/gendergap