On 14/10/13 17:52, Klein,Max wrote:
Hi all,
First of all I think this is fantastic research. It goes to show, it's not just
properties that we can correlate, but also the Labels, Aliases, Sitelinks, and the
connections between each field.
I would like to point out, as Markus does in his discussion - the relative
disproportionate representation of sex in Acadmia is the motivation for studying this. Let
us be sensitive to results in that field. Lets remember our simplifying assumptions. We
have flattened sex and gender into one measure, and at that this research makes a binary
male/female classification, where even the wikidata sex property is trinary (intersex). I
hope that in the future we can increase or change our view to how we model sex.
Indeed, it the debates on "gender inequality" and "gender
multiplicity"
look at things on very different zoom levels. The goal of my little
experiment (I would not call it research, as it has neither a hypothesis
nor any form of evaluation) was not to put individual people into rigid
gender buckets but to estimate rough global distributions. My error
margins are far too wide to make any realistic statement about "minority
genders" even if I had a method to consider them. As far as social
definitions of gender go, this is probably something to study in a wider
context of representation of social minorities in certain professional
fields.
Cheers,
Markus
________________________________________
From: wikidata-l-bounces(a)lists.wikimedia.org
<wikidata-l-bounces(a)lists.wikimedia.org> on behalf of Paul A. Houle
<paul(a)ontology2.com>
Sent: Sunday, October 13, 2013 5:32 PM
To: Discussion list for the Wikidata project.
Subject: Re: [Wikidata-l] Application: sexing people by name/research gender bias
Just as a suggestion, you can turn these kind of numbers into a probability
distribution using the beta distribution. If you use (1,1) as a prior you
get something like beta(251,1) for the the probability of the probability
that somebody named "Aaron" is male.
-----Original Message-----
From: Markus Krötzsch
Sent: Sunday, October 13, 2013 6:16 PM
To: Discussion list for the Wikidata project.
Subject: [Wikidata-l] Application: sexing people by name/research gender
bias
Hi all,
I'd like to share a little Wikidata application: I just used Wikidata to
guess the sex of people based on their (first) name [1]. My goal was to
determine gender bias among the authors in several research areas. This
is how some people spend their free time on weekends ;-)
In the process, I also created a long list of first names with
associated sex information from Wikidata [2]. It is not super clean but
it served its purpose. If you are a researcher, then maybe the gender
bias of journals/conferences is interesting to you as well. Details and
some discussion of the results are online [1].
Cheers,
Markus
[1]
http://korrekt.org/page/Note:Sex_Distributions_in_Research
[2]
https://docs.google.com/spreadsheet/ccc?key=0AstQ5xfO-xXGdE9UVkxNc0JMVWJzNm…
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l
_______________________________________________
Wikidata-l mailing list
Wikidata-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata-l