On Sun, Feb 5, 2012 at 5:38 AM, emijrp <emijrp@gmail.com> wrote:
2012/2/1 emijrp <emijrp@gmail.com>
... and I'm thinking about an analysis of male-female biographies ratio between Wikipedias.

After an analysis of a sample of 364k biographies where ~44% of them where classified using he/she his/her word occurences, it shows only 6.2% of female biographies on English Wikipedia. These results are preliminar, has anyone done a similar approach before? I don't want to reinvent the wheel.


I believe at one point John Vandenberg developed a statistical tool that allowed us to determine the percentage of male vs female Australian female sport competitors (identified through the use of categories) who had the meta data in their biographies completed.  Not quite the same thing, but was still extremely useful in the context of us identifying areas needing work.


--
twitter: purplepopple
blog: ozziesport.com