Update: dawiki category "Personer" seems to have some category tree cycles in higher depths. Here are articles from that category (one layer deep) with no P31 in the item:
http://petscan.wmflabs.org/?psid=8128462


On Tue, Mar 5, 2019 at 1:00 PM Magnus Manske <magnusmanske@googlemail.com> wrote:
If you make the gender optional, you also get the items without gender:
http://tinyurl.com/yygze9da

"People on Danish Wikipedia but not on Wikidata" is either:
* a subset of "Danish Wikipedia articles not on Wikidata". You can get all of these (currently, 91) via my tool: https://tools.wmflabs.org/wikidata-todo/duplicity.php?wiki=dawiki&mode=list
* "people on Danish Wikipedia with an item but no P31". Using dawiki category "Personer", I am currently running PetScan but it's slow, will keep you posted
(for all dawiki items without P31 or P279, see http://tinyurl.com/y4u6lwyj )

On Tue, Mar 5, 2019 at 11:55 AM <fn@imm.dtu.dk> wrote:
Dear any Wikidata Query Service expert,


In connection with an editathon, I have made statistics of the number of
women and men on the Danish Wikipedia. I have used WDQS for that and the
query is listed below:

SELECT ?count ?gender ?genderLabel
WITH {
   SELECT ?gender (COUNT(*) AS ?count) WHERE {
     ?item wdt:P31 wd:Q5 .
     ?item wdt:P21 ?gender .
     ?article schema:about ?item.
     ?article schema:isPartOf <https://da.wikipedia.org/>
   }
   GROUP BY ?gender
} AS %results
WHERE {
   INCLUDE %results
   SERVICE wikibase:label { bd:serviceParam wikibase:language "da,en". }
}
ORDER BY DESC(?count)
LIMIT 25

http://tinyurl.com/y8twboe5

As the statistics could potentially create some discussion (and ready
seems to have) I am wondering whether there are some experts that could
peer review the SPARQL query and tell me if there are any issues. I hope
I have not made a blunder...

The minor issues I can think of are:

- Missing gender in Wikidata. We have around 360 of these.

- People on the Danish Wikipedia not on Wikidata. Probably tens-ish or
hundreds-ish!?

- People not being humans. The gendered items I sampled were all
fictional humans.


We previously reached 17.2% females. Now we are below 17% due to
mass-import of Japanese football players, - as far as we can see.


best regards
Finn Årup Nielsen
http://people.compute.dtu.dk/faan/

_______________________________________________
Wikidata mailing list
Wikidata@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikidata