Hi Reem,
Going back to the original question, I think your PetScan query is timing out because it doesn't like the depth=50 requirement - because this is a messy category tree it probably starts generating a lot of results at that depth. Remember that categories about individual people often end up in these categories, and any one of those can open up a whole new set of categories - as a result, it would either generate huge numbers of articles, or possibly go into an infinite loop.
If you cut the depth down you can get a decent number of results and apply the size filter.
depth=3 gets ~68k results, of which ~16k are over 10kb in size.
https://petscan.wmflabs.org/?language=en&project=wikipedia&depth=3&a...
Andrew.
On Sat, 12 Jan 2019 at 11:17, Reem Al-Kashif reemalkashif@gmail.com wrote:
Hello!
Hope this finds you well. I put together a query https://query.wikidata.org/#SELECT%20%3Fitem%20%3FitemLabel%20%3FsitelinkEn%0A%0AWHERE%20%7B%0A%20%3Fitem%20wdt%3AP31%20wd%3AQ5.%0A%20%3Fitem%20wdt%3AP106%20wd%3AQ36180.%0A%20%3Fitem%20wdt%3AP21%20wd%3AQ6581097.%0A%20%3FsitelinkEn%20schema%3Aabout%20%3Fitem%3B%0A%20%20%09%09%09%20%20%20%20schema%3AisPartOf%20%3Chttps%3A%2F%2Fen.wikipedia.org%2F%3E.%0A%20SERVICE%20wikibase%3Alabel%20%7B%20bd%3AserviceParam%20wikibase%3Alanguage%20%22en%22.%20%7D%0A%20%20%7D to create a list of English Wikipedia articles about male writers. Is it possible to filter the results by size? For example, articles that are larger than or equal to 10k bytes?
I understand that this is better done by PetScan, but my PetScan query https://petscan.wmflabs.org/?language=en&project=wikipedia&depth=50&categories=Male%20writers&ns%5B0%5D=1&larger=10000&search_max_results=500&interface_language=en&&doit= refuses to cooperate for a reason I don't know yet.. :/
Thanks in advance.
Best, Reem
--
*Kind regards,Reem Al-Kashif*
http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail Virus-free. www.avg.com http://www.avg.com/email-signature?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail <#m_-2239464299791067440_m_-842422966700709082_m_3608676783914821221_DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2> _______________________________________________ Wikidata mailing list Wikidata@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata