OK, I have some news:
1- Today I rewrote some parts of Kian, and it now chooses the regularization parameter (lambda) automatically, so predictions are more accurate. I wanted to push the changes to GitHub, but my SSH setup seems to have issues. They'll be there soon.
2- (Important) I wrote code that finds possible mistakes in Wikidata based on Kian. The code will be on GitHub soon. Check out this link: http://tools.wmflabs.org/dexbot/possible_mistakes_fr.txt. It's the result of comparing French Wikipedia against Wikidata, e.g. this line:
Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]
1 (d) means Wikidata thinks it's a human
0.25... (w) means French Wikipedia thinks it's not a human (with 74.3% certainty)
And if you check the link you can see it's a mistake in Wikidata. Please check other results and fix them.
Tell me if you want this test to be run for another language too.
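For anyone who wants to sort or filter the report, here is a minimal Python sketch that parses one line of the format shown above. It is not Kian's own code. The names `Mismatch`, `parse_line` and `disagreement` are made up for illustration. Reading the (w) value as the probability of "human" is inferred from the example above, and the meaning of the bracketed list is not stated in this message, so it is kept only as an opaque list of integers.

```python
import re
from typing import List, NamedTuple


class Mismatch(NamedTuple):
    item: str          # Wikidata item ID, e.g. "Q2994923"
    wikidata: int      # (d): 1 if Wikidata says human, else 0
    wikipedia: float   # (w): score from the Wikipedia side (assumed P(human))
    extra: List[int]   # bracketed values; meaning not stated in the message


LINE_RE = re.compile(
    r"^(Q\d+):\s*([01])\s*\(d\),\s*([0-9.]+)\s*\(w\)\s*\[([^\]]*)\]$"
)


def parse_line(line: str) -> Mismatch:
    """Parse one report line into a Mismatch record."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError("unrecognized line: %r" % line)
    item, d, w, extra = match.groups()
    values = [int(x) for x in extra.split(",") if x.strip()]
    return Mismatch(item, int(d), float(w), values)


def disagreement(rec: Mismatch) -> float:
    """Certainty of the Wikipedia side that Wikidata's label is wrong."""
    return abs(rec.wikidata - rec.wikipedia)


rec = parse_line("Q2994923: 1 (d), 0.257480420229 (w) [0, 0, 1, 2, 0]")
print(rec.item, "{:.1%}".format(disagreement(rec)))  # Q2994923 74.3%
```

Sorting the parsed records by `disagreement` in descending order would put the most likely mistakes first.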
3- I used Kian to import unconnected pages from French Wikipedia and created about 1900 items. The result is here: http://tools.wmflabs.org/dexbot/kian_res_fr.txt. Please check whether anything in this list is not a human and tell me, so I can run some error analysis.
Best
On Mon, Mar 16, 2015 at 9:50 PM, Amir Ladsgroup ladsgroup@gmail.com wrote:
Thanks Sjoerddebruin,
I'm working on this so that I can write a system that finds and reports possible mistakes made by Dexbot or others. It gets more precise as time goes by.
Best
On Sun, Mar 15, 2015 at 8:51 PM Sjoerd de Bruin sjoerddebruin@me.com wrote:
Now that the gender game is working again, I noticed a lot of issues with the following category: https://nl.wikipedia.org/wiki/Categorie:Danceact
As you can see, it's about musical groups but they all were marked as human.
Greetings,
Sjoerd de Bruin sjoerddebruin@me.com
On 14 Mar 2015, at 14:18, Amir Ladsgroup ladsgroup@gmail.com wrote:
I'm writing a parser so I can feed gender classification to Kian. It'll be done soon and you can use it :)
On Sat, Mar 14, 2015 at 12:53 PM Sjoerd de Bruin sjoerddebruin@me.com wrote:
Hm, the Wikidata Game is really slow. Magnus, if you read this: do you know what's going on? I play the gender game with only nlwiki articles, but it never loads. It was working yesterday with just 50 items, so it should work now imo.
Greetings,
Sjoerd de Bruin sjoerddebruin@me.com
On 14 Mar 2015, at 09:39, Sjoerd de Bruin sjoerddebruin@me.com wrote:
I've corrected two lists (Lijst van voorzitters van de SER and Lijst van voorzitters van de WRR) and a music group (Viper (Belgische danceact)). Will play the gender game the next few days to check them.
Greetings,
Sjoerd de Bruin sjoerddebruin@me.com
On 14 Mar 2015, at 00:51, Amir Ladsgroup ladsgroup@gmail.com wrote:
Sorry for the late answer, I got busy in the real world. This is the result for unconnected pages of Dutch Wikipedia: http://tools.wmflabs.org/dexbot/kian_res_nl.txt. Please check and tell me which ones are not human. I'm now producing results for empty items related to Dutch Wikipedia.
On Thu, Mar 12, 2015 at 2:58 PM Amir Ladsgroup ladsgroup@gmail.com wrote:
Sure, tonight it will be done.
Best
On Thu, Mar 12, 2015 at 2:08 AM, Sjoerd de Bruin sjoerddebruin@me.com wrote:
I'm ready for it! All existing humans on nlwiki have a gender now, so it's easy to review this batch. Bring it on.
On 11 Mar 2015, at 22:14, Maarten Dammers maarten@mdammers.nl wrote:
Hi Amir,
Amir Ladsgroup wrote on 9 Mar 2015 at 22:40:
Result for English Wikipedia (6366 articles classified as human) https://tools.wmflabs.org/dexbot/kian_res_en.txt
Sounds like fun! Can you run it on the Dutch Wikipedia too? At https://tools.wmflabs.org/multichill/queries/wikidata/noclaims_nlwiki.txt I have a list of items without claims (linking them to other items).
Maarten
_______________________________________________
Wikidata-l mailing list Wikidata-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikidata-l
-- Amir