Hi,
I[0] am an Outreachy Round 15 intern and I am working on AICaptcha[1] project. This project is aimed at creating a better captcha system (like the Google invisible captcha) which can prevent/reduce the incidence of bots creating user accounts and spamming Wikipedia. My mentors on this project are Gergő Tisza[2] and Adam Roses Wight[3].
The key aspects of this project are: 1. Data capture for training a machine learning classifier which is elaborated in Phabricator task[4]. The data can be captured from the registration page using the WikiMediaEvents extension. 2. Feature selection, dealing with selecting the most appropriate features which can improve the classification model, explained in Phabricator task[5] 3. Finding appropriate machine learning classifier to create the model [6].
Kindly provide suggestions/ideas on Phabricator, so that any idea missed by oversight can be discussed. Also if there are any possible issues which I have not thought about yet, please comment on the tasks so that I can take care of them sooner rather than later.
Links: [0]: https://meta.wikimedia.org/wiki/User:Groovier [1] : https://phabricator.wikimedia.org/project/profile/3137/ [2]: https://www.mediawiki.org/wiki/User:Tgr_(WMF) [3]: https://www.mediawiki.org/wiki/User:Adamw [4]: https://phabricator.wikimedia.org/T183991 [5]:https://phabricator.wikimedia.org/T183998 [6]: https://phabricator.wikimedia.org/T184013
Thanks Vinitha
wiki-research-l@lists.wikimedia.org