Hey,
This is the 29th weekly update from the Revision Scoring team to this mailing list.
Deployments:
- We deployed logging changes to ORES that will reduce log verbosity[1]
- We also deployed revscoring 1.3.0 and new models built with it to WMF labs[2]. This won't change anything important from a user's perspective, but it paves the way for developing new modeling strategies.
Maintenance and robustness:
- We fixed puppet so that log file directories are also created on the celery worker nodes (affects wmflabs)[3]
- We fixed an issue with our recall_at_fpr metric, which was incorrectly defined, and implemented a recall_at_precision metric to take its place[4]
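
For anyone unfamiliar with the metric, here is a minimal sketch of what a recall-at-precision statistic usually means: the highest recall reachable at any score threshold whose precision meets a given floor. This is an illustration only, not the revscoring implementation; the function name, its arguments, and the example data are all hypothetical.

```python
def recall_at_precision(labels, scores, min_precision):
    """Return the best recall over all thresholds with precision >= min_precision.

    Hypothetical helper for illustration; not taken from revscoring.
    labels: iterable of booleans (True = positive class)
    scores: iterable of floats (higher = more likely positive)
    """
    labels = list(labels)
    total_positives = sum(1 for label in labels if label)
    if total_positives == 0:
        return 0.0

    # Rank observations from highest to lowest score.
    ranked = sorted(zip(scores, labels), key=lambda pair: pair[0], reverse=True)

    best_recall = 0.0
    true_pos = false_pos = 0
    i = 0
    while i < len(ranked):
        threshold = ranked[i][0]
        # Everything tied at this score is accepted together.
        while i < len(ranked) and ranked[i][0] == threshold:
            if ranked[i][1]:
                true_pos += 1
            else:
                false_pos += 1
            i += 1
        precision = true_pos / (true_pos + false_pos)
        if precision >= min_precision:
            best_recall = max(best_recall, true_pos / total_positives)
    return best_recall


# Made-up example data: three positives among six observations.
example_labels = [True, True, False, True, False, False]
example_scores = [0.9, 0.8, 0.7, 0.6, 0.3, 0.2]
print(recall_at_precision(example_labels, example_scores, 0.9))
```

If no threshold reaches the requested precision, this sketch returns 0.0; a real implementation might prefer to report that case differently.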
New development:
- We've made a lot of progress on modeling sentences and have just started experimenting with a sentence model from featured articles[5]
- We're reviewing a dataset of spam/vandalism/attack new page creations for public release[6]. This dataset will help our collaborators work with us on modeling the quality of drafts and supporting new page triage.
Sincerely,
Aaron from the Revision Scoring team