Hey,
This is the 20th weekly update from revision scoring team that we have sent
to this mailing list.
New development:
- We implemented the basic functionality for handling bag of words and
other types of abstract feature vectors in `revscoring`. [1] This required
some changes to some dependencies as well. [2]
- We extended the user-group related features to include more of the
dominant groups outside of English Wikipedia [3] and incremented the models
that changed substantially [4]
Documentation:
- We extended the documentation at mw:Extension:ORES to make it easier
for new developers to work with us. [5]
Resourcing:
- We discussed the teams resourcing needs (hardware, engineering, and
community liaison support) with Wes Moran. [6]
Maintenance and robustness:
- We addressed a variety of issues around caching and how the ORES
extension loads new data
- ORES now returns headers that will disable secondary caching. [7]
- Our maintenance scripts will circumvent caches that do not listen to
no-cache headers. [8, 9]
- We fixed an issue where the ORES review tool would duplicate items in
Special:RecentChanges. [10]
- We standardized the extraction pattern for the enwiktionary model so
that it looks similar to other models. [11]
1.
https://phabricator.wikimedia.org/T132580 -- Implement abstraction for
Sparse Feature Vectors
2.
https://phabricator.wikimedia.org/T144430 -- Update yamlconf so that
import_path can handle deep attributes
3.
https://phabricator.wikimedia.org/T143909 -- Extend user group features
4.
https://phabricator.wikimedia.org/T144855 -- Increment ruwiki
editquality models
5.
https://phabricator.wikimedia.org/T144676 -- Improve technical
documentation in Extension:ORES in mediawiki.ore
6.
https://phabricator.wikimedia.org/T144517 -- ORES and Product:
resourcing discussion
7.
https://phabricator.wikimedia.org/T144193 -- Set max-age header to 0
seconds for ORES to quiet secondary caches
8.
https://phabricator.wikimedia.org/T144196 -- Get model version needs to
invalidate cache
9.
https://phabricator.wikimedia.org/T144195 -- Check model version
replaces every time it runs.
10.
https://phabricator.wikimedia.org/T144233 -- Redundant results in ORES
review tool
11.
https://phabricator.wikimedia.org/T144605 -- Fix makefile entry for
enwiktionary.rev_reverted.20k_2016.tsv
Sincerely,
Aaron from the Revision Scoring team