Hello again,

Due to the lack of feedback so far, we are going to assume that the revert information in the mediawiki.revision-create stream is not widely used. We will move forward with removing it, without waiting to first expose the same information elsewhere.

Do let us know if there are objections.

Thank you!
-Andrew Otto
 SRE, Data Engineering, WMF




On Mon, Apr 19, 2021 at 9:37 AM Andrew Otto <otto@wikimedia.org> wrote:
Hi all,

tl;dr: we'd like to remove the rev_is_revert field from the mediawiki.revision-create stream to solve a missing event problem.

For years now, we've known that the mediawiki.revision-create stream has been missing many real revision create events when compared with MediaWiki's MySQL databases.  This makes the stream almost useless for those who want to use it as a notification mechanism about all MediaWiki page changes.

So many events are missing because the code that emits the event subscribes to the wrong MediaWiki hook. This patch will fix this; however, the correct hook does not give us the information we need to set the rev_is_revert and rev_revert_details fields. These fields are relatively new (added only in August 2020). We think that including the missing revisions is more important than keeping the revert information, which really only captures whether or not a user used the MediaWiki UI to issue a revert.

We plan on moving forward with this, but would like feedback before we do.  If you have objections, or other ideas on how we can provide this data (like maybe including it in mediawiki/revision-tags-change and making that public?), let us know by replying to this email or in this ticket: https://phabricator.wikimedia.org/T215001

Thanks!
-Andrew Otto
 SRE, Data Engineering, WMF