jayvdb added a comment.
The change to using a unicode regex was here: https://www.mediawiki.org/wiki/Special:Code/MediaWiki/36253 , brought about due to https://phabricator.wikimedia.org/T16512.
It would be interesting to see if that regex is 'close' to the effect of previous linktrail regex , as it might be usable as a default .
We will never get perfect parsing of old revisions unless we load the regex from the php source code of the relevant MW version used at the time of the revision. Which is an insane problem to solve and unlikely anyone cares about accuracy that much.
TASK DETAIL https://phabricator.wikimedia.org/T97630
REPLY HANDLER ACTIONS Reply to comment or attach files, or !close, !claim, !unsubscribe or !assign <username>.
EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/
To: jayvdb Cc: pywikipedia-bugs, jayvdb, Aklapper