If Elasticsearch regexp search is not easy to get going, then https://github.com/wikimedia/dumpgrepper can be used to get a count of matches per dump, as well as the relative frequency as a percentage of articles.
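For illustration only, here is a rough Python sketch of the kind of count described above: total matches plus the share of pages containing at least one match. It is not how dumpgrepper works internally; the `magic_link_stats` helper, the `(title, wikitext)` input shape, and the simplified regex are all assumptions, and MediaWiki's real magic-link grammar is stricter.

```python
import re

# Simplified, assumed patterns for the three magic link types (ISBN, RFC, PMID).
# MediaWiki's actual parser rules are more detailed than this.
MAGIC_LINK = re.compile(
    r"\b(?:ISBN\s+[0-9][0-9Xx -]{8,}[0-9Xx]|RFC\s+[0-9]+|PMID\s+[0-9]+)\b"
)


def magic_link_stats(pages):
    """Count magic links in an iterable of (title, wikitext) pairs.

    Returns (total_matches, pages_with_match, percent_of_pages).
    """
    total_matches = 0
    pages_with_match = 0
    page_count = 0
    for _title, text in pages:
        page_count += 1
        found = len(MAGIC_LINK.findall(text))
        total_matches += found
        if found:
            pages_with_match += 1
    # Avoid dividing by zero on an empty input.
    percent = 100.0 * pages_with_match / page_count if page_count else 0.0
    return total_matches, pages_with_match, percent


if __name__ == "__main__":
    sample = [
        ("A", "See RFC 2616 and PMID 12345."),
        ("B", "No links here."),
    ]
    print(magic_link_stats(sample))
```

Feeding it the pages of a real dump would need a separate XML reader, which is left out here.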
On Wed, Oct 5, 2016 at 8:51 AM, C. Scott Ananian cananian@wikimedia.org wrote:
I just have anecdotal knowledge based on https://phabricator.wikimedia.org/T117165 -- people seem to use ISBNs in citations often enough to complain when they are <nowiki>'ed by Visual Editor, but I don't remember ever having any complaint about <nowiki>s around the RFC or PMID magic links. --scott
On Wed, Oct 5, 2016 at 11:12 AM, Chad innocentkiller@gmail.com wrote:
On Tue, Oct 4, 2016 at 11:52 PM Legoktm legoktm.wikipedia@gmail.com wrote:
- Deprecation strategy for Wikimedia wikis (e.g. Wikipedia)
Most of the migration can be done using a bot with some basic regexes, but the key part will be adapting templates to generate links (e.g. ISBN citation templates) instead of relying upon magic link functionality.
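As a rough idea of what a "basic regex" pass could look like, the Python sketch below rewrites bare ISBN magic links into a template call. The `isbn_to_template` helper, the default template name `ISBN`, and the hyphen-only pattern are assumptions for illustration; a real bot would also need to skip text inside <nowiki>, existing links, and templates, and would need to handle space-separated ISBNs.

```python
import re

# Simplified, assumed pattern: digits and hyphens only, optional trailing X.
# Space-separated ISBNs are deliberately not matched to avoid over-capturing.
ISBN_LINK = re.compile(r"\bISBN\s+([0-9][0-9-]{8,}[0-9Xx])\b")


def isbn_to_template(wikitext, template="ISBN"):
    """Replace bare 'ISBN <number>' text with '{{<template>|<number>}}'."""
    return ISBN_LINK.sub(
        lambda m: "{{%s|%s}}" % (template, m.group(1)),
        wikitext,
    )


if __name__ == "__main__":
    print(isbn_to_template("Cite ISBN 0-306-40615-2 here."))
```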
Question: do we have any kinds of numbers yet on how widely these are used across WMF projects?
It's info we could probably get out of either Elasticsearch or the dumps :)
-Chad
_______________________________________________
Wikitech-l mailing list
Wikitech-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
-- (http://cscott.net)