Daniel Friesen wrote:
Perhaps some deeper segregation of those stats would be useful. ie: Separate the numbers of styles used between templates and pages.
Then we might have a better idea of what kind of patterns are being used directly in pages that should actually be moved to templates or stylesheets.
This reply confused me a little. The script I ran exclusively looked at pages in the main namespace and exclusively looked at an XML dump, which is unexpanded wikitext. That is, assuming people aren't doing a lot of inline styling as arguments/parameters to templates, we should already have a decent amount of segregation as I only looked at direct uses.
Looking at the template namespace or looking at pages post-expansion would be annoying. I think templates aren't necessarily a bad place for inline styling, so I'm a lot less focused on templates than I am on articles.
Vi to wrote:
Do you have a old dump to check whatever the ratio has increased?
I'm personally not very interested in doing this, but using a similar dump from https://dumps.wikimedia.org/ and following the instructions laid out in https://phabricator.wikimedia.org/T115228 should make this fairly easy to do, if anyone is interested. I tried to methodically document all of the relevant source code and commands that I used, so that this same audit or an audit on another project or dump would be less work.
MZMcBride