That would be nice to have a global cleanup at some point, but it won't be able to handle every situation. I don't think relying on VE to clean up is good:
- First, it will take a long time before all articles are edited with VE (maybe never) - Second, I'm not a big fan of VE changing wikitext in parts not modified by the user: experience shows that it messes the diffs, and makes watching what VE is doing a lot more difficult. It has been requested several times that VE doesn't start modifying wikitext in places not modified by the user.
Things that are probably safe to fix automatically:
- Whitespace characters between nowiki tags at the beginning of a line: remove everything including the whitespace characters. - Whitespace characters between nowiki tags not at the beginning of a line: remove the tags, keep the whitespace characters. - Some characters (letters, digits, ...) between nowiki tags: remove the tags, keep the characters - In a table, cell content with only a dash between nowiki: remove tha tags, add a whitespace characters before the dash
<nowiki /> are more difficult to fix automatically I think:
- Between quotes: allows to mix a real quote with italics formatting - After the end of a wikilink:prevents the wikilink to extend to the text (often an error due to a bug in VE, but sometimes it may be normal) - ...
Nico
On Sun, Jun 21, 2015 at 8:43 PM, Amir E. Aharoni < amir.aharoni@mail.huji.ac.il> wrote:
Thanks Arlo. I added a few.
But I'm not sure that it answers my original question: Will this be done every time a page happens to edited in VE and saved or will it be done globally on all pages in all wikis as some kind of a maintenance job?
-- Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי http://aharoni.wordpress.com “We're living in pieces, I want to live in peace.” – T. Moore
2015-06-20 19:45 GMT+03:00 Arlo Breault abreault@wikimedia.org:
On Friday, June 19, 2015 at 1:38 AM, Amir E. Aharoni wrote:
There may be more - I'm still looking for these.
If you find any, please propose them on the Parsoid’s normalization talk page [0]. I’ve added the ones you’ve mentioned so far.
We’ve documented [1] what’s currently been implemented.
A few months back, Subbu solicited feedback [2] on what style norms
should
be enforced. We’ve since added a `scrubWikitext` parameter to Parsoid’s
API
that clients (like VE) can benefit from.
Cleaning up our past transgressions is great. Helping to prevent their continued existence is even better.
I was reading the discussion on gradually enabling VE for new accounts
[3]
and Kww writes there,
"Further, we still have issues with stray nowiki tags being scattered across articles. Until those are addressed, the notion that VE doesn't cause extra work
for
experienced editors is simply a sign that the metrics used to analyze effort were wrong. Jdforrester, can you explain how a study that was intended to measure whether VE caused extra work failed to note that even with the current limited use, it corrupts articles at this kind of volume [4]? Why would we want to encourage such a thing?”
Makes me sad.
[0] https://www.mediawiki.org/wiki/Talk:Parsoid/Normalizations [1] https://www.mediawiki.org/wiki/Parsoid/Normalizations [2] https://lists.wikimedia.org/pipermail/wikitech-l/2015-April/081453.html [3]
https://en.wikipedia.org/wiki/Wikipedia:Village_pump_%28proposals%29#Gradual...
[4]
https://en.wikipedia.org/w/index.php?title=Special:AbuseLog&offset=&...
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Wikitech-l mailing list Wikitech-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wikitech-l