On 11/9/07, Thomas Dalton thomas.dalton@gmail.com wrote:
Backwards compatibility. The main suggestion I've seen is rewriting the parser in such a way as to make it behave like the old one in everything except a few unavoidable corner cases (bold italics is not a corner case).
I would view bold italics with adjacent apostrophes as a corner case. The behavior in that case makes very little sense and I doubt it's being widely used.
On 11/9/07, Stephen Bain stephen.bain@gmail.com wrote:
Well then, should it just take everything until the next whitespace?
Remember that some languages (like CJK) don't use whitespace to separate words. You would eat the entire paragraph. Regardless, I think we could probably do with eating all letter-characters (and number-characters? maybe not) from any alphabet that uses whitespace, for every language. Especially useful for sites like Commons or Meta or mediawiki.org. I've remarked on this before.
Anyway, if this behavior is not consistent across languages, we have the obvious problem that the parsing grammar depends on the language. This is probably not desirable. I suspect it would be entirely possible to make this behavior consistent across languages in this case, however, as I say.