On 8/17/06, Jay R. Ashworth <jra@baylink.com> wrote:
On Thu, Aug 17, 2006 at 12:21:13AM -0400, Eric Astor wrote:
Single case that shows something interesting: '''hi''hello'''hi'''hello''hi'''
Try running it through MediaWiki, and what do you get? <b>hi<i>hello</i></b><i>hi<b>hello</b></i><b>hi</b>
I suspect that the "proper" parsing of that particular combination is undefined, and therefore you can do anything you like.
Actually, the proper parsing of that particular combination is defined perfectly well. The problem is that HTML doesn't allow overlapping tags. If it did allow them, you would get a straight substitution:
'''hi''hello'''hi'''hello''hi''' -> <b>hi<i>hello</b>hi<b>hello</i>hi</b>
Each ''' becomes </?b>, each '' becomes </?i>. That's definitely defined behavior. But to make it valid HTML, you need to close the <i> before the <b>, then reopen it after the <b>, and so on. So this isn't something that can be worked around.
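To make the difference concrete, here is a minimal sketch in Python (not MediaWiki's actual PHP code). The names TAGS, naive and nested are made up for illustration, and the tokenizer is a deliberate simplification: it only recognizes bare ''' and '' runs, and ignores cases such as ''''' or unbalanced apostrophes, for which the real parser has extra rules. naive() does the straight substitution; nested() keeps a stack of open tags and, when asked to close a tag that is not innermost, closes the tags opened after it, closes it, then reopens the others.

```python
import re

# Hypothetical mapping for this sketch: wiki quote run -> HTML tag name.
TAGS = {"'''": "b", "''": "i"}


def naive(text):
    """Straight substitution; the result may contain overlapping tags."""
    open_tags = set()
    out = []
    # The capturing group keeps the quote runs in the split result.
    for tok in re.split(r"('''|'')", text):
        tag = TAGS.get(tok)
        if tag is None:
            out.append(tok)              # ordinary text
        elif tag in open_tags:
            open_tags.remove(tag)
            out.append(f"</{tag}>")
        else:
            open_tags.add(tag)
            out.append(f"<{tag}>")
    return "".join(out)


def nested(text):
    """Same toggling, but tags are closed and reopened so they never overlap."""
    stack = []                           # currently open tags, outermost first
    out = []
    for tok in re.split(r"('''|'')", text):
        tag = TAGS.get(tok)
        if tag is None:
            out.append(tok)
        elif tag in stack:
            idx = stack.index(tag)
            reopen = stack[idx + 1:]     # tags opened after the one being closed
            for t in reversed(stack[idx:]):
                out.append(f"</{t}>")    # close inner tags first, then the target
            del stack[idx:]
            for t in reopen:
                out.append(f"<{t}>")     # reopen what was closed only for nesting
                stack.append(t)
        else:
            stack.append(tag)
            out.append(f"<{tag}>")
    return "".join(out)


sample = "'''hi''hello'''hi'''hello''hi'''"
print(naive(sample))   # <b>hi<i>hello</b>hi<b>hello</i>hi</b>
print(nested(sample))  # <b>hi<i>hello</i></b><i>hi<b>hello</b></i><b>hi</b>
```

On the sample string, naive() reproduces the overlapping substitution above, and nested() reproduces the output MediaWiki gives.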
Wasn't someone from SoC working on this?
Did we ever get a final status report from the SoC work? (It's done now, isn't it?)
There were two SoC projects: one for embeddable media, and one for a forum-like talk page to replace our current wiki-style one. I don't know what happened to the former. For the latter we have prototype code and a largely completed design, and I believe the author of that project has agreed to try to see it through. The deadline is in a few days, I think, unless it has already passed.