On Thu, Aug 17, 2006 at 02:03:07AM -0400, Simetrical wrote:
On 8/17/06, Jay R. Ashworth jra@baylink.com wrote:
On Thu, Aug 17, 2006 at 12:21:13AM -0400, Eric Astor wrote:
Single case that shows something interesting: '''hi''hello'''hi'''hello''hi'''
Try running it through MediaWiki, and what do you get? <b>hi<i>hello</i></b><i>hi<b>hello</b></i><b>hi</b>
I suspect that the "proper" parsing of that particular combination is undefined, and therefore you cna do anything you like.
Actually, the proper parsing of that particular combination is defined perfectly well. The problem is that HTML doesn't allow overlapping tags. If it did allow them, you would get a straight substitution:
'''hi''hello'''hi'''hello''hi''' -> <b>hi<i>hello</b>hi<b>hello</i>hi</b>
Each ''' becomes </?b>, each '' becomes </?i>. That's definitely defined behavior. But to make it valid HTML, you need to close the <i> before the <b>, then reopen it after the <b>, and so on. So this isn't something that can be worked around.
Sorry; I missed that he had not included the canonical example of this problem, which is that if you get ''''', there can be constructions wherein it's not possible to determine whether you're ending a bold and starting an italic, or the reverse, without context.
Wasn't someone from SoC working on this?
Did we ever get a final status report from the SoC work? (It's done now, isn't it?)
There were two SoC projects, one to have embeddable media and one to have a forum-like talk page instead of our current wiki thing. The former I don't know what happened to, the latter we have prototype code and a largely completed design for (and I believe the author of that project has agreed to try to see it through). The deadline is in a few days, I think, unless it's already past.
Ah. I had missed the first one, and am interested to see how the second one works out.
Cheer,s -- jra