New subject: RDFa and Microdata in MediaWiki

18 Jan 2010

Aryeh Gregor wrote:
...
  RDFa is a way to embed data in HTML more robustly than with attributes
 like class and title, which are reserved for author use or have
 existing functionality. It allows you to specify an external
 vocabulary that adds some semantics to your page that HTML is not
 capable of expressing by itself.
More to the point, it allows an RDF graph to be overlaid onto an XHTML document so that
the XHTML document and the RDF graph can share some strings. The XHTML data model
isn't extended per se. Instead, a separate RDF graph can be extracted.

...
  Both RDFa+HTML and Microdata are Working Drafts at the W3C right now
It's true that both HTML+RDFa and Microdata have been published in Working Drafts at
the W3C. However, Microdata has never been through a Working Group Decision to publish as
a First Public Working Draft while HTML+RDFa has. Microdata was added to a Working Draft
after FPWD and there has since been a Working Group decision to take Microdata out of that
spec.

It is reasonable to expect that HTML+RDFa and Microdata could soon be at the same stage
Process-wise, but it's inaccurate to portray them as being at the same stage right now.

...
  I should note that currently Google and a couple of others support
 RDFa but not Microdata.
See http://lists.w3.org/Archives/Public/public-rdf-in-xhtml-tf/2009Sep/0126.html (search
for the word "deviate").

Manu Sporny wrote:
...
  The general points that you made were riddled with technical
 inaccuracies, bad advice, and if implemented by the MediaWiki community,
 would have resulted in semantic data that would have been ambiguous at
 best and erroneous at worst.
With that introduction, I think it's fair to evaluate your message for inaccuracies or
relevant omissions as well.

...
  The above could be marked up in RDFa, with pre-defined vocabs, like so:
It should be noted that the concept of "pre-defined vocabs" is neither in the
HTML+RDFa draft nor in the RDFa in XHTML spec from the XHTML2 WG.

...
  <p about="EmeryMolyneux-terrestrialglobe-1592-20061127.jpg"
     typeof="dctype:StillImage">
  <span property="dc:title">Emery Molyneux Terrestrial Globe</span>
  by <a rel="cc:attributionUrl" href="http://example.org/bob/"
        property="cc:attributionName">Bob Smith</a>
  is licensed under a <a rel="license"
  href="http://creativecommons.org/licenses/by-sa/3.0/us/">Creative
  Commons Attribution-Share Alike 3.0 United States License</a>.</p>
Hiding the CURIE declarations is a common pattern when advocating RDFa: It makes RDFa
appear tidier than it is. To write this in RDFa in XHTML (the RDFa spec you say is safe to
use for deployment), one would need to declare the CURIE prefixes:

<p xmlns:dctype="http://purl.org/dc/dcmitype/"
   about="EmeryMolyneux-terrestrialglobe-1592-20061127.jpg"
   typeof="dctype:StillImage">
<span xmlns:dc="http://purl.org/dc/elements/1.1/"
      property="dc:title">Emery Molyneux Terrestrial Globe</span>
by <a xmlns:cc="http://creativecommons.org/ns#"
      rel="cc:attributionUrl" href="http://example.org/bob/"
      property="cc:attributionName">Bob Smith</a>
is licensed under a <a rel="license"
href="http://creativecommons.org/licenses/by-sa/3.0/us/">Creative
Commons Attribution-Share Alike 3.0 United States License</a>.</p>

Philip Jägenstedt already covered other points about the examples.
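
To make concrete what the prefix declarations are for, here is a minimal Python sketch of
CURIE expansion. The prefix table is copied from the xmlns declarations in the example
above; the function name expand_curie is hypothetical, and the sketch deliberately ignores
the per-element scoping of declarations that a real RDFa processor has to track.

```python
# Prefix-to-IRI mappings copied from the xmlns:* declarations in the example.
# Assumption: all three are treated as in scope everywhere, which a real
# RDFa processor would not do (scoping follows the element tree).
PREFIXES = {
    "dctype": "http://purl.org/dc/dcmitype/",
    "dc": "http://purl.org/dc/elements/1.1/",
    "cc": "http://creativecommons.org/ns#",
}


def expand_curie(curie, prefixes):
    """Expand 'prefix:reference' to a full IRI by simple concatenation.

    Raises ValueError if there is no colon and KeyError if the prefix
    has not been declared.
    """
    prefix, sep, reference = curie.partition(":")
    if not sep:
        raise ValueError(f"not a CURIE: {curie!r}")
    return prefixes[prefix] + reference


if __name__ == "__main__":
    for curie in ("dctype:StillImage", "dc:title", "cc:attributionName"):
        print(curie, "->", expand_curie(curie, PREFIXES))
    # With no declarations in scope, the same value cannot be resolved.
    try:
        expand_curie("dc:title", {})
    except KeyError as missing:
        print("undeclared prefix:", missing)
```

Without the declarations, a value such as dc:title has nothing to expand against, which
is why leaving them out of an example understates how much markup is required.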

...
  However - XHTML1+RDFa is a published W3C Recommendation and it is safe
 to use it for deployment.
RDFa in XHTML has indeed been published as a Recommendation jointly by the Semantic Web
Deployment Working Group and the XHTML2 Working Group. However, you fail to mention that
even though the document mentions "HTML" in its first sentence, all the
normative matter concerns strictly XHTML and the document has gone through the W3C Process
as a specification that applies to XML.

MediaWiki uses the text/html media type and, thus, its pages get processed as HTML, so it
would be inappropriate to rely on a spec that had been reviewed as an XML spec.
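
As a rough illustration of why the processing model matters, the following Python sketch
feeds the same fragment to a non-namespace-aware HTML tokenizer and to a namespace-aware
XML parser. The choice of xmlns handling as the example and the use of Python's
standard-library parsers are assumptions made for demonstration only; html.parser is not
a browser-grade HTML parser, and the class name AttrCollector is hypothetical.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# One element from the example above, with its prefix declaration.
MARKUP = (
    '<span xmlns:dc="http://purl.org/dc/elements/1.1/" '
    'property="dc:title">Emery Molyneux Terrestrial Globe</span>'
)


class AttrCollector(HTMLParser):
    """Records the attributes of the last start tag seen."""

    def __init__(self):
        super().__init__()
        self.attrs = {}

    def handle_starttag(self, tag, attrs):
        self.attrs = dict(attrs)


# HTML-style processing: xmlns:dc is just another attribute name.
collector = AttrCollector()
collector.feed(MARKUP)
collector.close()
html_attrs = collector.attrs

# XML-style processing: xmlns:dc is consumed as a namespace declaration
# and does not appear among the element's attributes.
xml_attrs = ET.fromstring(MARKUP).attrib

if __name__ == "__main__":
    print("as HTML:", sorted(html_attrs))
    print("as XML: ", sorted(xml_attrs))
```

Under these assumptions the two parsers expose different attribute sets for identical
bytes, so processing rules written against the XML view do not carry over to text/html
without further specification work.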

I think it's misleading to promote text/html deployment of specs whose normative
matter has been written and reviewed for XML. The most egregious example of this is that
the XHTML2 WG has written the normative matter of XHTML 1.x specs for XML but then
published a Working Group Note (Notes can be pretty much anything and don't go through
the W3C Recommendation track Process) that gives advice on deployment as text/html
(http://www.w3.org/TR/xhtml-media-types/).

Furthermore, the ease of getting a spec to REC at the W3C depends on how many people are
interested in the spec. The more people are interested in a spec, the more review comments
there are. The flip side is that when there's *less* interest in a spec, it's
easier to get it to Recommendation due to fewer comments raised. Thus, progress along the
REC track isn't a commensurable indicator of technical merit or technical maturity
across different specs and WGs.

Also, when assessing the "safe" deployability of RDFa in XHTML, it's relevant to
consider that:
 1) RDFa in XHTML was knowingly (see
http://lists.whatwg.org/pipermail/whatwg-whatwg.org/2008-August/015913.html) progressed on
the Recommendation track without first resolving how RDFa works with HTML.
 2) RDFa 1.1 is in the works, and the changes being considered make RDFa 1.0 look like
a beta release. (Which is understandable, since a good part of the technical review of
RDFa has occurred after RDFa in XHTML was rushed to REC.)

-- 
Henri Sivonen
hsivonen(a)iki.fi
http://hsivonen.iki.fi/

Re: [Wikitech-l] RDFa and Microdata in MediaWiki