On 09/10/2013 02:18 AM, Feng Yi wrote:
Hi, 
  I'm in google wiki knowledge team and currently work on Parsoid.
  I found there are some diffs between 
http://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec and real Parsoid output. 
  For example: RFC 1945   
in page:
   <a rel="mw:ExtLink/PMID"
   resource="http://purl.org/commons/html/pmid/20610307"
   href="//www.ncbi.nlm.nih.gov/pubmed/20610307?dopt=Abstract"
   about="#_:PMID-20610307-1234"
   class="external mw-magiclink-pmid">
  PMID 20610307
</a>
in Parsoid:
<a typeof="mw:ExtLink/RFC" href="//tools.ietf.org/html/rfc1945" rel="mw:ExtLink/RFC">RFC 1945</a></p>

   I have removed the "parsoid-data" and other global tags, but still a lot of diff.
   The page is updated several days ago, so is the released Parsoid  not newest?

Hi Feng,

The spec for RFC is this:
<a rel="mw:ExtLink/RFC" 
   href="http://tools.ietf.org/html/rfc1945"
   resource="urn:ietf:rfc:1945"
   about="#_RFC-1945-1234"
   class="external mw-magiclink-rfc">
  RFC 1945
</a>
and output after discarding data-parsoid is:
<a rel="mw:ExtLink/RFC" 
   href="//tools.ietf.org/html/rfc1945"
   typeof="mw:ExtLink/RFC">
  RFC 1945
</a>
So, there are diffs, yes.  We are in the process of cleaning up our spec (and actively making a few changes) -- in some cases, the spec is outdated, and in other cases, the spec has information about proposed changes (but not yet implemented).  In the next week or so, we'll try to clean up our spec (removing/updating outdated information, or adding annotations where the spec is proposing changes).

Thanks for alerting us about the mismatch.

Subbu.