As devotees of web standards are aware, HTML5 is no longer an XML
variant (nor is it SGML).
This occasionally leads to fun times in Visual Editor and Parsoid
land, as we try to work around various browser incompatibilities to
ensure that documents are parsed consistently. Parsoid uses an HTML5
parser, but it uses its own non-HTML5-spec serializer (ie, not
document.body.outerHTML) in order to emit XML-compatible documents
that work around certain browser bugs (and use intelligent quoting to
reduce document size). Visual Editor tries to parse parsoid output
using the browser's XML serializer due to bugs in Internet Explorer (I
believe) and then fixes up the output to match the HTML5 parser spec
for <pre> tags. I'm not sure exactly how Visual Editor serializes its
documents to send them back to Parsoid. I bet it's not quite the same
way Parsoid serializes them.
In any case, I filed bugs with the W3C months ago to try to fix some
of the specs. In particular, there is no official spec algorithm for
serializing an HTML document as XML. That may now be fixed! See
https://www.w3.org/Bugs/Public/show_bug.cgi?id=13410 (start at comment
13 if you are impatient).
It would probably be worth auditing VE and Parsoid's serialization
algorithms to ensure that they are compatible with the new draft
standard ( http://www.w3.org/TR/DOM-Parsing/#dfn-concept-xml-serialization-algorithm
), so that we can suggest improvements if we've got interesting corner
cases and weird hacks that turn out to be needed for interoperability
in the real world.
(And see also https://www.w3.org/Bugs/Public/show_bug.cgi?id=25225 --
it turns out that not even the HTML serializer API is completely
defined in the spec, although `outerHTML` provides a means to get at
the HTML fragment serializer. We had some issues with disappearing
whitespace in the outer contexts of HTML documents as a result.)
--scott
--
(http://cscott.net)
Hoi,
I understand that Phabricator is the place to be. My user for phabricator
thinks I use an old WMF profile. Can someone please change this to this
email address for me?
As it is I am stuck. I cannot change it.
Thanks,
GerardM
Hello
I want to work on [0] project. I have worked on parsers using Flex and
Bison and done file translation of DXF file format and text format.
Because of my past working experience in parsers I have keen interest
in this project and want to contribute to this project. I have gone
through [0] and all other links mentioned in it. I want to know about
the future plans of the mentors regarding the project, so that I can
contribute to it with my programming skills.
[0] - https://www.mediawiki.org/wiki/FOSS_Outreach_Program_for_Women/Round_9#Exte…
--
Alisha Jain
blog - jainalisha14.wordpress.com
"Your Failure does not define you, but your determination does."
Hi all,
Due to the POODLE vulnerability in SSL3.0 that's been announced this
week and has made its round through the media, we decided that we
needed to disable SSL3.0 on all our HTTPS services today, to protect
the security of all our users. The bulk of that change has been
deployed today at 15:00 UTC for the wikis, and the remaining HTTPS
services are getting the same treatment throughout the day. Please see
our blog post on this topic for details:
http://blog.wikimedia.org/2014/10/17/protecting-users-against-poodle-by-rem…
If you see or hear about anyone having issues connecting to our sites
over HTTPS or logging in, please direct them at the link above, and
urge them to upgrade their software. Unfortunately due to the nature
of HTTPS we're not able to provide a fallback when users get an error
message due to this. We're still looking into the possibility to
provide affected users with an informative error message upon login
however, before they get redirected from HTTP to HTTPS.
As a side note, we've also deployed Google's SCSV SSL extension[1] on
our servers yesterday, such that the attack surface for such
vulnerabilities will be reduced in the future for clients which
support this extension.
[1] http://googleonlinesecurity.blogspot.nl/2014/10/this-poodle-bites-exploitin…
Thanks,
--
Lead Operations Architect
Director of Technical Operations
Wikimedia Foundation
Hello and welcome to the latest edition of the WMF Engineering Roadmap
and Deployment update.
The full log of planned deployments next week can be found at:
<https://wikitech.wikimedia.org/wiki/Deployments#Week_of_October_20th>
A quick list of notable items...
== Early week ==
* 10% of reader (non-logged-in) traffic will be served by HHVM servers
** see: <https://www.mediawiki.org/wiki/HHVM/About>
== Monday ==
* The IEG Review application will be deployed
** <https://bugzilla.wikimedia.org/show_bug.cgi?id=71597>
== Tuesday ==
* MediaWiki deploy
** group1 to 1.25wmf1: All non-Wikipedia sites (Wiktionary, Wikisource,
Wikinews, Wikibooks, Wikiquote, Wikiversity, and a few other sites)
** <https://www.mediawiki.org/wiki/MediaWiki_1.25/wmf4>
== Thursday ==
* MediaWiki deploy
** group2 to 1.25wmf4 (all Wikipedias)
** group0 to 1.25wmf5 (test/test2/testwikidata/mediawiki)
Thanks and as always, questions and comments welcome,
Greg
--
Greg Grossmeier
Release Team Manager