Hi Gabriel,
thank you for that information. Actually, I already knew of the project;
that is why I could imagine a process like the one I described. IMHO this
doesn't need much i18n, because there is a defined syntax both for
wikitext (wt) and for HTML.
On 2012-12-13 20:57, Gabriel Wicke wrote:
On 12/13/2012 06:43 AM, Marco Fleckinger wrote:
Implementing this is not very easy, but developers may be able to reuse
some of the old ideas. Parsing in the other direction has to be
implemented from scratch, but it is easier because everything is in a
tree, not in a single text string.
Since neither de- nor serializing involves any user interface, testing
could be done automatically quite easily by comparing the results of the
conventional and the new parser. The result of the serialization can be
compared with the original markup.
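To illustrate the idea, here is a minimal round-trip test sketch. The `parse` and `serialize` functions below are toy stand-ins for a real wikitext parser and serializer, not actual Parsoid code:

```python
# Hypothetical round-trip check: serialize(parse(x)) should reproduce x.
# `parse` and `serialize` are toy stand-ins, not the real Parsoid API.

def parse(wikitext):
    # Toy parser: split into a "tree" of lines (a real parser builds a DOM).
    return wikitext.split("\n")

def serialize(tree):
    # Toy serializer: join the tree back into markup.
    return "\n".join(tree)

def round_trips(original):
    """Return True if serializing the parsed tree reproduces the input."""
    return serialize(parse(original)) == original

print(round_trips("== Heading ==\nSome ''italic'' text."))  # True
```

The point is that no human inspection is needed: any page whose round-trip output differs from the original markup is automatically flagged as a failure.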
we (the Parsoid team) have been doing many of the things you describe
over the last year:
Ah, that was the project's name. ;-)
* We wrote a new bidirectional parser / serializer -
see
http://www.mediawiki.org/wiki/Parsoid. This includes a grammar-based
tokenizer, async/parallel token stream transformations and HTML5 DOM
building.
Thank you for pointing that out. It will also be interesting for one of
my private projects.
* We developed an HTML5 / RDFa document model spec at
http://www.mediawiki.org/wiki/Parsoid/MediaWiki_DOM_spec.
* Our parserTests runner tests wt2html (wikitext to html), wt2wt,
html2html and html2wt modes with the same wikitext / HTML pairs as used
in the PHP parser tests. We have roughly doubled the number of such
pairs in the process.
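The four test modes could, in outline, look like the following sketch. The `wt2html` and `html2wt` converters here are deliberately trivial stand-ins (handling only bold markup), not the real Parsoid converters; the structure of the runner is the point:

```python
# Hypothetical parserTests-style runner: each wikitext/HTML pair is checked
# in four conversion modes. wt2html/html2wt are toy stand-ins for Parsoid.

def wt2html(wt):
    # Toy converter: '''bold''' -> <b>bold</b>
    return wt.replace("'''", "<b>", 1).replace("'''", "</b>", 1)

def html2wt(html):
    # Toy converter: <b>bold</b> -> '''bold'''
    return html.replace("<b>", "'''").replace("</b>", "'''")

PAIRS = [("'''bold'''", "<b>bold</b>")]

def run_tests(pairs):
    # Each mode compares a conversion (or round trip) against the fixture.
    modes = {
        "wt2html":   lambda wt, html: wt2html(wt) == html,
        "html2wt":   lambda wt, html: html2wt(html) == wt,
        "wt2wt":     lambda wt, html: html2wt(wt2html(wt)) == wt,
        "html2html": lambda wt, html: wt2html(html2wt(html)) == html,
    }
    return {name: all(check(wt, html) for wt, html in pairs)
            for name, check in modes.items()}

print(run_tests(PAIRS))
```

Reusing the same wikitext/HTML pairs for all four modes means every fixture exercises the parser and the serializer at once, which is presumably why the pair count mattered enough to double.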
* Automated and distributed round-trip tests are currently run over a
random selection of 100k English Wikipedia pages:
http://parsoid.wmflabs.org:8001/. This test infrastructure can easily be
pointed at a different set of pages or another wiki.
Once good results are reached on the English pages, it should not be a
big deal to run the tests on other wikis as well.
Parsoid is by no means complete, but we are very happy with how far we
have already come since last October.
Congratulations on the results so far.
Cheers
Marco