On 06/29/2015 09:20 AM, Brad Jorsch (Anomie) wrote:
On Fri, Jun 26, 2015 at 11:52 AM, Subramanya Sastry ssastry@wikimedia.org wrote:
The "PHP parser" used in production has 3 components: the preprocessor, the core parser, Tidy. Parsoid relies on the PHP preprocessor (access via the mediawiki API), so that part of the PHP parser will continue to be in operation.
As noted in my update, we are working towards read views served by Parsoid HTML which requires several ducks to be lined up in a row. When that happens everywhere, the core PHP parser and Tidy will no longer be used.
Do we have plans for avoiding code rot in "unused" the PHP parser code that would affect smaller third-party sites that don't using Parsoid?
My response to your other email covers quite a bit of this.
As far as I have observed, the PHP parser code has been quite stable for a while. And, small third-party sites are unlikely to have complex requirements and are less likely to hit serious bugs. In any case, we'll do a good-faith effort to keep the PHP parser maintained and we'll fix critical and really high priority bugs. But, simply by virtue of us being a small team with multple reponsibilities, we will prioritize reducing complexity in Parsoid over keeping the PHP parser maintained. In the long run, I think that is a better path to bringing the two systems together.
Subbu.