On 7/22/19 10:51 AM, Arlo Breault wrote:
On Jul 22, 2019, at 5:11 AM, Sergey F sergey@fidoman.ru wrote:
<p>test2<span> test3 </span></p>
The result of conversion is:
test2<span> test3
</span>
Yes, this looks like a bug
See https://gerrit.wikimedia.org/r/c/mediawiki/services/parsoid/+/524811
Thanks
Thanks Arlo!
Sergey:
It is possible that Arlo's bugfix will satisfy your use case.
However, note that Parsoid will introduce <nowiki> protection around characters that will parse differently if not escaped. So "<p> foo<p>" will convert to "<nowiki> </nowiki>foo". You can avoid this by passing the 'scrub_wikitext' flag to the html -> wikitext API endpoint [1]. This tells Parsoid to normalize[2] the input HTML to eliminate the need for those nowikis.
FYI in case this flag is pertinent to your use case.
Subbu.
1. https://www.mediawiki.org/wiki/Parsoid/API#For_HTML_-%3E_wikitext_requests