[Mediawiki-api] Re: Old Rendering or new approach

3 Feb 2023

Hi,
On 2/3/23 01:11, Max Vlasov wrote:
...

> Is there a plan to support the old design with some additional
> parameters? Even if not forever, just for comparison purposes it would
> be useful for me.

You can use ?useskin=vector to fetch pages with the old HTML structure.
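
For illustration, a minimal Python sketch of such a request. It assumes the usual /wiki/ page path on en.wikipedia.org; the helper names and the User-Agent string are made up for the example and should be adapted to your own script.

```python
from urllib.parse import quote, urlencode
from urllib.request import Request, urlopen

# Assumption: the standard article path on English Wikipedia.
WIKI_BASE = "https://en.wikipedia.org/wiki/"


def legacy_skin_url(title, base=WIKI_BASE):
    """Build a page URL that asks for the old Vector skin via ?useskin=vector."""
    page = quote(title.replace(" ", "_"), safe="")
    return base + page + "?" + urlencode({"useskin": "vector"})


def fetch_legacy_html(title):
    """Download the page HTML rendered with the old skin (hypothetical helper)."""
    request = Request(
        legacy_skin_url(title),
        # Placeholder User-Agent; replace with one that identifies your tool.
        headers={"User-Agent": "example-script/0.1 (you@example.org)"},
    )
    with urlopen(request, timeout=30) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_legacy_html("Albert Einstein")` would then return the markup that existing XPath expressions written for the old layout expect.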
...

> Is there another, better way to get the text? Basically, I do some
> guesswork by converting some of the classical tags like H1/H2 etc.
> into pseudo headings and so on, bullet tags into bullet chars, etc. The
> issue with the new design for me is that floating content is now at the
> same level as all the items of the //main[@id='content'] tag, so I
> will have to do some filtering to get the main content without
> supplemental information.

Have you looked at Parsoid HTML? It's annotated HTML that makes it
pretty straightforward to parse and extract content from wiki pages.
See
https://en.wikipedia.org/api/rest_v1/#/Page%20content/get_page_html__title_
for the API and https://www.mediawiki.org/wiki/Specs/HTML for the format.
-- Legoktm
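
A rough sketch of how the Parsoid route could look in Python, using only the standard library. The /page/html/{title} path is the endpoint behind the API link above. The extractor class, its name, and the "#" / "*" plain-text conventions are assumptions chosen to mirror the heading and bullet conversion described in the question, not something the API prescribes.

```python
from html.parser import HTMLParser
from urllib.parse import quote
from urllib.request import Request, urlopen

REST_BASE = "https://en.wikipedia.org/api/rest_v1/page/html/"


def parsoid_url(title):
    """URL of the REST endpoint that returns Parsoid HTML for a page."""
    return REST_BASE + quote(title.replace(" ", "_"), safe="")


class PlainTextExtractor(HTMLParser):
    """Turn headings, paragraphs and list items into plain-text lines."""

    HEADINGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
    BLOCKS = HEADINGS | {"p", "li"}
    SKIP = {"style", "script"}  # content that is never reader-visible text

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.lines = []
        self._buf = []
        self._prefix = ""
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1
        elif tag in self.BLOCKS:
            self._flush()
            if tag in self.HEADINGS:
                self._prefix = "#" * int(tag[1]) + " "  # h2 -> "## "
            elif tag == "li":
                self._prefix = "* "

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1
        elif tag in self.BLOCKS:
            self._flush()

    def handle_data(self, data):
        if not self._skip_depth:
            self._buf.append(data)

    def _flush(self):
        # Collapse whitespace and emit the buffered text as one line.
        text = " ".join("".join(self._buf).split())
        if text:
            self.lines.append(self._prefix + text)
        self._buf = []
        self._prefix = ""


def html_to_lines(html):
    """Parse an HTML string and return the extracted text lines."""
    parser = PlainTextExtractor()
    parser.feed(html)
    parser.close()
    parser._flush()
    return parser.lines


def fetch_page_lines(title):
    """Fetch Parsoid HTML for a title and extract its text (needs network)."""
    request = Request(
        parsoid_url(title),
        # Placeholder User-Agent; replace with one that identifies your tool.
        headers={"User-Agent": "example-script/0.1 (you@example.org)"},
    )
    with urlopen(request, timeout=30) as response:
        return html_to_lines(response.read().decode("utf-8"))
```

This is deliberately simple: it ignores tables, infoboxes and references, which would need additional filtering based on the attributes documented in the HTML spec linked above.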
