[Mediawiki-api] parse output

Michael Dale mdale at wikimedia.org
Tue Feb 24 17:33:10 UTC 2009


*snip*
> Yes, it's been filed before and WONTFIXed because parsing dozens or 
> hundreds of pages in one request is kind of scary performance-wise

but clearly it would be more resource efficient than issuing 30 separate 
additional requests... maybe we could enable it with a low row return 
count say 30 ? It should be able to grab the output from the parse cache 
no?

With my use case of returning search result descriptions...it does not 
really need html it just needs striped wikitext or even a striped 
segment of wikitext.

So here are a few possible ways forward:

* I can switch on 30 extra requests if we need to highlight the problem....
* I could try and use one of the javascript wikitext -> html converters
* Maybe we could support the output striped wikitext (really what we 
want for search results) ...

It appears Lucene and the internal mysql store the index in striped form 
if we could add access to that from the api that would be ideal way 
forward I think.

--michael



More information about the Mediawiki-api mailing list