[Toolserver-l] Static dump of German Wikipedia

Platonides platonides at gmail.com
Sat Sep 25 18:02:17 UTC 2010


Marco Schuster wrote:
> On Sat, Sep 25, 2010 at 12:56 AM, Platonides <platonides at gmail.com> wrote:
>>> Actually it's not so much that they are on the bottom of the list as
>>> that there are two people potentially looking at them, and they are
>>> Tomasz (who is also doing mobile) and me (and I am doing the XML dumps
>>> rather than the HTML ones, until they are reliable and happy).
>>>
>>> However if you are interested in working on these, I am *very* happy to
>>> help with suggestions, testing, feedback, etc., even while I am still
>>> woroking on the XML dumps.  Do yuu have time and interest?
>>>
>>> Ariel
>>
>> Most (all?) articles should be already parsed in memcached. I think the
>> bottleneck would be the compression.
>> Note however that the ParserOutput would still need postprocessing, as
>> would ?action=render. The first thing that comes to my mind is to remove
>> the edit links (this use case alone seems enough for implementing
>> editsection stripping). Sadly, we can't (easily) add the edit sections
>> after the rendering.
> This should be doable using a simple regex which plainly goes for
> <span class="editsection">.
> 
> Marco

It is (with the current skins). I meant as a core feature, which would
need to be more precise.



More information about the Toolserver-l mailing list