On Sun, Apr 4, 2010 at 11:15 PM, Platonides <platonides@gmail.com> wrote:
El 04/04/10 17:49, Nicolas Vervelle wrote:
> Hi,
>
> Is there a way to parse a wiki text to get a simplified text (without
> HTML, external and internal replaced by their text, ...) ?
>
> My need is the following :
>
>     * The project Check Wikipedia
>       <http://de.wikipedia.org/wiki/Benutzer:Stefan_K%C3%BChn/Check_Wikipedia>
>       uses a configuration file for each wiki (for example: en
>       <http://toolserver.org/%7Esk/checkwiki/enwiki/enwiki_translation.txt>)
>     * It's used among other things to generate pages in Wiki format (for
>       example: en
>       <http://en.wikipedia.org/wiki/Wikipedia:WikiProject_Check_Wikipedia>)
>     * In the configuration file, you can see for example a description
>       of error n°1: /error_001_desc_script=This article has no bold
>       title like <nowiki>'''Title'''</nowiki>/, so it contains Wiki text.
>     * I am writing a Java program (WikiCleaner
>       <http://en.wikipedia.org/wiki/User:NicoV/Wikipedia_Cleaner/Documentation>)
>       to help fixing the errors reported by this tool. I'd like to
>       display this text in my program as a simple text: /This article
>       has no bold title like '''Title'''./
>
> Thanks,
> Nico

Use a text label.

Not sure I understand.
How will the wiki formatting will be interpreted by a text label ?

Nico