Re: [Mediawiki-l] Wikitext grammar

6 Aug 2010

If you only need to extract the first paragraph of Wikipedia articles, no problem.
2010/8/6 Katharina Wolkwitz wolkwitz@fh-swf.de
...
Hi,
On 05.08.2010 16:47, lmhelp2 wrote:
...
Thank you!
So here is the list I have for the moment:
I need to ignore lines:

containing: {{...}}
        => possibly spreading over several lines,
        => being possibly nested {{... {{ ... }} ... }}.
containing: [[...]]
        => being possibly nested [[... [[ ... ]] ... ]].
equal to: __TOC__
equal to: __NOTOC__
beginning with the '=' character
beginning with the '*' character

I don't think you should ignore lines beginning with the '*' character -
those may include the wanted first paragraph of the text, as the '*' is
just a way of formatting the page...
Greetings
Katharina
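
For illustration, the filter rules listed above could be sketched in Python
roughly as follows. This is not code from the thread: the function name
first_paragraph_lines and the skip_bullets flag are made up for the example,
and counting "{{" and "}}" per line is only an approximation of real wikitext
parsing. By default it follows the suggestion to keep '*' lines and just
strips the list marker.

```python
def first_paragraph_lines(wikitext, skip_bullets=False):
    """Return the lines of `wikitext` that survive the filter rules.

    Hypothetical helper: a rough line filter, not a wikitext parser.
    """
    kept = []
    template_depth = 0  # number of "{{" still open from earlier lines

    for line in wikitext.splitlines():
        stripped = line.strip()

        # A line is part of a template if one is still open from a
        # previous line or a new one starts here (handles nesting and
        # templates spreading over several lines).
        in_template = template_depth > 0 or "{{" in line
        template_depth += line.count("{{") - line.count("}}")
        template_depth = max(template_depth, 0)
        if in_template:
            continue

        # Lines containing [[...]], nested or not.
        if "[[" in line:
            continue

        # Magic words and headings.
        if stripped in ("__TOC__", "__NOTOC__"):
            continue
        if stripped.startswith("="):
            continue

        # '*' is only list formatting: drop the line or keep its text.
        if stripped.startswith("*"):
            if skip_bullets:
                continue
            stripped = stripped.lstrip("*").strip()

        if stripped:
            kept.append(stripped)

    return kept


SAMPLE = """{{Infobox
| name = X {{nested}}
}}
__NOTOC__
= Heading =
[[File:a.png|thumb|[[link]] caption]]
* A bullet line
Plain first paragraph.
"""

if __name__ == "__main__":
    for text in first_paragraph_lines(SAMPLE):
        print(text)
```

With the sample text, the default call keeps "A bullet line" and "Plain first
paragraph."; passing skip_bullets=True keeps only the latter. Note that
dropping every line containing [[...]] will also drop ordinary sentences with
internal links, so that rule may need to be relaxed for real articles.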

MediaWiki-l mailing list
MediaWiki-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/mediawiki-l
-- 
{+}Nevinho
Come join the Movimento Colaborativo http://sextapoetica.com.br !!