Hi Erick,
What you are calling "macros" are actually called "templates" in the
Wikipedia (or, more generally, MediaWiki) context. In Portuguese, the name is
"Predefinição."
You can find all the templates on a wiki through the "Special:AllPages" page,
and then choose the "Template" or "Predefinição" namespace.
Here is a direct link. I suspect you will find there are many thousands of templates;
breaking that down into those that supply significant text (which I would guess are the
majority) vs. those that do not will probably be a difficult task; sorry, I don't have
much to offer in that area.
http://pt.wikipedia.org/w/index.php?title=Especial:Todas_as_páginas&nam…
I hope this helps!
-Pete
--
Pete Forsyth
[[User:Peteforsyth]]
peteforsyth(a)gmail.com
503-383-9454 mobile
On Feb 14, 2012, at 1:14 PM, Erick Fonseca wrote:
Greetings,
I'm writing a script to read Wikipedia dump files and generate raw
text from them, much like it would appear in a web browser.
At first, I ignored all macros, discarding anything between {{ ... }},
but I soon learned that some macros generate useful text. Now I need a
comprehensive list of all existing macros, to know which of them I
should treat.
As I believe some macros are language dependent, I am dealing with the
Portuguese Wikipedia.
Thank you for any help,
Erick
_______________________________________________
Wikipedia-l mailing list
Wikipedia-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikipedia-l