You might ask the folks at Freebase (http://www.freebase.com/) for help.
They gave a presentation at one of the SF Bay Area meetups recently
and described how they've managed to extract data about templates and
infoboxes from Wikipedia. I'm fuzzy on the details, but they can
probably help. I believe most of their code is open source.
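
On the sub-sub-sub-category problem raised below: counting templates
across a nested category tree is essentially a graph traversal. Here is
a rough sketch in Python, not a tested tool. The get_members callback is
hypothetical: it could wrap the MediaWiki API (list=categorymembers) or
a query against the categorylinks table in a dump. The toy data is made
up purely for illustration.

```python
from collections import deque


def collect_templates(root, get_members):
    """Breadth-first walk of a category tree, returning all templates found.

    get_members(category) is a hypothetical callback that returns a pair
    (subcategories, templates) for a single category.
    """
    seen = {root}       # categories already queued; guards against category loops
    templates = set()   # a set, because one template can sit in several categories
    queue = deque([root])
    while queue:
        category = queue.popleft()
        subcats, members = get_members(category)
        templates.update(members)
        for sub in subcats:
            if sub not in seen:
                seen.add(sub)
                queue.append(sub)
    return templates


# Toy stand-in for the real category graph (illustrative names only).
# Note the deliberate loop: the subcategory points back at its parent.
TOY = {
    "Category:Infobox templates": (
        ["Category:People infobox templates"],
        ["Template:Infobox"],
    ),
    "Category:People infobox templates": (
        ["Category:Infobox templates"],
        ["Template:Infobox person", "Template:Infobox"],
    ),
}


def toy_members(category):
    return TOY.get(category, ([], []))


found = collect_templates("Category:Infobox templates", toy_members)
print(len(found), sorted(found))
```

The "seen" set matters because Wikipedia categories can loop back on
themselves, and the result is a set so that a template filed under two
subcategories is only counted once. Edit counts and transclusion counts
would need separate lookups per template; this only covers the basic
count.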
-- Phoebe
On Wed, Sep 17, 2008 at 6:49 PM, Quse Guy <quseguy(a)yahoo.com> wrote:
Greetings all,
Please excuse me if my message is inappropriate for this list, but I'm
looking for a place to start and unsure where to begin.
I'm trying to get some sense of the scope of the Template namespace on the
English-language Wikipedia: anything from sheer numbers, to which templates
are the most edited, to which are the most used (either in terms of total
number of transclusions/What Links Here, or else actual number of "hits").
To be a bit more specific, I'm particularly interested in those Templates
which are in the following two categories:
* http://en.wikipedia.org/wiki/Category:Navbox_(navigational)_templates
* http://en.wikipedia.org/wiki/Category:Infobox_templates
But both of these categories consist of a large number of subcategories,
sub-sub-categories, sub-sub-sub-...categories, making it difficult to
attempt even a basic count of all the Navbox and Infobox templates. And
working out which Navboxes and Infoboxes are the most edited or the
most used would be impossible to do manually.
I haven't written an SQL statement since taking a database course in 1998,
so I'm wary of downloading one of the database dumps and attempting to
manipulate things on my own. Nor am I even certain that the data included
in the dumps would allow me to aggregate across sub-sub-sub...categories, or
derive edit counts or use counts.
Perhaps there's a GUI tool or interface that would be helpful in compiling
these stats? Or perhaps these statistics are readily available and I simply
haven't looked in the right places :)
Again, any advice on this matter would be most appreciated.
Regards,
David
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
--
- phoebe s. ayers | phoebe.ayers(a)gmail.com