Bryan Tong Minh wrote:
To get to a real working API the first thing we need
is to store the
meta data as author, license, etc in the database, rather than putting
it all together in one text field. You don't want an API that parses
text.
Here is something I have been thinking about for a while, which could make this
kind of storage feasible: <http://brightbyte.de/page/WikiData_light>. It's only
an idea at the moment, but I believe it would be doable without too much
trouble, and could be made to scale. It's less powerfull than full-fletched
WikiData or Semantic MediaWiki, but it's far less comple and much easier to
integrate with Wikipedia operations - and I believe it would be flexible and
powerful enough to be useful.
Oh, and just for the record, let me mention
<http://commons.wikimedia.org/wiki/Commons:Tag_categories> here. From what I see
people don't follow the "all templates must be in those categories
directly"
bit, and are using subcategories - so getting the right info takes a bit more
processing, but that, too, would be doable by evaluating the tag category
hierarchy every week or so. That would most probably be a toolserver-based solution.
-- Daniel