Kirill Lokshin schrieb:
On 7/11/06, Magnus Manske <magnus.manske(a)web.de>
wrote:
Are we prefering {{Persondata}} or {{Infobox
biography}}?
Should they be merged, somehow?
I'm asking this because I'm writing a tool to generate Persondata from
article text as a copy&paste text. Can scan whole categories at once.
One ({{Persondata}}) is raw metadata, but is applicable to _all_
biographies; the other ({{Infobox biography}}) is designed for display
to users, but has been replaced in certain types of biographies
(politicians, royalty, military leaders) with more specialized
templates.
I'll stick to Persondata, then.
Ideally, we could have a tool that would be able to
parse at least the
major infobox types and fill out the persondata fields; but I'm not
sure how much work it would be, considering that there is a certain
variation in how different infoboxes deal with particular data.
I've done raw text extraction for the German Personendaten. While the en
version is still experimental, it works OK in many cases:
http://tools.wikimedia.de/~magnus/persondata.php?category=1984_deaths
I could add popular infoboxes (any suggestions?) and German
Personendaten, if a de article exists.
Magnus