Kirill Lokshin wrote:
On 7/11/06, Magnus Manske <magnus.manske(a)web.de> wrote:
Are we preferring {{Persondata}} or {{Infobox biography}}?
Should they be merged, somehow?
I'm asking this because I'm writing a tool to generate Persondata from
article text as copy&paste text. It can scan whole categories at once.
One ({{Persondata}}) is raw metadata, but is applicable to _all_
biographies; the other ({{Infobox biography}}) is designed for display
to users, but has been replaced in certain types of biographies
(politicians, royalty, military leaders) with more specialized
templates.
I'll stick to Persondata, then.
Ideally, we could have a tool that would be able to parse at least the
major infobox types and fill out the persondata fields; but I'm not
sure how much work it would be, considering that there is a certain
variation in how different infoboxes deal with particular data.
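
As a rough illustration of the infobox-to-Persondata idea above, here is a minimal Python sketch. It is not the tool discussed in this thread. The infobox parameter names in FIELD_MAP (name, birth_date, and so on) and the helper names are assumptions made up for the example; real infoboxes differ in how they name and format these values, which is the variation mentioned above. The parser only handles simple one-line "| key = value" parameters.

```python
import re

# Hypothetical mapping from infobox parameter names to Persondata fields.
# The infobox parameter names are assumptions for illustration only;
# each real infobox type would need its own mapping.
FIELD_MAP = {
    "name": "NAME",
    "birth_date": "DATE OF BIRTH",
    "birth_place": "PLACE OF BIRTH",
    "death_date": "DATE OF DEATH",
    "death_place": "PLACE OF DEATH",
}

# Field order used when writing out the Persondata template.
PERSONDATA_ORDER = [
    "NAME", "ALTERNATIVE NAMES", "SHORT DESCRIPTION",
    "DATE OF BIRTH", "PLACE OF BIRTH", "DATE OF DEATH", "PLACE OF DEATH",
]

# Matches a single-line template parameter such as "| name = Jane Doe".
PARAM_RE = re.compile(r"^\s*\|\s*([^=|]+?)\s*=\s*(.*?)\s*$")


def parse_infobox_params(wikitext):
    """Collect one-line '| key = value' parameters.

    Nested templates and multi-line values are not handled.
    """
    params = {}
    for line in wikitext.splitlines():
        match = PARAM_RE.match(line)
        if match:
            params[match.group(1).lower()] = match.group(2)
    return params


def to_persondata(wikitext):
    """Build Persondata text for copy&paste; unknown fields stay empty."""
    params = parse_infobox_params(wikitext)
    fields = {FIELD_MAP[k]: v for k, v in params.items() if k in FIELD_MAP}
    lines = ["{{Persondata"]
    for name in PERSONDATA_ORDER:
        lines.append("|%s=%s" % (name, fields.get(name, "")))
    lines.append("}}")
    return "\n".join(lines)
```

Fields the infobox does not supply are left blank rather than guessed, so the output is only a draft for an editor to check and complete by hand.
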
I've done raw text extraction for the German Personendaten. While the en
version is still experimental, it works OK in many cases:
http://tools.wikimedia.de/~magnus/persondata.php?category=1984_deaths
I could add popular infoboxes (any suggestions?) and German
Personendaten, if a de article exists.
Magnus
Hello Magnus,
Could you explain a little more about how you plan to use the tool?
I have some concerns about using it on biographies of living people on Wikipedia-en. We
have too much unverified information added to these articles. I strongly believe that each
article needs to be examined closely to verify that its content meets our Wikipedia:BLP
guidelines before that content is added to a template.
Regards,
Sydney Poore