Re: [Abstract-wikipedia] Use case: generation of short description

8 Jul 2020

      Hi Jakob,
thank you for reaching out! You got some answers for the demonyms.
Regarding how to best learn from this for the Abstract Wikipedia: as you
say yourself, the code itself is just a few lines - but they come with a
lot of interesting lessons attached to it.
As you start doing that for an increasing number of languages we will run
into interesting issues, such as differences by grammatical gender being
expressed in some but not in other languages, how to deal with historical
countries, how to deal with cases as Marie Curie, Tesla, etc.
I think capturing these in an essay either on your web site or on wiki can
be very valuable.
Again, I think the code itself will be short and can be quickly rewritten -
but the lessons you learned on the way will too easily be forgotten if not
captured.
Thank you,
Denny
On Sat, Jul 4, 2020 at 8:06 AM Jakob Voß jakob.voss@gbv.de wrote:
...
Hi,
I want to auto-generate disambiguation description for African
politicians to be added to Wikidata, e.g. from the country Mozambique
(Q1029) the following descriptions should be generated:
Mozambican politician (en)
Mosambikanischer Politiker (de)
politico mozambicano (it)
...
This could be extended to other professions. My questions:

Can anyone point me to data sources where to best look up country

adjectives such as "Mozambican"?

Where/how to best store the lexical information for best reuse with

other renderers

If a create small renderers for this short descriptions, what

architecture do you prefer for best reuse?
My just-get-it-done solution would be a set of CSV files and a few lines
of Perl code, but maybe this use case can be aligned with Abstract
Wikidata to better learn about it.
Looking forward to collaborate,
Jakob

Abstract-Wikipedia mailing list
Abstract-Wikipedia@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/abstract-wikipedia

2024

2023

2022

2021

2020

Re: [Abstract-wikipedia] Use case: generation of short description