Am 14.05.2012 16:11, schrieb Lars Aronsson:
On 2012-05-14 15:44, Christoph Lauer wrote:
The connection in the german
wiktionary is a little different, there the link to the base form is
under "Grammatische Merkmale" (grammatical properties), and in the base
form of the verb the noun "Geburt" (birth) is found under "Abgeleitete
Begriffe" (derived terms).
I would be very happy if these informations could be extracted into the
dbpedia-wiktionary, in a unified way for all languages.
If you look around the various languages of Wiktionary, you will
find that German is the exception. Most languages follow the
pattern of the English Wiktionary. If you want things to work the
same way for all languages, the German Wiktionary would need
to be restructured from scratch. This is not likely to happen.
Still, the entry for bear (English Wiktionary, etymology 2, verb)
does list "born" as the participle near the headword. There is also
a list ofderived terms (bear down, bear up, ...), it just doesn't
list "birth" yet, but I think you are free to add it.
Thanks for the information. Too bad the german wiktionary makes such
exceptions there, it's the wiktionary I wanted to use :-(
However my central problem was that none of these informations aren't
available in the RDF dumps or through the SPARQL endpoint
, neither born -> bear, nor bear ->
birth/give birth, I thought maybe someone knows if there are plans to
import these informations. Does the project, which creates the dumps,
has a name anyway? Like dbpedia, the project creating the dumps from
Yes, there is no 'official' project which has developed structured
metadata for terms. There are a couple of projects which are working
with such information, the most mature of which is proprietary but the
JWKTL has a rich output.
The Wikipedia project itself has begun to implement semantic data via
microformats. Microformats embed some semantic structures directly
into articles in a machine-readable manner without affecting the display
of the article.
The primary method of applying microformats in Mediawiki software is via
the templates which we use. This seems like an obvious and simple way
for Wiktionaries - which make extensive use of templates to display
already-structured information - to add machine-readable structures to
our content. There are currently browser plugins/extensions for users to
take advantage of this added layer of data, plus of course websites and
webapps and other third-party ventures.