On 14/05/12 07:54 AM, Christoph Lauer wrote:
Am 14.05.2012 16:11, schrieb Lars Aronsson:
On 2012-05-14 15:44, Christoph Lauer wrote:
The connection in the german wiktionary is a little different, there the link to the base form is under "Grammatische Merkmale" (grammatical properties), and in the base form of the verb the noun "Geburt" (birth) is found under "Abgeleitete Begriffe" (derived terms). I would be very happy if these informations could be extracted into the dbpedia-wiktionary, in a unified way for all languages.
If you look around the various languages of Wiktionary, you will find that German is the exception. Most languages follow the pattern of the English Wiktionary. If you want things to work the same way for all languages, the German Wiktionary would need to be restructured from scratch. This is not likely to happen.
Still, the entry for bear (English Wiktionary, etymology 2, verb) does list "born" as the participle near the headword. There is also a list ofderived terms (bear down, bear up, ...), it just doesn't list "birth" yet, but I think you are free to add it.
Thanks for the information. Too bad the german wiktionary makes such exceptions there, it's the wiktionary I wanted to use :-( However my central problem was that none of these informations aren't available in the RDF dumps or through the SPARQL endpoint http://wiktionary.dbpedia.org/sparql, neither born -> bear, nor bear -> birth/give birth, I thought maybe someone knows if there are plans to import these informations. Does the project, which creates the dumps, has a name anyway? Like dbpedia, the project creating the dumps from wikipedia.
Yes, there is no 'official' project which has developed structured metadata for terms. There are a couple of projects which are working with such information, the most mature of which is proprietary but the JWKTL has a rich output.
The Wikipedia project itself has begun to implement semantic data via microformats[1]. Microformats embed some semantic structures directly into articles in a machine-readable manner without affecting the display of the article.
The primary method of applying microformats in Mediawiki software is via the templates which we use. This seems like an obvious and simple way for Wiktionaries - which make extensive use of templates to display already-structured information - to add machine-readable structures to our content. There are currently browser plugins/extensions for users to take advantage of this added layer of data, plus of course websites and webapps and other third-party ventures.
Amgine