if you're interested in accessing the "Abgeleitete Begriffe" (derived terms)
from the German Wiktionary, you could use the JWKTL software . It is a Java library for
parsing German and English Wiktionary dump files and accessing much of the information
encoded in Wiktionary in a structured way.
Hope it helps!
[wiktionary-l-bounces(a)lists.wikimedia.org]&quot; im Auftrag von "Christoph
Gesendet: Montag, 14. Mai 2012 15:44
Betreff: [Wiktionary-l] more grammatical information in the extraction framework
Hello everyone, sorry for my partially bad english.
I would like to use the dbpedia-based wiktionary framework (through
dumps or online access) for a program I'm developing, which (trys to)
extract the article name and property from a natural language question
and queries the dbpedia for the answer. Particularly interesting would
be the extraction of a property from a verb, e.g. if I ask when someone
was "born", the program has to find a connection to the noun "birth",
which can then be further processed. Wiktionary provides that link by
first relating to the base form "to bear" and there under "Etymology 2 -
Verb" the transitive meaning "give birth". The connection in the german
wiktionary is a little different, there the link to the base form is
under "Grammatische Merkmale" (grammatical properties), and in the base
form of the verb the noun "Geburt" (birth) is found under "Abgeleitete
Begriffe" (derived terms).
I would be very happy if these informations could be extracted into the
dbpedia-wiktionary, in a unified way for all languages. Unfortunately
I'm not such an expert programmer, it would probably take weeks until I
found my way through mercurial, maven and the entire source code of the
framework to do the extraction myself, so I was hoping, someone with
more experience with the framework could do it (if it doesn't take weeks
of work :-) ). Thanks in advance!
Wiktionary-l mailing list