Hi Gerd, Apparently SPARQL has problems with umlauts; you'll have to use the Unicode escape sequence for the letter "ä" instead (\u followed by four hexadecimal digits). Regards, Christoph
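To illustrate the escape sequence mentioned above, here is a minimal Python sketch. The helper name sparql_escape_non_ascii is made up for this example, and whether the endpoint's bif:contains accepts the escaped form is an assumption that has not been tested here. It replaces each non-ASCII character with its \uXXXX form (\UXXXXXXXX for code points above U+FFFF) and then builds the query:

```python
def sparql_escape_non_ascii(text: str) -> str:
    """Replace every non-ASCII character with a \\uXXXX or \\UXXXXXXXX escape.

    Hypothetical helper for illustration; not part of any SPARQL library.
    """
    parts = []
    for ch in text:
        cp = ord(ch)
        if cp < 128:
            parts.append(ch)                      # plain ASCII stays as-is
        elif cp <= 0xFFFF:
            parts.append("\\u{:04X}".format(cp))  # four hex digits
        else:
            parts.append("\\U{:08X}".format(cp))  # eight hex digits
    return "".join(parts)


# Build the query from the thread with the escaped search term.
term = sparql_escape_non_ascii("häuser")
query = 'SELECT * WHERE { ?s ?p ?o FILTER(bif:contains(?o, "%s")) }' % term
print(query)
```

For "ä" (U+00E4) the escaped search term becomes h\u00E4user.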
Hi Christoph,
thanks for your reply! In the meantime we'll investigate here at SWC as well and let you know.
btw: Virtuoso returns an error when querying the endpoint (http://wiktionary.dbpedia.org/sparql) with the expression below. Is that an encoding issue?
SELECT * WHERE { ?s ?p ?o FILTER(bif:contains(?o, "häuser")) }
Regards, Gerd
----- Original Message ----- From: "Christoph Lauer" dbpedia@online.ms To: "The Wiktionary (http://www.wiktionary.org) mailing list" wiktionary-l@lists.wikimedia.org Sent: Thursday, 31 May 2012 17:12:16 Subject: Re: [Wiktionary-l] Extracting German noun forms
Hi,
I'd like to extract German noun forms (Kasus and Numerus) but didn't find this data in the provided dumps.
Example: http://de.wiktionary.org/wiki/Haus
I need the data from the box:

Kasus      Singular             Plural
Nominativ  das Haus             die Häuser
Genitiv    des Hauses           der Häuser
Dativ      dem Haus, dem Hause  den Häusern
Akkusativ  das Haus             die Häuser
Any idea how to get this? Perhaps with a SPARQL query?
regards, Gerd
Hi Gerd, I suppose you mean the DBpedia dumps of Wiktionary, because the Wiktionary XML dumps do contain the box data. If so, you're right: unfortunately the noun forms are not in there. A SPARQL query won't help you either, since it will only return the same information that is in the dumps. To add this information you would have to write a template for the "Entry Layout" as explained on the DBpedia website, but I'm not an expert on that. Maybe Jonas can tell you more about it, or whether it's even possible. Sorry I can't help you any further :-)
Wiktionary-l mailing list Wiktionary-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiktionary-l