[Toolserver-l] Wiktionary table of contents

Lars Aronsson lars at aronsson.se
Wed Aug 31 17:23:40 UTC 2011


Has anybody compiled a list of all entries in Wikitionary? I know
there are XML dumps for page titles, e.g.
http://download.wikimedia.org/enwiktionary/20110827/enwiktionary-20110827-all-titles-in-ns0.gz

But I was thinking on a dictionary entry level, e.g. the word
"snigel" is available on en.wiktionary as a Norwegian Nynorsk
entry, and on fr.wiktionary and pl.wiktionary as a Swedish entry.
This could be expressed as a table with 3 columns:

  Entry   Site Language
  snigel  en   nn
  snigel  fr   sv
  snigel  pl   sv

An additional table could indicate the date when each Site's
XML dump was used to update this large table of entries.

Is anybody doing this already?


-- 
   Lars Aronsson (lars at aronsson.se)
   Aronsson Datateknik - http://aronsson.se





More information about the Toolserver-l mailing list