2010/7/28 Lars Aronsson <lars@aronsson.se>
Wiktionary can need many things, coverage of common
words as well as examples of how to use uncommon words.
From the Swedish Wikisource, I extracted the body text and
made a word frequency list,
This is very interesting. Can you tell us more about it? Has the job been documented somewhere (in English; Swedish is "a little difficult" for me...)? I can produce lists with my rough script, but it works on raw wiki code and the result is "dirty": it contains markup words, and obviously all the wrong words too (searching for wrong words was my first aim...). Did you perhaps work on an HTML dump?
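
To make the "dirty" problem concrete, here is a minimal sketch of a word count on raw wiki code with some crude markup stripping first. It is only an illustration under stated assumptions, not the rough script mentioned above and not the method used for the Swedish Wikisource list: the function names strip_markup and word_frequencies are hypothetical, and the regular expressions cover only simple, non-nested markup.

```python
import re
from collections import Counter


def strip_markup(wikitext):
    """Crudely remove common wiki markup (hypothetical helper, not exhaustive)."""
    # HTML comments
    text = re.sub(r"<!--.*?-->", " ", wikitext, flags=re.DOTALL)
    # Simple, non-nested templates such as {{name|arg}}
    text = re.sub(r"\{\{[^{}]*\}\}", " ", text)
    # HTML-like tags such as <br/> or <ref>; the text between tags is kept
    text = re.sub(r"<[^>]+>", " ", text)
    # Internal links: keep the label of [[target|label]] or the target of [[target]]
    text = re.sub(r"\[\[(?:[^\]|]*\|)?([^\]]*)\]\]", r"\1", text)
    # External links: keep the label of [http://example.org label]
    text = re.sub(r"\[https?://\S+\s*([^\]]*)\]", r"\1", text)
    # Bold and italic quote markers
    text = re.sub(r"'{2,}", "", text)
    return text


def word_frequencies(wikitext):
    """Count lowercase words (runs of letters) in the stripped text."""
    words = re.findall(r"[^\W\d_]+", strip_markup(wikitext).lower())
    return Counter(words)
```

Calling word_frequencies(page_text).most_common(50) would then give the 50 most frequent words of one page. Nested templates, tables and parser functions would still leak through, which is why working from an HTML dump may give cleaner results.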
--
Alex