A machine generated thesaurus constructed from interwiki and wiki links
and tags contained in the XML dumps has been posted at
This build analyzes the 20070206 enwiki dumps and contructs a thesaurus
based upon relationships between wiki links and
interwiki links contained within the XML dumps. Included are raw files
of links, lexicon, and XML dump created based
upon the embedded Thesaurus inside of Wikipedia.
Text file of stripped links and interwiki links with tags:
Machine generated text lexicon of stripped links and interwiki links:
Machine Generated XML MediaWiki dump which can be imported as a basic
Thesaurus has a few title problems, I will finish it
up in the morning and post to the thesarus area.