[Mediawiki-l] Indexing Wikipedia with Solr/Lucene
vineet.yadav.iiit at gmail.com
Sun May 13 18:53:27 UTC 2012
I want to create Lucene/Solr index of wikipedia xml dump. I used Solr
to index wikipedia xml dump. Since in wikipedia, Category and external
links are part of wikipedia text, I am not able to index category and
external links separately. I want to index Category, Externals
links etc separately and store them in separate fields.
Would anyone please be kind enough to give me a bit of advice?
More information about the MediaWiki-l