

I'm trying to index the full Wikipedia article dump (XML) using the Lucene-search extension.


I ran the bundled indexer scripts, and they ran for about 7 days before stalling roughly halfway through.


I have read of people doing the full build in a couple of hours.


Can anyone point me to resources that might explain why my setup is so slow?


I'm on a fairly high-end server, so I don't think it's a hardware issue.

