Hi,
I'm trying to index the full wikipedia article dump (XML) using the Lucene-search Extension.
I ran the bundled indexer scripts, and it ran for ~7 days before basically stalling about half way through.
I have read of people doing the full build in a couple of hours.
Can anyone point me towards any resources to reasons my setup might be so slow?
I'm on a fairly high-end server, so I don't think it's a hardware issue.
Thanks, Barry