On Tue, 29 Mar 2005 22:16:23 +0200, Daniel Wunsch the.gray@gmx.net wrote:
however, there are some political difficulties with using this - it's based on java, and java is not free.
Fair point. What about Plucene, the perl port of Lucene?
It is not nearly as mature, stable or feature rich as Lucene, but hey.
I plan to be playing with Plucene a bit over the next couple of months : one initial avenue of interest is some rough and ready benchmarks on speed/resource requirements. I was planning to use a local copy of the wikimedia text as a corpus for this testing.
What I don't want to do is duplicate any existing work...