I've installed my new faster wikipedia search engine.
This searches only the titles of the articles, not the body. You can still search the body by clicking onward to use the original search engine.
It's marginally clever. Search for 'jargon file' and 'poker jargon' and the results are sort of good.
If I had a perl function with the following specifications, I could easily turn this into a much better full-text search engine...
$PageText = &GetPageText("Computer_Jargon");
This ought to be simple, but for the life of me I have been unable to figure out how to parse Cliff's *.db files.
I suppose I could key off of the html cache files, but it sure would be nice to be able to update the database straight from the horse's mouth.
My code is or will be released under the GPL by, oh, Friday let's say. In the meantime, I'm too embarassed to let anyone see it. I took a search engine program or two from some other site or sites that I run and hacked them mercilessly. There's all kinds of useless variables and stuff.
I'll clean it all up and release it on Friday, but it's nothing very special.
wikipedia-l@lists.wikimedia.org