Hello all.
I'm putting the finishing touches on a script that exports the wikipedia in a format that can be directly imported to Yahoo!'s (and other's) search engine. It's nothing pretty (in fact, it's my first PHP), but I'd be grateful if 2 things would happen:
* Someone would look at it (I attached it) and say "this sucks because..."
* Someone would give me cvs write access, so I could add the file to the repository.
Eventually, this should be on a cron, updating periodically so that the search engines which use these results stay up-to-date. I saved a copy of the spec here:
http://www.bomis.com/idif/spec.pdf
I didn't come up with any good way of getting keywords for a given page. Using the linked page titles was a suggestion. Other ideas?
Thanks.