On Dec 6, 2007 4:28 PM, Emufarmers Sangly emufarmers@gmail.com wrote:
That sounds extremely promising. How did you integrate mnoGo with your MediaWiki installation? If you would be willing to provide a step-by-step guide on MediaWiki.org, it would be very helpful (for me, anyway, and hopefully for other people!).
I should do this, your right. I don't work there anymore in the same capacity actually and rewriting all this for FOSS would be a good project to do now that I have the 'free time'. Lot of things I didn't like in that system. But this is a closer rundown of what I wrote:
SpecialScan * Created a Special page that handles the 'front end' functionality akin to the mnoGo scripts * Implemented functionality in the above to handle the new page creation workflow (asdf is not found, create? * This is click tracked so we know when people use it and what they searched for (like google basically) * Altered the Monobook to use the above over the original. * I think initially mod_rewrite was used as a cheap hack as well.
Backend: * Use perl and tt to create a indexer file on demand and launch the indexer. taking a peak at the mediawiki database to grab namespaces and other hints quick. * the indexer does its normal multiple burns through of the wiki under the two major user ids public and private. the private one is limited to just single namespace in this case though so its not a full run. * in mnoGo you can make 'datasets' of various regex url patterns to search, so adjusting the result weighting or the revisit time of pages like ^/w/Talk: can be useful if your into that sort of thing. * special care to follow interwiki links to a limited degree and peek into the mediawiki data for 'okaying' them.
SpecialScan/Terms * Special page for admins, since quieries are farmable making a special page for 'what did people most search for' and some cheap guesses at common failed searches goes here.
SpecialMnogo/Config * Special page for admin people to tune the indexing process to some degree, purge them all, manually kick it off etc ... so I dont have to be involved. * Also handle the external urls here (mailing lists, and interwiki limiters)
SpecialMnogo/Logs * Special page for admin people to see the indexing logs for what its worth