On 12/22/06, Gregory Maxwell gmaxwell@gmail.com wrote:
Go ahead: Write the software, make it good, make it scale, make it robust so that you don't have to constantly twiddle with it to keep it working.
http://en.wikipedia.org/wiki/User:Wherebot
I have no doubt that Google's ratelimit can be worked out. I promise you that good work done towards these ends will not be work wasted. Make sure that it's sufficently modular that we'll be able to use it to generate queries against other texts sources.
Other text sources for the most part would be best run offline against database dumps since most of them would involve runs agaist data that cannont be freely acessed on the web.