draicone wrote:
Hello all,
I'm in the process of building my first tool on ts, a text copyright violation checker similar to Copyscape (http://copyscape.com). However, it is currently so resource intensive that I'm worried I'll crash the toolserver with it. Basically, it is a PHP script doing the following:
I'd be interested in trying it. I have a similar bot for new pages. Basically, it checks whether each new page passes the "Google test". I'm quite satisfied with it: most false positives are authorised copies or articles that are too short, but comparing it with yours would be interesting.