2008/7/30 David Katz dkatz2001@gmail.com:
On Wed, Jul 30, 2008 at 7:45 AM, Chris Howie cdhowie@gmail.com wrote:
On Wed, Jul 30, 2008 at 3:12 AM, David Katz dkatz2001@gmail.com wrote:
Would this prevent mirror sites from 'scraping' noindex pages? This would be a definite improvement as talk pages are often a depository of libel and whilst we can remove and oversight material on wp pages we're powerless to do anything on a mirror
I believe most mirrors use an actual database dump (though some might scrape). In that case they would get all the content on Wikipedia, wherever it is.
Is there anyway we can set things so that mirrors only mirror the actual namespace content and not talk, user and WP administrative pages?
Nope. Tthere are a couple of ways you could do it in in theory but the side effects would be unacceptable.