On 8 Apr 2003 at 11:20, Brion Vibber wrote:
On Tue, 2003-04-08 at 08:34, Krzysztof P. Jasiutowicz wrote:
Googling for something for the Polish Wikipedia I discovered that the test site test.wikipedia.com is indexed by Google robots even with edit pages.
We surely don't want this to happen ?
I forgot that even exists... That's on the old server, which I have no control over. It should be removed entirely, or at least the DNS pointed at the new server.
I know that this issue is like a yo-yo, but maybe there's something more that admins can do. Somebody at Polish Wikipedia, looking for wibrator site:org in Google, found this: pl.wikipedia.org/w/wiki.phtml?title=Wibrator&action=edit And this is not the way we want to welcome newcomers, right?
Another issue: looking at http://pl.wikipedia.org/robots.txt we can see (caution: the address below may cause misunderstanding but that's what one can see!)
# robots.txt for http://www.wikipedia.org/
User-agent: * Disallow: /wiki/Special:Maintenance Disallow: /w/
If so, then how _our_ Special pages are excluded from indexing? To remind you, our namespace is called Specjalna, so I suggest replacement:
Disallow: /wiki/Specjalna:Maintenance
BTW, why don't we put just
Disallow: /wiki/Specjalna:
Regards Youandme