Hello all:
I was interested in this exchange, since I've never been entirely sure how to do this:
But in the interest of short URLs, I serve my MediaWiki directly from the site root (/), without any /wiki/ or /w/ directories, so the above method would not work on my installation.
Any ideas on how I can exclude robots from crawling all my wiki's edit, history, talk, etc. pages *without* excluding its article pages?
Excluding index.php via robots.txt should work, as long as article links on your site look like http://mysite.tld/My_Page.
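For reference, a minimal robots.txt along those lines might look like this (assuming short article URLs served from the root, with index.php handling edit, history, and other actions):

    User-agent: *
    Disallow: /index.php

Since Disallow rules are prefix matches, this blocks URLs such as /index.php?title=My_Page&action=edit while leaving short article URLs like /My_Page crawlable.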
So, what do you do if the wiki is in the root directory rather than a subdirectory and you're using ugly URLs?
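As far as I can tell, with ugly URLs the article page and its action pages share the same /index.php prefix, e.g.:

    http://mysite.tld/index.php?title=My_Page
    http://mysite.tld/index.php?title=My_Page&action=edit

so a prefix-based Disallow on /index.php would block the articles too. The only pattern-based workaround I'm aware of is the wildcard extension that some crawlers (Googlebot, for one) honor, something like:

    User-agent: Googlebot
    Disallow: /*action=

but wildcards aren't part of the original robots.txt standard, so other robots may ignore that rule entirely.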
Thanks,
~Tricia webmaster@prwatch.org