On Wed, Oct 24, 2007 at 05:32:43PM -0400, Jonathan Nowacki wrote:
I'd like to write a script that will mine the
media wiki for broken or
altered links. Once found the webmaster will be alerted to their presence.
I should have no problem writing this script in PHP, Perl or even Python but
I will need some help accessing the most current form of the text in the
media wiki. Does anyone know how to mine the most current text while
ignoring the rest?
Sure; you don't do it that way.
:-)
There's a table into which all external links go; you should be able to
just walk that to check for link rot.
Cheers,
-- jra
--
Jay R. Ashworth Baylink jra(a)baylink.com
Designer The Things I Think RFC 2100
Ashworth & Associates
http://baylink.pitas.com '87 e24
St Petersburg FL USA
http://photo.imageinc.us +1 727 647 1274