On 18/11/12 13:36, Sumana Harihareswara wrote:
The Internet Archive wants to particularly make sure
to archive pages
that Wikipedians use as citations. A GSoC project last year got most of
the way to that goal but never quite finished making the feed of new
links for use by the Archive. Would anyone else like to take this up?
More information:
https://www.mediawiki.org/wiki/User:Kevin_Brown/ArchiveLinks
http://toolserver.org/~nn123645/toolserver-feed/cronscript.php (You
could ask Kevin to make his Toolserver project a MMP or you could just
write your own script.)
This is quite straightforward.
This is the longer plan, which is harder to do right. Although I see
code going in the right direction.
I'd go solving the problem with the toolserver MMP. It can be improved
later.
I see a potential problem of missing new content added to a page,
though. I'm not sure how Kevin expected to handle it. It's possible that
the archiver automatically recrawls then so it isn't needed (eg. IA vs
WebCite).