[MediaWiki-l] Permanently remove old revisions and unused files?

Daniel Barrett danb at cimpress.com
Fri Feb 5 16:28:49 UTC 2016


What business problem are you trying to solve by deleting old revisions of articles? They don’t take up much disk space, and they aren’t visible unless you intentionally go looking for them (with the View History tab).  Is the problem just personal taste -- you don't like seeing so many revisions -- or is there some other business reason? Note: If you don’t want users to see revisions at all, you could hide the View History tab with a line in Mediawiki:Vector.css (or Common.css), at least as a first step:

   #ca-history { display:none; }

If the old versions are a security risk, there is feature to hide (not delete) particular revisions: https://www.mediawiki.org/wiki/Manual:RevisionDelete.

Regarding removal of unused, uploaded files, here is a SQL query that (I believe) lists all unused files that are more than 90 days old. (Critiques are welcome.) You can then feed the list to the script "maintenance/deleteBatch.php" supplied with Mediawiki to delete them.

 concat('File:', p.page_title) as 'unused file'
 wp_page p
 left outer join wp_imagelinks il on (il.il_to = p.page_title)
 inner join wp_image i on (i.img_name = p.page_title)
 il.il_to is null
 and datediff(now(), i.img_timestamp) > 90


From: MediaWiki-l [mailto:mediawiki-l-bounces at lists.wikimedia.org] On Behalf Of Mickey Feldman
Sent: Thursday, February 04, 2016 3:38 PM
To: mediawiki-l at lists.wikimedia.org
Subject: [MediaWiki-l] Permanently remove old revisions and unused files?

I have been looking for an extension or process to remove all revisions 
of pages "older than _date_" or "all but the last _n_", but have not 
found anything close.

This is a private corporate wiki used for internal documentation. Pages 
evolve, but then generally stabilize and are then only for reference and 
rarely edited. There is no need to keep the 100's of revisions that grew 
them to their final form.

Likewise, there are older and unused versions of uploaded files that are 
just clutter.

Extension:Nuke does not meet this need.
Extension:DeleteBatch doesn't either.
Extension:DeletePagePermanently - nope.

There are maintenance scripts for Deleting Archived revisions and 
purging old text - also not what I'm looking for.

So far I'm finding no way to do this other than manually, one page at a 
time, which is a no go. There are 10s of thousands of pages.

I may have to write a new extension from scratch, but I'm finding it 
hard to believe this functionality does not already exist.

Have I overlooked something obvious? Am I the only one who has wanted 
something like this?

Thanks in advance.

M. Feldman


Vigil Health Solutions Inc.

MediaWiki-l mailing list
To unsubscribe, go to:

More information about the MediaWiki-l mailing list