Thanks, this is great fun! As an Italian, let me quote:

(0.42525520906166969, (7151, 3041, 59, 514, 63, 2519, 955), 'Penis')
(0.42516069788797062, (1089, 463, 29, 27, 16, 470, 84), 'Inner core')
(0.42490272373540855, (1285, 546, 11, 64, 27, 515, 122), 'Stuff')
(0.42477231329690346, (2745, 1166, 28, 110, 46, 1054, 341), 'Gun')
(0.42474916387959866, (2990, 1270, 37, 149, 23, 1190, 321), 'Monkey')
(0.42443438914027148, (1105, 469, 20, 21, 2, 427, 166), 'Incas')
(0.42433090024330899, (2055, 872, 39, 45, 15, 825, 259), 'Italian Renaissance')
(0.42375950742484608, (2761, 1170, 34, 94, 24, 978, 461), 'Watermelon')
(0.42362613587191694, (2311, 979, 22, 121, 19, 937, 233), 'Puppy')
(0.4235686492495831, (1799, 762, 20, 83, 34, 669, 231), 'Crap')

It is absolutely great to see that the Italian Renaissance (along with the Incas) is one of the few cultural topics that make it as high in the list as the usual excrement, sex, and infantile kind of things!!

Luca

On Fri, Aug 13, 2010 at 1:12 PM, Dmitry Chichkov <dchichkov@gmail.com> wrote:
If anybody is interested, I've made a list of the 'most reverted pages' in the English Wikipedia, based on an analysis of the enwiki-20100130 dump. Here is the list:
http://wpcvn.com/enwiki-20100130.most.reverted.tar.bz
http://wpcvn.com/enwiki-20100130.most.reverted.txt

This list was calculated using the following sampling criteria:
* All pages from the enwiki-20100130 dump;
** Kept only pages with more than 1000 revisions;
** Kept only pages with revert ratios > 0.3;
* Sorted by revert ratio, in descending order.
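
The criteria above can be sketched in Python roughly as follows. This is only an illustration, not the code that produced the list. The PageStats record, its field names, and the most_reverted function are hypothetical. The sketch also assumes that the revert ratio is reverts divided by total revisions, which is consistent with the first two numbers in the tuples quoted at the top of this thread (for example 3041 / 7151 for 'Penis'). The last two sample rows are made up to show the filters rejecting a page.

```python
from typing import NamedTuple


class PageStats(NamedTuple):
    # Hypothetical record; the real per-page data has more fields.
    title: str
    revisions: int
    reverts: int


def most_reverted(pages, min_revisions=1000, min_ratio=0.3):
    """Keep heavily edited, heavily reverted pages, highest ratio first."""
    rows = []
    for page in pages:
        # Criterion 1: more than `min_revisions` revisions.
        if page.revisions <= min_revisions:
            continue
        # Assumption: ratio = reverts / revisions.
        ratio = page.reverts / page.revisions
        # Criterion 2: revert ratio above `min_ratio`.
        if ratio > min_ratio:
            rows.append((ratio, page.title))
    # Criterion 3: descending revert ratio.
    rows.sort(key=lambda row: row[0], reverse=True)
    return rows


sample = [
    PageStats("Inner core", 1089, 463),
    PageStats("Penis", 7151, 3041),
    PageStats("Made-up quiet page", 1500, 30),   # illustrative: ratio too low
    PageStats("Made-up short page", 200, 150),   # illustrative: too few revisions
]

for ratio, title in most_reverted(sample):
    print(f"{ratio:.4f}  {title}")
```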

A page revision is considered to be a revert if there is a previous revision with a matching MD5 checksum.
BTW, if anybody needs it, the Python code that identifies reverts, revert wars, self-reverts, etc. is available (LGPL).
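
For readers who just want the idea, the checksum rule can be sketched in a few lines of Python. This is a minimal illustration and not the LGPL code mentioned above. The find_reverts function and the sample history are hypothetical, and the sketch assumes the full text of each revision of one page is available in chronological order.

```python
import hashlib


def find_reverts(revision_texts):
    """Return indices of revisions whose MD5 matches an earlier revision.

    `revision_texts` is assumed to hold the full text of each revision
    of a single page, oldest first.
    """
    seen = set()
    reverts = []
    for index, text in enumerate(revision_texts):
        digest = hashlib.md5(text.encode("utf-8")).hexdigest()
        if digest in seen:
            # Same checksum as some earlier revision: count it as a revert.
            reverts.append(index)
        else:
            seen.add(digest)
    return reverts


# Made-up history of one page, oldest revision first.
history = [
    "good text",
    "vandalized",
    "good text",            # restores revision 0
    "good text plus edit",
    "vandalized",           # restores revision 1
]

print(find_reverts(history))  # [2, 4]
```

Note that under this rule a revision that restores vandalized text also counts as a revert, since only the checksum match matters.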

-- Regards, Dmitry

_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l