[Foundation-l] How much of Wikipedia is vandalized? 0.4% of Articles

Brian Brian.Mingus at colorado.edu
Thu Aug 20 17:43:42 UTC 2009


On Thu, Aug 20, 2009 at 11:23 AM, Erik Zachte <erikzachte at infodisiac.com>wrote:

> There is another way to detect 100% reverts. It won't catch manual reverts
> that are not 100 accurate but most vandal patrollers will use undo, and the
> like.
>
>
>
> For every revision calculate md5 checksum of content. Then you can easily
> look back say 100 revisions to see whether this checksum occurred earlier.
> It is efficient and unambiguous.
>
>
>
> This will work for any Wikipedia for which a full archive dump is
> available.
>
>
>
>
> Erik Zachte
>

Luca's WikiTrust could easily reveal this info.



More information about the wikimedia-l mailing list