[WikiEN-l] Real-time mirrors as a net revenue _source_?

Neil Harris neil at tonal.clara.co.uk
Sun Apr 9 17:07:32 UTC 2006


Real-time mirrors seem to be a recurring phenomenon. They are a drain on 
Wikipedia's resources, and hunting them and shooting them down is a 
continuing battle.

The reasoning behind these mirrors appears to be:

1 putting up a Wikipedia mirror with ads will make money...
2 too lazy to set up a proper mirror...
3 instead, set up a script that queries Wikipedia in real time...
4 profit!

However; why not turn this on its head, and offer a real-time, or 
near-real-time, Wikipedia feed service to paid-up subscribers?

Currently, Wikipedia's running costs are about $1.2M per year, and this 
pays for, among other things, serving about 4000 hits per second, that 
is to say, about 1.26 x 10^11 hits per year, or about $ 10^-5 per hit. 
(Of course, this is average gross cost; marginal cost will be 
significantly higher, say $ 10^-4 per hit).

Web advertising rates are generally of the order of $1 CPM: that is, $ 
10^-3 per hit. If an advertiser manages to get 10,000,000 hits per year, 
they will make $10,000 in ad revenue, and costs the Wikimedia Foundation 
around $1000 in leeched server load.

What if we were to turn things round, and charge (say) $ 2 x 10^-4 per 
hit for an official real-time mirror service? (Of course, this would be 
aggregated in lumps, because it's impossible to bill tiny fractions of a 
dollar). Now, the economics to the mirror operator is $ 10^-3 - $0.2 x 
10^-3 per hit, and they still make 80% of the money they would have 
before, and don't need to worry about being cut off. However, the 
economics for the WF are now quite different: instead of losing $ 10^-4 
per hit, the Foundation would make $ 2 x 10^-4 income - $ 10^-4 cost per 
hit, and thus makes $ 1000 gross profit over the course of the year for 
those 10,000,000 hits, which can be ploughed back into achieving the 
Foundation's charitable goals (for example, by buying new server kit and 
bandwidth, or paying for other real-world activities).

Note that the users of the real-time mirrors are _not_ being charged for 
use of the GFDL content, which remains freely available as before; they 
are being charged for real-time access to WP data, with no need to run a 
modified copy of MediaWiki in order to run their service.

Administration of the scheme could be made automatic, by allowing the 
existing credit-card interface to be used to for payment, and entering 
an IP address or addresses to be authorized, an E-mail address for 
contact, and getting an authorization key mailed back.

As a result:
* Wikipedia remains ad-free
* the WF gets revenue
* the advertisers still get to make (slightly less) money, but this time 
without leeching unauthorized resources.

The feed could be provided from the existing software, only with a "null 
skin" that produced only the rendered page content, thus both slightly 
reducing the load of producing it (eg. no check for messages, greater 
possibility for caching), and, at the same time, making the page content 
easier to re-use, by removing the need to strip the user-interface from 
around the page contents.

With other changes, for example, not checking for red/blue links, 
serving costs could probably be reduced even further, and quote possibly 
WF could charge more than $ 2 x 10^-4 per hit. Given the number of 
mirrors around, setting up this scheme might pay for itself in a month 
or less.

Good idea, or bad idea?

-- Neil





More information about the WikiEN-l mailing list