Real-time mirrors seem to be a recurring phenomenon. They are a drain on
Wikipedia's resources, and hunting them and shooting them down is a
continuing battle.
The reasoning behind these mirrors appears to be:
1 putting up a Wikipedia mirror with ads will make money...
2 too lazy to set up a proper mirror...
3 instead, set up a script that queries Wikipedia in real time...
4 profit!
However; why not turn this on its head, and offer a real-time, or
near-real-time, Wikipedia feed service to paid-up subscribers?
Currently, Wikipedia's running costs are about $1.2M per year, and this
pays for, among other things, serving about 4000 hits per second, that
is to say, about 1.26 x 10^11 hits per year, or about $ 10^-5 per hit.
(Of course, this is average gross cost; marginal cost will be
significantly higher, say $ 10^-4 per hit).
Web advertising rates are generally of the order of $1 CPM: that is, $
10^-3 per hit. If an advertiser manages to get 10,000,000 hits per year,
they will make $10,000 in ad revenue, and costs the Wikimedia Foundation
around $1000 in leeched server load.
What if we were to turn things round, and charge (say) $ 2 x 10^-4 per
hit for an official real-time mirror service? (Of course, this would be
aggregated in lumps, because it's impossible to bill tiny fractions of a
dollar). Now, the economics to the mirror operator is $ 10^-3 - $0.2 x
10^-3 per hit, and they still make 80% of the money they would have
before, and don't need to worry about being cut off. However, the
economics for the WF are now quite different: instead of losing $ 10^-4
per hit, the Foundation would make $ 2 x 10^-4 income - $ 10^-4 cost per
hit, and thus makes $ 1000 gross profit over the course of the year for
those 10,000,000 hits, which can be ploughed back into achieving the
Foundation's charitable goals (for example, by buying new server kit and
bandwidth, or paying for other real-world activities).
Note that the users of the real-time mirrors are _not_ being charged for
use of the GFDL content, which remains freely available as before; they
are being charged for real-time access to WP data, with no need to run a
modified copy of MediaWiki in order to run their service.
Administration of the scheme could be made automatic, by allowing the
existing credit-card interface to be used to for payment, and entering
an IP address or addresses to be authorized, an E-mail address for
contact, and getting an authorization key mailed back.
As a result:
* Wikipedia remains ad-free
* the WF gets revenue
* the advertisers still get to make (slightly less) money, but this time
without leeching unauthorized resources.
The feed could be provided from the existing software, only with a "null
skin" that produced only the rendered page content, thus both slightly
reducing the load of producing it (eg. no check for messages, greater
possibility for caching), and, at the same time, making the page content
easier to re-use, by removing the need to strip the user-interface from
around the page contents.
With other changes, for example, not checking for red/blue links,
serving costs could probably be reduced even further, and quote possibly
WF could charge more than $ 2 x 10^-4 per hit. Given the number of
mirrors around, setting up this scheme might pay for itself in a month
or less.
Good idea, or bad idea?
-- Neil