Jared Williams wrote:
-----Original Message----- From: Gregory Maxwell On 1/24/07, Andrew Cates andrew@catesfamily.org.uk wrote:
Therefore, could I put in a request for a "greenlist" feature to allow sysop approved links to be generated without rel="nofollow"?
There are well over 8 million external links in enwiki alone, including well over a half million distinct domains via HTTP.
A greenlist is not a remotely sane solution because of this...
A Bloom Filter could hold the entire 8 million urls in about 1 megabyte with a false positive probability of 1% or 1.5megabytes with a 0.1% probability.
Whilst loading in and out of PHP wouldn't be nice, a separate service is certainly feasible.
A simple database table containing a list of hostnames is probably the simplest solution, and can be queried efficiently. The size of the table should not be an issue -- after all, we _already have_ all this data in our database in a much less compact format.