Assuming it sends its own user-agent string, surely we can block it with mod_rewrite, provided they tell us in advance what that string is?
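
For what it's worth, a block like that would only be a few lines of Apache config. This is a rough sketch, not a tested rule set: "ExampleCrawler" is a made-up placeholder, since we don't yet know what user-agent string the tool actually sends.

```apache
RewriteEngine On
# "ExampleCrawler" is a hypothetical placeholder for the tool's real user-agent string
RewriteCond %{HTTP_USER_AGENT} ExampleCrawler [NC]
# Match every request from that client and answer 403 Forbidden
RewriteRule ^ - [F]
```

The [NC] flag makes the match case-insensitive, and [F] returns 403 without rewriting the URL. It only works as long as the tool keeps sending the string it told us about.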
Or IBM could implement a simple blacklist of sites. If someone tries to go to en.wikipedia.org, the tool explains that the site doesn't want the burden and bounces them to the database download page instead.
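
On the tool's side, the blacklist check could be about this small. This is only a sketch of the idea in Python; the names `BLOCKED_HOSTS`, `DOWNLOAD_PAGE` and `check_url` are invented for illustration, the download URL is a placeholder, and nothing here reflects how IBM's tool is actually built.

```python
from urllib.parse import urlsplit

# Hypothetical blacklist of hosts that have asked not to be crawled
BLOCKED_HOSTS = {"en.wikipedia.org"}

# Placeholder address standing in for the real database download page
DOWNLOAD_PAGE = "https://example.org/database-download"


def check_url(url):
    """Return (url_to_fetch, message); message is None if the host is allowed."""
    host = (urlsplit(url).hostname or "").lower()
    if host in BLOCKED_HOSTS:
        message = (
            f"{host} has asked not to carry this load; "
            "please use the database download instead."
        )
        return DOWNLOAD_PAGE, message
    return url, None
```

A caller would run every requested URL through `check_url` first, show the message if there is one, and then fetch whatever URL comes back.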