On Sat, 2002-11-16 at 15:09, Jonathan Walther wrote:
On Sun, Nov 17, 2002 at 12:06:00AM +0100, Erik Moeller
wrote:
Many Wikipedia users try to insert as many
relevant links as possible,
making the relationship between "broken" and real links often 10:1. I
don't think it would make much sense to create blank rows for each one of
them just to have simpler table definitions.
That doesn't sound right. There are definately more "good" links in the
database than dangling ones. Maybe you could run a query and tell us the
results?
Assuming we're talking about pages that are linked to rather than the
raw total of links:
mysql> select count(distinct bl_to) from brokenlinks;
+-----------------------+
| count(distinct bl_to) |
+-----------------------+
| 188126 |
+-----------------------+
mysql> select count(distinct l_to) from links;
+----------------------+
| count(distinct l_to) |
+----------------------+
| 115708 |
+----------------------+
It's certainly not 10:1, but more like 5:3 in favor of not yet existing
pages on the English wiki.
-- brion vibber (brion @
pobox.com)