"Magnus Manske" wrote:
Sounds like there is an interesting exercise for a statistician here: by sampling from the larger Wikipedias, estimate the total number of article topics in them, taken collectively.
Easy: all articles on en.wikipedia
- all articles on other wikipedias that do *not* have an interlanguage
link to en.wikipedia (subtract duplicates that are connected in an "interwiki web" not including en.wikipedia; count these as 1)
There, all done :-)
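For anyone who wants to try this on a sample, here is a minimal sketch of the counting recipe above. It assumes each article is a (wiki, title) pair and that interlanguage links can be treated as undirected; the function name `count_topics` and the sample data are made up for illustration. Counting connected groups of linked articles gives the same number as "all of en.wikipedia, plus each interwiki web that does not touch en.wikipedia counted once".

```python
def count_topics(articles, links):
    """Count distinct topics as connected groups of interlinked articles.

    articles: iterable of (wiki, title) pairs (hypothetical sample data).
    links: iterable of (article, article) pairs, treated as undirected.
    """
    # Union-find: every article starts out as its own group.
    parent = {a: a for a in articles}

    def find(x):
        # Follow parent pointers to the group representative,
        # shortening the path along the way.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in links:
        # Skip links that point outside the sampled articles.
        if a in parent and b in parent:
            ra, rb = find(a), find(b)
            if ra != rb:
                parent[ra] = rb

    # Each remaining representative stands for one topic.
    return len({find(a) for a in parent})


# Made-up sample: one topic linked through en, one "interwiki web"
# that does not include en, and one article with no links at all.
articles = [
    ("en", "Cat"), ("de", "Katze"), ("fr", "Chat"),
    ("de", "Doublette"), ("fr", "Doublon"),
    ("nl", "Fiets"),
]
links = [
    (("en", "Cat"), ("de", "Katze")),
    (("fr", "Chat"), ("en", "Cat")),
    (("de", "Doublette"), ("fr", "Doublon")),
]

print(count_topics(articles, links))      # 3
print(count_topics(articles, links[:2]))  # 4, one link missing
```

Dropping a single link in the second call raises the count, so any interwiki link that is missing in practice makes the result too high rather than too low.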
You have a touching faith that interwiki is 100% efficient.
I think you've found a plausible upper bound, though.
Charles