Are the rules for detecting links which should be
excluded documented
anywhere?
The rules I use a bit different for various languages and depend on a
language configuration, which is made in a way similar to one defining
disambiguating template names at Mediawiki:Disambiguationspage.
Rules for connectivity analysis are simple, so I can just list them:
0. all redirects are thrown so that redirect pages are not present in
verticles set and all links through them added to edges set.
1. disambiguation pages are to be excluded from the articles set
(everything marked by a template linked from
Mediawiki:Disambiguationspage)
2. if a configuration exists (it does just for ru and uk at the
moment), some other pages can be exluded
3. all links from/to excluded pages are also excluded from edges set
4. links from chronological articles (this set is now empty for all
wikis except for ru and uk) are excluded from edges set
5. if an article is transcluded by another article (which happens
sometimes) it is assumed as linked instead
If you are interested in other confuguration setting rules, they are
described in russian and english languages here:
http://ru.wikipedia.org/wiki/%D0%92%D0%B8%D0%BA%D0%B8%D0%BF%D0%B5%D0%B4%D0%…
mashiah