-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Mashiah Davidson:
Currently I am not sure the difference is caused by the fact that rules for links taking/not taking into account are different because the difference in results looks too huge.
There are no rules for this at the moment, it will only find purely isolated clusters. Nonetheless, it already performs a traversal of the entire page tree, so removing some edges should not have a large effect on performance (it may even become faster).
Are the rules for detecting links which should be excluded documented anywhere?
- river.