On Tue, Dec 01, 2015 at 07:50:23PM +0100, Federico Leva (Nemo) wrote:
Edison Nica, 29/11/2015 16:56:
how many non-wikipedia pages point to a certain wikipedia page
I guess the only way we have to know this (other than grepping request logs for referrers, which would be quite a nightmare) is to access the Google Webmaster account for wikipedia.org (to which a couple employees had access, IIRC).
There are a couple of other ways to figure out inlinks:
* Common Crawl * Commercial SEO services like Moz or Ahrefs
In the medium term the Internet Archive is going to be generating this kind of link data as part of the Wayback Machine search engine effort.
And finally, Edison, counting the number of inlinks without considering their rank or popularity will probably leave you vulnerable to people orchestrating googlebombs. And you might want to also know the anchortext, that's extremely valuable for search indexing.
-- greg