Hi all,
Thanks very much for your responses, I'm grateful to have so many options!
I've followed the instructions from James, and I'm running the query now.
If I have any issues I'll let you know!
Many thanks,
Nick
On 10 March 2018 at 12:31, James Salsman <jsalsman(a)gmail.com> wrote:
Hi Nick,
I made a Quarry query to do this for you:
https://quarry.wmflabs.org/
query/25400
You will have to fork it and remove the "LIMIT 10" to get it to run on
all the English Wikipedia articles. It may take too long or produce
too much data, in which case please ask on this list for someone who
can run it for you.
USE enwiki_p;
SELECT page_title as article, COUNT(DISTINCT pli.pl_from) as inlinks,
COUNT(DISTINCT plo.pl_title) as outlinks
FROM page
JOIN pagelinks AS pli ON page.page_title = pli.pl_title AND
pli.pl_namespace = 0
AND page.page_namespace = 0 AND page.page_is_redirect = 0
JOIN pagelinks AS plo ON page.page_id = plo.pl_from AND plo.pl_namespace =
0
AND page.page_namespace = 0 AND page.page_is_redirect = 0
GROUP BY article
LIMIT 10;
Refs.:
https://www.mediawiki.org/wiki/Manual:Pagelinks_table
https://www.mediawiki.org/wiki/Manual:Page_table
From: Nick Bell <bhink03(a)gmail.com>
Subject: [Analytics] Ingoing and outgoing internal links enquiry
Dear Analytics Team,
I’m doing a project on Wikipedia for my Maths degree, and I was hoping
you
could help me acquire some data about Wikipedia.
I would like to get the number of incoming internal links and outgoing
internal links for every page, if possible. I could limit this if needs
be,
as I am aware this totals around 11 million
values.
I have minimal programming experience, so if this is unreasonable or
impossible please let me know. I very much appreciate your time
considering
my request.
Many thanks,
Nicholas Bell
Mathematics Undergraduate
University of Bristol