One approach might be to get a list of all pages/links added using
Mark's method, then for each of the sets do a manual sample of a few
percent of the new links and see who added them - this would let you
know if you're looking at a situation where "almost all links to XYZ
were added by TWL users" or "only some links were added by non-TWL
users", and estimate accordingly.
(I suspect some will definitely be in the first batch)
Andrew.
On 14 January 2015 at 13:32, mjn <mjn(a)anadrome.org> wrote:
Aaron Halfaker <ahalfaker(a)wikimedia.org> writes:
...you'll need to parse wiki content in order
to extract external links.
I don't think they are stored in a table anywhere.
The links themselves are, but it isn't tied to editor information, so I
don't think will answer this particular query. In the database dumps at
dumps.wikimedia.org, the table that's dumped as
xxwiki-yyyymmdd-externallinks.sql.gz lists external links per-page. So
if you just wanted counts of link additions (or removals), you could
grab two dumps from different dates and compare. But you'll need to
parse the full revision information to get a count of who added which
links.
-Mark
--
mjn |
http://www.anadrome.org
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
--
- Andrew Gray
andrew.gray(a)dunelm.org.uk