What a great idea! I think this would be fascinating - it's outside the scope of what we'll accomplish in the initial project phase but it would be an interesting complementary project or something to think about as a second effort that would increase the value of the data.
Andrea
On Tue, May 2, 2017 at 8:49 AM, Robert West <west@cs.stanford.edu> wrote:
Hi,
This looks like a great project!
One quick thought: it would be extremely useful for your purposes to be able to study not only the static structure of references pointing from Wikipedia to external documents, but also how these references are used. Tracking this traffic is currently impossible, since a click on an external link leaves a footprint only on the webserver of the link target, not on Wikimedia's webservers.
Have you thought about the possibility of funneling external links through a Wikimedia URL, which would allow you to record the links through which users leave Wikipedia?
I know this would be a major change to the infrastructure, and I'm not sure how the privacy implications would line up with Wikipedia's guidelines, but it's worthwhile giving it some serious thought. At the very least, Wikimedia could store counts of external-link clicks, without linking those clicks to users' Wikimedia-internal browse traces.
Bob
On Mon, May 1, 2017 at 9:17 PM, Andrea Forte <andrea.forte@gmail.com> wrote:
Hi all,
One of my PhD students, Meen Chul Kim, is a data scientist with experience in bibliometrics and we will be working on some citation-related research together with Aaron and Dario in the coming months. Our main goal in the short term is to develop an enhanced citation dataset that will allow for future analyses of citation data associated with article quality, lifecycle, editing trends, etc.
The project page is here: https://meta.wikimedia.org/wiki/Research:Understanding_the_context_of_citations_in_Wikipedia
The project is just getting started so this is a great time to offer feedback and suggestions, especially for features of citations that we should mine as a first step, since this will affect what the dataset can be used for in the future.
Looking forward to seeing some of you at WikiCite!!
Andrea
--
:: Andrea Forte
:: Associate Professor
:: College of Computing and Informatics, Drexel University
:: http://www.andreaforte.net

_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l