+ John and Lauren
On Tue, May 2, 2017 at 5:49 AM, Robert West west@cs.stanford.edu wrote:
One quick thought: it would be extremely useful for your purposes to be able to study not only the static structure of references pointing from Wikipedia to external documents, but also how these references are used. Tracking this traffic is currently impossible, since it will only leave a footprint on the webserver of the link target, not on Wikimedia's webservers.
Have you thought about the possibility of funneling external links through a Wikimedia URL, which would allow you to record the links through which users leave Wikipedia?
I know this would be a major change to the infrastructure, and I'm not sure how the privacy implications would line up with Wikipedia's guidelines, but it's worthwhile giving it some serious thought. At the very least, Wikimedia could store counts of external-link clicks, without linking those clicks to users' Wikimedia-internal browse traces.
ha! interesting that you say this. :)
John and Lauren reached out to us some months ago after we published the work on Why We Read Wikipedia and asked about the possibility of doing exactly what you say above. We met a few weeks ago and are now exploring that space together. Pending on Board and FDC approvals, this is an item which is part of our next annual plan programs.
Bob, you may have seen https://purl.stanford.edu/ny213kn0075 before? John wrote this some years ago because they had access to the server logs of Stanford's Encyclopedia of Philosophy and they could learn more about the relation between Wikipedia reference usage and that. It would be great if we can repeat that kind of analysis, but this time beyond just that one specific resource.
We will post the proposal on meta as soon as the steps are solidified. In the mean time, feel free to chat with John and Lauren directly.
Andrea, I know Lauren will be in WikiCite as well. The two of you may enjoy having a chat with each other. :)
Best, Leila p.s. John, Lauren: I'm not sure if you're on this public list. If you're not, please feel free to subscribe at https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Bob
On Mon, May 1, 2017 at 9:17 PM, Andrea Forte andrea.forte@gmail.com wrote:
Hi all,
One of my PhD students, Meen Chul Kim, is a data scientist with experience in bibliometrics and we will be working on some citation-related research together with Aaron and Dario in the coming months. Our main goal in the short term is to develop an enhanced citation dataset that will allow for future analyses of citation data associated with article quality, lifecycle, editing trends, etc.
The project page is here: https://meta.wikimedia.org/wiki/Research:Understanding_the_context_of_citati...
The project is just getting started so this is a great time to offer feedback and suggestions, especially for features of citations that we should mine as a first step, since this will affect what the dataset can be used for in the future.
Looking forward to seeing some of you at WikiCite!!
Andrea
-- :: Andrea Forte :: Associate Professor :: College of Computing and Informatics, Drexel University :: http://www.andreaforte.net _______________________________________________ Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Wiki-research-l mailing list Wiki-research-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/wiki-research-l