+ John and Lauren
On Tue, May 2, 2017 at 5:49 AM, Robert West <west(a)cs.stanford.edu> wrote:
One quick thought: it would be extremely useful for your purposes to
be able to study not only the static structure of references pointing
from Wikipedia to external documents, but also how these references
are used. Tracking this traffic is currently impossible, since it will
only leave a footprint on the webserver of the link target, not on
Wikimedia's webservers.
Have you thought about the possibility of funneling external links
through a Wikimedia URL, which would allow you to record the links
through which users leave Wikipedia?
I know this would be a major change to the infrastructure, and I'm not
sure how the privacy implications would line up with Wikipedia's
guidelines, but it's worthwhile giving it some serious thought. At the
very least, Wikimedia could store counts of external-link clicks,
without linking those clicks to users' Wikimedia-internal browse
traces.
ha! interesting that you say this. :)
John and Lauren reached out to us some months ago after we published
the work on Why We Read Wikipedia and asked about the possibility of
doing exactly what you say above. We met a few weeks ago and are now
exploring that space together. Pending on Board and FDC approvals,
this is an item which is part of our next annual plan programs.
Bob, you may have seen
https://purl.stanford.edu/ny213kn0075 before?
John wrote this some years ago because they had access to the server
logs of Stanford's Encyclopedia of Philosophy and they could learn
more about the relation between Wikipedia reference usage and that. It
would be great if we can repeat that kind of analysis, but this time
beyond just that one specific resource.
We will post the proposal on meta as soon as the steps are solidified.
In the mean time, feel free to chat with John and Lauren directly.
Andrea, I know Lauren will be in WikiCite as well. The two of you may
enjoy having a chat with each other. :)
Best,
Leila
p.s. John, Lauren: I'm not sure if you're on this public list. If
you're not, please feel free to subscribe at
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Bob
On Mon, May 1, 2017 at 9:17 PM, Andrea Forte <andrea.forte(a)gmail.com> wrote:
Hi all,
One of my PhD students, Meen Chul Kim, is a data scientist with experience
in bibliometrics and we will be working on some citation-related research
together with Aaron and Dario in the coming months. Our main goal in the
short term is to develop an enhanced citation dataset that will allow for
future analyses of citation data associated with article quality,
lifecycle, editing trends, etc.
The project page is here:
https://meta.wikimedia.org/wiki/Research:Understanding_the_context_of_citat…
The project is just getting started so this is a great time to offer
feedback and suggestions, especially for features of citations that we
should mine as a first step, since this will affect what the dataset can be
used for in the future.
Looking forward to seeing some of you at WikiCite!!
Andrea
--
:: Andrea Forte
:: Associate Professor
:: College of Computing and Informatics, Drexel University
::
http://www.andreaforte.net
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l