What a great idea! I think this would be fascinating - it's outside the
scope of what we'll accomplish in the initial project phase but it would be
an interesting complimentary project or something to think about as a
second effort that would increase the value of the data.
Andrea
On Tue, May 2, 2017 at 8:49 AM, Robert West <west(a)cs.stanford.edu> wrote:
Hi,
This looks like a great project!
One quick thought: it would be extremely useful for your purposes to
be able to study not only the static structure of references pointing
from Wikipedia to external documents, but also how these references
are used. Tracking this traffic is currently impossible, since it will
only leave a footprint on the webserver of the link target, not on
Wikimedia's webservers.
Have you thought about the possibility of funneling external links
through a Wikimedia URL, which would allow you to record the links
through which users leave Wikipedia?
I know this would be a major change to the infrastructure, and I'm not
sure how the privacy implications would line up with Wikipedia's
guidelines, but it's worthwhile giving it some serious thought. At the
very least, Wikimedia could store counts of external-link clicks,
without linking those clicks to users' Wikimedia-internal browse
traces.
Bob
On Mon, May 1, 2017 at 9:17 PM, Andrea Forte <andrea.forte(a)gmail.com>
wrote:
Hi all,
One of my PhD students, Meen Chul Kim, is a data scientist with
experience
in bibliometrics and we will be working on some
citation-related research
together with Aaron and Dario in the coming months. Our main goal in the
short term is to develop an enhanced citation dataset that will allow for
future analyses of citation data associated with article quality,
lifecycle, editing trends, etc.
The project page is here:
https://meta.wikimedia.org/wiki/Research:Understanding_
the_context_of_citations_in_Wikipedia
The project is just getting started so this is a great time to offer
feedback and suggestions, especially for features of citations that we
should mine as a first step, since this will affect what the dataset can
be
used for in the future.
Looking forward to seeing some of you at WikiCite!!
Andrea
--
:: Andrea Forte
:: Associate Professor
:: College of Computing and Informatics, Drexel University
::
http://www.andreaforte.net
_______________________________________________
Wiki-research-l mailing list
Wiki-research-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
--
:: Andrea Forte
:: Associate Professor
:: College of Computing and Informatics, Drexel University
::