Re: [Wiki-research-l] Help to solve three doubts on Wikipedia research data

11 Apr 2010

On Sun, Apr 11, 2010 at 6:19 PM, Ziko van Dijk &lt;zvandijk(a)googlemail.com&gt; wrote:
...
  Hello,

 Gregory (? if I remember well) mentioned in August 2009 this:
 http://papers.ssrn.com/sol3/papers.cfm?abstract_id=1446862
 All examined sites spy on their visitors, but Wikimedia and Wikipedia. 
It's possible to track click progress without setting tracking
cookies. However.

You can form an {IP address, Useragent} tuple for every search then
make the assumption that subsequent page loads are the same client.

This is less accurate than cookie based full tracking, but it should
be sufficient for training a machine learning system for predictive
search results.   Especially if we make the reasonable assumption that
users at the same location are already more likely to be looking at
similar materials.

You could also insert a tracking token in the search result HTTP get,
which would give you very accurate data but only for the clicks
directly off the search page.

Again, not a maximum amount of information, but likely sufficient and
it doesn't involve any deep privacy violating tracking.

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

Re: [Wiki-research-l] Help to solve three doubts on Wikipedia research data