-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Tim Starling wrote:
Anthony wrote:
What I still don't understand is what period
this information would be
from. Would it only be a UDP stream of new requests, or would it
include old log data? At least if it's only new requests those of us
who are "mildly paranoid" can make sure we always access WP through
tor.
I've been passing on questions from this thread to the researchers, and
I'm still waiting for their reply. So I won't answer most of the
questions just yet, but I can answer this one.
It's a UDP stream of new requests, they won't get any old data.
There is no old log data for them to have, except at 1/1000 sampling,
and even that has gaps in it due to disk-full conditions, and it doesn't
go back very far.
A stream of requests of live Wikipedia information is valuable as hell.
Imagine Google Zeitgeist, but instead of just getting a once-a-year
snapshot of it, you get it every second.
I think this information is too powerful and valuable to pick and choose
who gets it. Either find a way to release it with a public API so that
anyone can access it (and there are many excellent uses for this kind of
data), or don't release it at all. I especially don't like the idea of
certain people getting raw data that easily identifies users. It's a
total sidestep of the check user policy. I know this isn't what most
editors signed up for.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (MingW32)
iD8DBQFG6+s1vCEYTv+mBWcRAszQAJ4w9fZUKdEs95HqP/9fbUyjs9PUngCfRz8D
ikQGxK3Sb8jLYJRKY76cfQg=
=COUb
-----END PGP SIGNATURE-----