[Foundation-l] Release of squid log data

Ben McIlwain cydeweys at gmail.com
Sat Sep 15 14:24:53 UTC 2007


-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Tim Starling wrote:
> Anthony wrote:
>> What I still don't understand is what period this information would be
>> from.  Would it only be a UDP stream of new requests, or would it
>> include old log data?  At least if it's only new requests those of us
>> who are "mildly paranoid" can make sure we always access WP through
>> tor.
> 
> I've been passing on questions from this thread to the researchers, and 
> I'm still waiting for their reply. So I won't answer most of the 
> questions just yet, but I can answer this one.
> 
> It's a UDP stream of new requests, they won't get any old data.
> 
> There is no old log data for them to have, except at 1/1000 sampling, 
> and even that has gaps in it due to disk-full conditions, and it doesn't 
> go back very far.

A stream of requests of live Wikipedia information is valuable as hell.
 Imagine Google Zeitgeist, but instead of just getting a once-a-year
snapshot of it, you get it every second.

I think this information is too powerful and valuable to pick and choose
who gets it.  Either find a way to release it with a public API so that
anyone can access it (and there are many excellent uses for this kind of
data), or don't release it at all.  I especially don't like the idea of
certain people getting raw data that easily identifies users.  It's a
total sidestep of the check user policy.  I know this isn't what most
editors signed up for.
-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.6 (MingW32)

iD8DBQFG6+s1vCEYTv+mBWcRAszQAJ4w9fZUKdEs95HqP/9fbUyjs9PUngCfRz8D
ikQGxK3Sb8jLYJRKY76cfQg=
=COUb
-----END PGP SIGNATURE-----



More information about the foundation-l mailing list