On Mon, Aug 9, 2010 at 11:17 PM, Tim Starling <tstarling(a)wikimedia.org> wrote:
> On 10/08/10 15:16, Rob Lanphier wrote:
>> We have a single collection point for all of our logging, which is
>> actually just a sampling of the overall traffic (designed to be
>> roughly one out of every 1000 hits). The process is described here:
>> http://wikitech.wikimedia.org/view/Squid_logging
>> My understanding is that this code is also involved somewhere:
>> http://svn.wikimedia.org/viewvc/mediawiki/trunk/webstatscollector/
>> ...but I'm a little unclear on what the relationship between that
>> code and the code in trunk/udplog is.
>
> Maybe you should find out who wrote the relevant code and set up the
> relevant infrastructure, and ask them directly. It's not difficult to
> find out who it was.
Well, yes, I was hoping you'd weigh in on this thread. At any rate,
there are a couple of problems with the way that it works:

>> 1. Once we saturate the NIC on the logging machine, the quality of
>> our sampling degrades pretty rapidly. We've generally had a problem
>> with that over the past few months.
>
> We haven't saturated any NICs.
Sorry, I assumed it was a NIC. There has been packet loss, from what
I understand. I'll leave it at that.
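For anyone following along, the 1-in-1000 sampling described above can be sketched roughly like this. This is just my illustration of the idea, not the actual udp2log code; I'm assuming a simple counter-based filter (every 1000th line) rather than random selection, since that's the cheapest way to do it over a high-volume stream:

```python
import sys

# Hypothetical sketch of 1-in-1000 log sampling over a stream of log
# lines, in the spirit of a sampled udp2log pipe. Counter-based: emit
# every `factor`-th line, drop the rest.
SAMPLE_FACTOR = 1000

def sample_lines(lines, factor=SAMPLE_FACTOR):
    """Yield every `factor`-th line from an iterable of log lines."""
    for i, line in enumerate(lines, start=1):
        if i % factor == 0:
            yield line

if __name__ == "__main__":
    # Read log lines on stdin, write the sampled subset to stdout.
    for line in sample_lines(sys.stdin):
        sys.stdout.write(line)
```

Note that a filter like this only sees whatever packets actually arrive, which is why upstream packet loss degrades the sample rather than just thinning it uniformly.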
Rob