Heya,
Since yesterday afternoon we have been having intermittent packet loss
issues with Emery. Yesterday Ori and I wrote a simple patch to enable
sampling on two filters to reduce the workload. That seemed to work, but
this morning the packet loss came back. Paravoid pointed out that it was
probably because Emery had been running for 208 days, and older
versions of the kernel start doing weird things after that much uptime.
Rebooting Emery seems to have resolved the issue, and we are planning to
upgrade to Precise soon.
Best,
Diederik
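A note on the 208-day figure, as a sketch under an assumption: the message does not say which kernel bug is meant. If it is the widely reported Linux sched_clock issue, where an intermediate value overflows once uptime passes roughly 2**54 nanoseconds, the arithmetic lines up with the uptime quoted above.

```python
# Assumption: the "weird things" refer to the widely reported Linux
# sched_clock overflow, which shows up after about 2**54 nanoseconds
# of uptime. The message itself does not name the bug.
OVERFLOW_NS = 2 ** 54
SECONDS_PER_DAY = 86_400

# Convert nanoseconds to seconds, then to days.
overflow_days = OVERFLOW_NS / 1e9 / SECONDS_PER_DAY
print(f"{overflow_days:.1f} days")
```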
I wanted to follow up on and second Magnus' December
request <http://lists.wikimedia.org/pipermail/analytics/2012-December/000272.html>
for a more usable way to access page view stats. Mining these stats is
attracting an increasing amount of attention from researchers (
http://www.l3s.de/~kanhabua/papers/ECIR2013-WikiEvents.pdf,
http://arxiv.org/pdf/1212.5943v1.pdf, http://arxiv.org/pdf/1211.0970.pdf)
even as the current approaches for extracting them from stats.grok.se or
the dumps are slow and inhumane (respectively).
I'm also interested in looking at bursts of pageview activity on articles
and then examining the extent to which this pageview activity diffuses over
the local wiki-link network. I suspect this has strong implications for
understanding patterns of editing activity; namely, editing activity may be
non-trivially coupled with sudden attention to articles that are a few
degrees of separation away. I'd be happy to chat with folks inside or
outside of WMF about getting access to the relevant view stats and
beginning such an analysis.
Best,
Brian
http://arxiv.org/pdf/1208.4171.pdf
This is a pretty interesting and accessible description of best practices and design decisions driven by practical problems they had to solve at Twitter in the areas of client-side event logging, funnel analysis, and user modeling.
E3: check out section "3.2 Client Events" in particular, which is quite relevant to EventLogging.
Dario
Hi,
Does anyone know of a tool, or of someone who can help, for getting article
stats from a CSV file?
Next week is election day in Israel and I'm looking to get article
stats for the parties and the candidates. Doing it one by one will take too
much time. If there is a tool that can take a CSV file with the article
names and return a CSV file with the data, that would be great.
Thanks,
Itzik
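One way to approach the batch lookup described above, as a minimal sketch rather than an existing tool: read article titles from a CSV, look each one up, and write the results back out as CSV. The `fetch_views` callable is hypothetical and must be supplied by the caller (for example, a wrapper around whatever page view source is available); the function name `build_stats_csv` and the output column names are likewise assumptions.

```python
import csv
import io


def build_stats_csv(titles_csv, fetch_views):
    """Return CSV text with one 'title,views' row per input title.

    titles_csv  -- CSV text whose first column holds article titles
    fetch_views -- hypothetical caller-supplied function: title -> int
    """
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    writer.writerow(["title", "views"])
    for row in csv.reader(io.StringIO(titles_csv)):
        if not row or not row[0].strip():
            continue  # skip blank lines
        title = row[0].strip()
        writer.writerow([title, fetch_views(title)])
    return out.getvalue()


# Usage with made-up titles and numbers standing in for a real lookup.
fake_counts = {"Article A": 120, "Article B": 45}
print(build_stats_csv("Article A\nArticle B\n", lambda t: fake_counts.get(t, 0)))
```

Passing the lookup in as a function keeps the CSV handling independent of where the numbers come from, so the same loop works whichever stats source ends up being used.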
Does anyone know how to find or generate a list of the most frequently
edited Wikipedia articles in a certain category? We received a question
from a large newspaper that is interested in learning which topics from
their country received the most editor attention on the English
Wikipedia.
There are tools like http://www.wikistats.co/ (most edits during the
past 24 hours) and Wikirage
(http://en.wikipedia.org/wiki/Wikipedia:Wikirage , seems currently
defunct though). But they don't offer restricting the list per
category.
I guess I'm looking for something like
http://toolserver.org/~magnus/ts2/treeviews/ with edit counts instead
of page views, and capable of handling large category trees like that
for a country.
Hints about related results are appreciated too (e.g. articles most
frequently edited *from* a certain country).
--
Tilman Bayer
Senior Operations Analyst (Movement Communications)
Wikimedia Foundation
Hey folks,
I can't remember who to contact about this, but it looks like DB replication
on db1047 halted on Jan. 8th. Specifically, the last event in the
recentchanges table is 20130108215551.
What's the current protocol for giving it a kick?
-Aaron
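For reference, the value quoted above is a 14-digit MediaWiki timestamp (YYYYMMDDHHMMSS, UTC). The following is a minimal sketch of turning such a value into a lag figure; the helper names are illustrative and the `now` instant is an arbitrary example, not the time the message was sent.

```python
from datetime import datetime, timezone


def parse_mw_timestamp(ts):
    """Parse a 14-digit MediaWiki timestamp (YYYYMMDDHHMMSS, UTC)."""
    return datetime.strptime(ts, "%Y%m%d%H%M%S").replace(tzinfo=timezone.utc)


def replication_lag(last_rc_timestamp, now):
    """Return how far the newest replicated event trails `now`."""
    return now - parse_mw_timestamp(last_rc_timestamp)


# The timestamp quoted in the message; `now` is an arbitrary example instant.
now = datetime(2013, 1, 10, 0, 0, 0, tzinfo=timezone.utc)
print(replication_lag("20130108215551", now))
```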