Anyway, I think the ideal would be to have druid + a
visualization package
instead of vital signs and then I could filter for exactly
what I want, and
I wouldn't have to >ask if includes bots ;-)
Since vital signs uses the pageview definition you can also take a look a
https://meta.wikimedia.org/wiki/Research:Page_view
To be clear, the definition used by vital-signs and
http://stats.wikipedia.org is the same one from May 2015 onwards and it
does not include traffic that is self-reported as bot traffic. We know,
however, that there is bot traffic that we do not tag as such.
On Fri, Jan 22, 2016 at 9:23 AM, Kevin Leduc <kevin(a)wikimedia.org> wrote:
I often consult vital signs (usually leading up to the
monthly metrics
meeting) to see how our traffic is doing. I'm always hoping it is bouncing
back up and it's something that could be presented at the metrics meeting.
I wish Vital Signs had monthly pageview data. I know I can use the
smoothing function... but I'm looking for a quick lookup of a number I can
report back: e.g. last month we had 15 billion pageviews. The reading team
is running hive queries to get this and it's one of their KPIs it:
https://www.mediawiki.org/wiki/Wikimedia_Product#Reading .
Anyway, I think the ideal would be to have druid + a visualization package
instead of vital signs and then I could filter for exactly what I want, and
I wouldn't have to ask if includes bots ;-)
On Fri, Jan 22, 2016 at 1:51 AM, Joseph Allemandou <
jallemandou(a)wikimedia.org> wrote:
Hi Kevin
I confirm that Vital Sign pageviews DO NOT contain the traffic we flag as
automated (see the code here
<https://github.com/wikimedia/analytics-refinery/blob/master/oozie/projectview/hourly/transform_projectview_to_legacy_format.hql#L79>
)
Joseph
On Fri, Jan 22, 2016 at 5:41 AM, Kevin Leduc <kevin(a)wikimedia.org> wrote:
Can you confirm that Vital Signs pageviews
include web crawler and bot
traffic?
On Thu, Jan 21, 2016 at 5:16 PM, Nuria Ruiz <nuria(a)wikimedia.org> wrote:
Hello,
The UI for vital signs will no longer display legacy pageview data
(pageviews calculations that used the old definition). We are working
towards having one consistent pageview definition [1] in every tool that
surfaces pageview data.
Please have in mind that the new definition only exists since May 2015
any data from before the switch was made was calculated using the old
(undocumented as far as we know) definition.
You can access vital signs UI in the following url:
https://vital-signs.wmflabs.org/#projects=ruwiki,itwiki,dewiki,frwiki,enwik…
Thanks,
Nuria
[1]
https://meta.wikimedia.org/wiki/Research:Page_view
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
*Joseph Allemandou*
Data Engineer @ Wikimedia Foundation
IRC: joal
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics