Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film's release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
- Are you affiliated with the site above, or do you independently cover traffic volume?
- Do you know if it is possible to attribute a source country to traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards Alex
Hi Alex,
You probably just want to read Taha's publication on the exact topic: http://dataconomy.com/using-wikipedia-activity-data-forecast-movie-success/
Cheers, Tom
On Mon, Apr 27, 2015 at 4:27 PM, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you independently
cover traffic volume?
Do you know if it is possible to attribute a source country to
traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards
Alex
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Alex,
Thanks Thomas for mentioning our work. There's also an extension of our work to other languages/countries: http://arxiv.org/abs/1405.5924
Best, Taha
On Tue, Apr 28, 2015 at 5:05 PM, Thomas Steiner tomac@google.com wrote:
Hi Alex,
You probably just want to read Taha's publication on the exact topic: http://dataconomy.com/using-wikipedia-activity-data-forecast-movie-success/
Cheers, Tom
On Mon, Apr 27, 2015 at 4:27 PM, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information
about
how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ ) and
am
thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you
independently
cover traffic volume?
Do you know if it is possible to attribute a source country to
traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards
Alex
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- Dr. Thomas Steiner, Employee, Google Inc. http://blog.tomayac.com, http://twitter.com/tomayac
-----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.12 (GNU/Linux)
iFy0uwAntT0bE3xtRa5AfeCheCkthAtTh3reSabiGbl0ck0fjumBl3DCharaCTersAttH3b0ttom.hTtP5://xKcd.c0m/1181/ -----END PGP SIGNATURE-----
Hi Alex,
To quickly answer your first question, stats.grok.se is quasi-independent - it's run privately but uses the traffic data generated by WMF. If you want to process this yourself, it's available in hourly increments covering all pages on all Wikimedia sites, including all language editions of Wikipedia, with a day totalling about 2.5gb of compressed data. There are two formats:
http://dumps.wikimedia.org/other/pagecounts-raw/ - the "traditional" log files as used by stats.grok.se
http://dumps.wikimedia.org/other/pagecounts-all-sites/ - includes separated data for the mobile & Wikipedia Zero sites as well as the standard site. Probably substantially more useful for your purposes.
For your second question, not really, for privacy reasons. Some top-level data on overall traffic by country is available - see the rather fun https://ironholds.shinyapps.io/WhereInTheWorldIsWikipedia/ - but not on a per-page level.
Andrew.
On 27 April 2015 at 15:27, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you independently
cover traffic volume?
Do you know if it is possible to attribute a source country to
traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards
Alex
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
http://datavis.wmflabs.org/where/ has a more updated version of the vis ;). Happy to answer any and all research questions about our readership data, since I gather most of it.
On 28 April 2015 at 12:51, Andrew Gray andrew.gray@dunelm.org.uk wrote:
Hi Alex,
To quickly answer your first question, stats.grok.se is quasi-independent - it's run privately but uses the traffic data generated by WMF. If you want to process this yourself, it's available in hourly increments covering all pages on all Wikimedia sites, including all language editions of Wikipedia, with a day totalling about 2.5gb of compressed data. There are two formats:
http://dumps.wikimedia.org/other/pagecounts-raw/ - the "traditional" log files as used by stats.grok.se
http://dumps.wikimedia.org/other/pagecounts-all-sites/ - includes separated data for the mobile & Wikipedia Zero sites as well as the standard site. Probably substantially more useful for your purposes.
For your second question, not really, for privacy reasons. Some top-level data on overall traffic by country is available - see the rather fun https://ironholds.shinyapps.io/WhereInTheWorldIsWikipedia/
- but not on a per-page level.
Andrew.
On 27 April 2015 at 15:27, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you independently
cover traffic volume?
Do you know if it is possible to attribute a source country to
traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards
Alex
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
--
- Andrew Gray andrew.gray@dunelm.org.uk
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Reminds me of the work Next Big Sound has been doing.
Here's what the Next Big Sound dashboard looks like: http://i.imgur.com/MQBPfdm.png http://i.imgur.com/MQBPfdm.png
.. where "the music business is measuring success via popularity social media." (Including Wikipedia page views!)
According to Next Big Sound, Wikipedia page views by far have the strongest correlation to sales. (More than any other metric)
Check out their blog post for this and more information: http://blog.nextbigsound.com/post/37277146054/what-social-media-has-to-do-wi... http://blog.nextbigsound.com/post/37277146054/what-social-media-has-to-do-with-record-sales
See also:
* YouTube Developers Live: Next Big Sound Details their backend stack. Turns out it's written mostly in PHP, using Cassandra DB. https://www.youtube.com/watch?v=x3m_weExKhA https://www.youtube.com/watch?v=x3m_weExKhA
* Lightning talk: YouTube Analytics API This is where I found out about the Next Big Sound and saw Wikipedia in the screenshot. https://www.youtube.com/watch?v=nDnmeGgd8qc&t=4m51s https://www.youtube.com/watch?v=nDnmeGgd8qc&t=4m51s
— Krinkle
On 27 Apr 2015, at 15:27, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you independently cover traffic volume?
Do you know if it is possible to attribute a source country to traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards Alex _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Reminds me of the work Next Big Sound has been doing.
Here's what the Next Big Sound dashboard looks like: http://i.imgur.com/MQBPfdm.png http://i.imgur.com/MQBPfdm.png
.. where "the music business is measuring success via popularity social media." (Including Wikipedia page views!)
According to Next Big Sound, Wikipedia page views by far have the strongest correlation to sales. (More than any other metric)
Check out their blog post for this and more information: http://blog.nextbigsound.com/post/37277146054/what-social-media-has-to-do-wi... http://blog.nextbigsound.com/post/37277146054/what-social-media-has-to-do-with-record-sales
See also:
* YouTube Developers Live: Next Big Sound Details their backend stack. Turns out it's written mostly in PHP, using Cassandra DB. https://www.youtube.com/watch?v=x3m_weExKhA https://www.youtube.com/watch?v=x3m_weExKhA
* Lightning talk: YouTube Analytics API This is where I found out about the Next Big Sound and saw Wikipedia in the screenshot. https://www.youtube.com/watch?v=nDnmeGgd8qc&t=4m51s https://www.youtube.com/watch?v=nDnmeGgd8qc&t=4m51s
— Timo
On 27 Apr 2015, at 15:27, Alex Harrison alex.harrison@waytoblue.com wrote:
Hi Wikipedia analysts,
I work in product development for the Insights team at Way To Blue. We provide reports to major film studios to provide them with information about how their films are doing on social platforms over the course of their marketing campaign, up to a film’s release in cinemas.
We typically focus on social media volumes (and traditionally split by Twitter, Forums, Blogs and News), however I just stumbled across the free Wikipedia article traffic statistics site (http://stats.grok.se/ http://stats.grok.se/ ) and am thinking that daily Wikipedia traffic could be an interesting additional metric for us to report. I have a few questions about daily Wikipedia traffic data:
Are you affiliated with the site above, or do you independently cover traffic volume?
Do you know if it is possible to attribute a source country to traffic count? E.g. I can see that the daily traffic on the Tomorrowland (film) page was 5773 yesterday, however it would be interesting what proportion of that is US vs. UK vs. etc.
We have never used Wikipedia traffic as a source, so if you are the wrong team to be asking about this please let me know!
Best regards Alex _______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics