Saw this on reddit:
https://theintercept.com/2016/04/28/new-study-shows-mass-surveillance-breeds...
From the paper:
"This case study uses data on English language Wikipedia article view counts from the online service stats.grok.se, a portal maintained by a Wikimedia Foundation member. This portal provides access to a range of Wikipedia analytics, stats, and data. In particular, the portal aggregates Wikipedia article view data on a daily and monthly basis. This data at stats.grok.se has been used in a range of research, including studies involving market trends, health information access, and social-political change."
Just thought it might be of interest, especially considering WMF’s NSA lawsuit https://blog.wikimedia.org/2015/03/10/wikimedia-v-nsa/.
-Ao
+ Juliet, as this is something Communications may want to follow up on, given that stats.grok.se is not maintained by a Wikimedia Foundation member.
Thanks for sharing this.
Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
On Wed, Jan 18, 2017 at 9:00 AM, Andrew Otto acotto@gmail.com wrote:
Saw this on reddit:
https://theintercept.com/2016/04/28/new-study-shows-mass-surveillance-breeds-meekness-fear-and-self-censorship/
"This case study uses data on English language Wikipedia article view counts from the online service stats.grok.se, a portal maintained by a Wikimedia Foundation member. This portal provides access to a range of Wikipedia analytics, stats, and data. In particular, the portal aggregates Wikipedia article view data on a daily and monthly basis. This data at stats.grok.se has been used in a range of research, including studies involving market trends, health information access, and social-political change."
Just thought it might be of interest, especially considering WMF’s NSA lawsuit https://blog.wikimedia.org/2015/03/10/wikimedia-v-nsa/.
-Ao
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
For the record/archive: The link is http://stats.grok.se/ but the tool is no longer maintained, it seems (internal server error when entering an article name).
Interesting nonetheless.
Lodewijk
2017-01-18 18:56 GMT+01:00 Leila Zia leila@wikimedia.org:
+ Juliet, as this is something Communications may want to follow up on, given that stats.grok.se is not maintained by a Wikimedia Foundation member.
Thanks for sharing this.
Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
Very interesting, I hope legal reads this. And I hope comms figures out a way to counteract the public opinion that Wikipedia traffic is monitored by the government. Ops goes to such lengths to make sure that's not possible; they should get credit for that (and apparently the blog posts about the HTTPS switch are not enough).
On a technical note: we should revisit the idea of backfilling old data into AQS so people can do this type of research project on top of the pageview API, now that stats.grok is completely down.
On Wed, Jan 18, 2017 at 12:59 PM, Lodewijk lodewijk@effeietsanders.org wrote:
For the record/archive: The link is http://stats.grok.se/ but the tool is no longer maintained, it seems (internal server error when entering an article name).
Interesting nonetheless.
Lodewijk
On Thu, Jan 19, 2017 at 11:09 AM, Dan Andreescu dandreescu@wikimedia.org wrote:
On a technical note: we should revisit the idea of backfilling old data into AQS so people can do this type of research project on top of the pageview API, now that stats.grok is completely down.
I would love to have historical pageview data through the pageview API, and I know many other researchers and educators who would appreciate this addition immensely. I'd be happy to help with publicizing/championing this change; let me know if I can help in any way.
- J
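For readers following along, here is a minimal sketch of what a per-article query against the pageview API might look like. It assumes the public per-article REST endpoint under wikimedia.org/api/rest_v1; the helper names `per_article_url`, `fetch_items` and `total_views` and the User-Agent string are made up for illustration, and which date ranges actually return data depends on what has been loaded into AQS.

```python
import json
from urllib.parse import quote
from urllib.request import Request, urlopen

# Base path of the per-article pageviews endpoint (assumed here; check the
# current REST API documentation before relying on it).
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def per_article_url(project, article, start, end,
                    access="all-access", agent="user", granularity="daily"):
    """Build the request URL; start and end are YYYYMMDD strings."""
    # Titles use underscores instead of spaces and must be percent-encoded.
    title = quote(article.replace(" ", "_"), safe="")
    return "/".join([BASE, project, access, agent, title,
                     granularity, start, end])


def fetch_items(url, user_agent="pageview-sketch/0.1 (hypothetical contact)"):
    """Fetch the URL and return the parsed JSON body (performs a network call)."""
    request = Request(url, headers={"User-Agent": user_agent})
    with urlopen(request) as response:
        return json.load(response)


def total_views(payload):
    """Sum the 'views' field over all entries in the payload's 'items' list."""
    return sum(item["views"] for item in payload.get("items", []))
```

Calling `total_views(fetch_items(per_article_url("en.wikipedia", "Britney Spears", "20160101", "20160131")))` would then give one month's total, provided the API holds data for that range; for the older period covered only by stats.grok.se this is exactly the gap that backfilling would close.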
Dan Andreescu, 19/01/2017 20:09:
now that stats.grok is completely down.
It's not, AFAICT: http://stats.grok.se/en/200712/Britney_Spears
Only the new data is missing (since January 2016), as stated on the FAQ: https://en.wikipedia.org/wiki/User:Killiondude/stats#Are_there_known_dates_for_which_complete_sets_have_not_been_compiled_although_the_data_seems_to_be_available
Nemo
Thanks for the clarification. I created a ticket, T155785 https://phabricator.wikimedia.org/T155785, to disable 'impossible' queries and explain this on the tool page. I'm not sure who could do that, so if someone could add that person as a subscriber and add the right tag, that'd be great.
Thanks, Lodewijk
2017-01-19 21:47 GMT+01:00 Federico Leva (Nemo) nemowiki@gmail.com:
Dan Andreescu, 19/01/2017 20:09:
now that stats.grok is completely down.
It's not, AFAICT: http://stats.grok.se/en/200712/Britney_Spears
Only the new data is missing (since January 2016), as stated on the FAQ: https://en.wikipedia.org/wiki/User:Killiondude/stats#Are_there_known_dates_for_which_complete_sets_have_not_been_compiled_although_the_data_seems_to_be_available
Nemo
Dan Andreescu, 19/01/2017 23:42:
there are no ways to know if data is missing or there are actual gaps.
These were documented on the FAQ though, also based on Erik Zachte's analysis.
Erik's analysis I trust, but the FAQ wasn't maintained either so I wouldn't go by it.
On Jan 19, 2017 14:09, "Dan Andreescu" dandreescu@wikimedia.org wrote:
figures out a way to counter-act the public opinion that Wikipedia traffic is monitored by the government. Ops goes to such lengths to make sure that's not possible, they should get credit for that (and apparently the blog posts about the https switch are not enough).
I hope they don't go too far with that; AFAIK we're still quite vulnerable to traffic analysis by a passive listener (no MITM, and notwithstanding any vulnerability mentioned elsewhere on this thread, because we're not using QUIC?).
And we leak SNI too.
see threads:
Zack started a few different threads on ~ 2013-08-16: https://lists.wikimedia.org/pipermail/wikitech-l/2013-August/071262.html
"take two" (2014-06-05): https://lists.wikimedia.org/pipermail/wikitech-l/2014-June/076876.html
On a technical note: we should revisit the idea of backfilling old data into AQS so people can do this type of research project on top of the pageview API,
I would use it. :-)
-Jeremy
It seems to me that the decline in terrorism-related pageviews started long before the Snowden revelations (2011-2012?). See the enclosed graph from wikipediatrends.com. [image: Inline image 1]
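One rough way to check this kind of claim numerically, sketched under stated assumptions: fit a least-squares slope to the monthly view counts before and after a cutoff date and compare the two. The function names `ols_slope` and `pre_post_slopes` are hypothetical, the cutoff is whatever date the reader picks (the Snowden disclosures began in June 2013), and this is not the method used in the paper.

```python
from datetime import date


def ols_slope(values):
    """Least-squares slope of values against their index 0..n-1."""
    n = len(values)
    if n < 2:
        raise ValueError("need at least two points to fit a slope")
    mean_x = (n - 1) / 2
    mean_y = sum(values) / n
    numerator = sum((i - mean_x) * (v - mean_y) for i, v in enumerate(values))
    denominator = sum((i - mean_x) ** 2 for i in range(n))
    return numerator / denominator


def pre_post_slopes(series, cutoff):
    """Return (slope before cutoff, slope from cutoff onward).

    series is a list of (date, views) pairs sorted by date, one per month.
    """
    pre = [views for day, views in series if day < cutoff]
    post = [views for day, views in series if day >= cutoff]
    return ols_slope(pre), ols_slope(post)
```

A clearly negative slope over the months before the cutoff would be consistent with a decline that was already under way; the input should be real monthly counts for the articles in question, not invented numbers.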
stats.grok.se is advertised as being maintained by User:Henrik https://en.wikipedia.org/wiki/User:Henrik, who has been inactive since 2014. I have my doubts that any requests to update the tool will get anywhere.
Anyway, I am very pleased to see there is more interest in backfilling historical data into AQS. Access to this data is by far the most common request I've gotten in working on the Pageviews Analysis tools. I, too, am happy to help with this effort in any way! :)
~Leon
On Fri, Jan 20, 2017 at 3:27 AM, Alex Druk alex.druk@gmail.com wrote:
It seems to me that the decline in terrorism-related pageviews started long before the Snowden revelations (2011-2012?). See the enclosed graph from wikipediatrends.com. [image: Inline image 1]
I wrote a small R application that helps with working with the stats.grok.se API. It works with every language (unlike wikipediatrends.com).
[1] https://avnerkantor.shinyapps.io/WikipediaTrends/ [2] https://github.com/avnerkantor/WikipediaTrendsR/
Avner
On Fri, Jan 20, 2017 at 12:17 PM, Leon Ziemba musikanimal@wikimedia.org wrote:
stats.grok.se is advertised as being maintained by User:Henrik https://en.wikipedia.org/wiki/User:Henrik, who has been inactive since 2014. I have my doubts that any requests to update the tool will get anywhere.
Anyway, I am very pleased to see there is more interest in backfilling historical data into AQS. Access to this data is by far the most common request I've gotten in working on the Pageviews Analysis tools. I, too, am happy to help with this effort in any way! :)
~Leon