Hi all,
If you use Hive on stat1002/1004, you might have seen a deprecation
warning when you launch the hive client, saying that it is being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper script
<https://github.com/wikimedia/operations-puppet/blob/production/modules/role…>
set up to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Beeline.
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
<http://phabricator.wikimedia.org/tag/analytics>.
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Best,
--Madhu :)
Curious, what percentage of digital assistants (Alexa, Siri, Cortana,
Google) cite Wikipedia when a person asks a question?
Does the current Wikipedia mobile app support voice search?
Are there any reports on this? Thanks in advance!
Sincere regards,
Stella
--
Stella Yu | STELLARESULTS | 415 690 7827
"Chronicling heritage brands and legendary people."
Hi everyone,
We are excited to announce that the 5th annual Wiki Workshop [1] will
take place in Lyon on April 24, 2018, as part of The Web Conference
2018 (a.k.a. WWW2018) [2].
You can access the call for papers at
http://wikiworkshop.org/2018/#call . Please submit your ongoing or
completed research related to Wikimedia projects to the workshop. Note
that 2018-01-28 is the submission deadline if you want your paper to
appear in the proceedings, and 2018-03-11 is the deadline for all other
papers. [3]
Following the past year's model, the workshop will have a set of
invited talks (Jon Kleinberg and Markus Kroetzsch have already
accepted our invitation [4] \o/), a poster session, and more.
Questions and comments are welcome. Otherwise, we're looking forward
to receiving your submissions and seeing you in Lyon in April. :)
Best,
Leila, on behalf of the organizers [5]
[1] http://wikiworkshop.org/2018/
[2] https://www2018.thewebconf.org/
[3] http://wikiworkshop.org/2018/#dates
[4] http://wikiworkshop.org/2018/#speakers
[5] http://wikiworkshop.org/2018/#organization
--
Leila Zia
Senior Research Scientist
Wikimedia Foundation
We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia:
http://dx.doi.org/10.6084/m9.figshare.1305770
This dataset contains counts of (referer, article) pairs aggregated from the HTTP request logs of English Wikipedia. This snapshot captures 22 million (referer, article) pairs from a total of 4 billion requests collected during the month of January 2015.
This data can be used for various purposes:
• determining the most frequent links people click on for a given article
• determining the most common links people followed to an article
• determining how much of the total traffic to an article clicked on a link in that article
• generating a Markov chain over English Wikipedia
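As a rough illustration of the last two uses, here is a minimal Python sketch that turns (referer, article, count) rows into first-order transition probabilities and lists the most-clicked links for a referer. The sample rows, the function names, and the three-field layout are assumptions made for this example only; check the dataset description for the actual column names and file format. Note that normalizing by the counts present in the data ignores requests that did not lead to another article.

```python
from collections import defaultdict

# Hypothetical sample rows in (referer, article, count) form.
# The real dataset's columns and values will differ.
rows = [
    ("London", "England", 60),
    ("London", "River_Thames", 40),
    ("England", "London", 30),
    ("England", "United_Kingdom", 70),
]


def transition_probabilities(pairs):
    """Estimate P(article | referer) by normalizing counts per referer."""
    totals = defaultdict(int)
    for referer, _article, count in pairs:
        totals[referer] += count

    chain = defaultdict(dict)
    for referer, article, count in pairs:
        chain[referer][article] = count / totals[referer]
    return dict(chain)


def top_links(pairs, referer, k=3):
    """Return the k most frequently followed (article, count) links from a referer."""
    outgoing = [(article, count) for ref, article, count in pairs if ref == referer]
    return sorted(outgoing, key=lambda item: item[1], reverse=True)[:k]


chain = transition_probabilities(rows)
print(chain["London"])
print(top_links(rows, "England", k=1))
```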
We created a page on Meta for feedback and discussion about this release: https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream
Ellery and Dario
Dear Users,
I thank you for your efforts. The Call for Proposals for WikiIndaba 2018, the 3rd African Wikimedia community conference, is now open. If you want to participate and share your experience, tools, skills, knowledge, opinions or ideas with an audience of mostly active African Wikimedians, please submit your proposals at https://meta.m.wikimedia.org/wiki/WikiIndaba_conference_2018/Submissions. The deadline is January 15th.
Yours Sincerely,
Houcemeddine Turki
Dear Users,
I have the great honour to inform you that the Call for Proposals for WikiIndaba 2018 is now open. WikiIndaba 2018 is the 3rd conference of the African Wikimedia movement and will give participants the opportunity to share their Wikimedia-related experience and skills with a wide and active African Wikimedia audience. The conference will be held in Tunisia from 16 to 18 March 2018. If you want to participate in WikiIndaba and share your work and thoughts with African Wikimedians, feel free to submit your proposal at https://meta.m.wikimedia.org/wiki/WikiIndaba_conference_2018/Submissions. The deadline for proposals is January 15th, 2018.
If you need a scholarship to attend WikiIndaba 2018, you can apply for it at https://docs.google.com/forms/d/e/1FAIpQLSdJJ2I0FBqp4SuiW5ypj-9lnLaAidUmhMs….
Looking forward to seeing you in Tunis next March.
Yours Sincerely,
Houcemeddine Turki
Felix Nartey
Isla Haddow-Flood
Cross-posting from analytics – very excited about this announcement.
Congrats on the launch!
---------- Forwarded message ----------
From: Nuria Ruiz <nuria(a)wikimedia.org>
Date: Wed, Dec 13, 2017 at 8:25 PM
Subject: [Analytics] Wikistats gets a facelift - Alpha Launch of Wikistats 2
To: "A mailing list for the Analytics Team at WMF and everybody who has an
interest in Wikipedia and analytics." <analytics(a)lists.wikimedia.org>
Hello from Analytics Team!
We are happy to announce the Alpha release of Wikistats 2. Wikistats has
been redesigned for architectural simplicity, faster data processing, and a
more dynamic and interactive user experience. The first goal is to match the
numbers of the current system, and to provide the most important reports,
as decided by the Wikistats community (see survey) [1]. Over time, we will
continue to migrate reports and add new ones that you find useful. We can
also analyze the data in new and interesting ways, and look forward to
hearing your feedback and suggestions. [2]
You can go directly to Spanish Wikipedia
https://stats.wikimedia.org/v2/#/es.wikipedia.org
or browse all projects
https://stats.wikimedia.org/v2/#/all-projects
The new site comes with a whole new set of APIs, similar to our existing
Pageview API but with edit data. You can start using them today; they are
documented here:
https://wikitech.wikimedia.org/wiki/Analytics/AQS/Wikistats
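For readers who have not used the existing Pageview API, here is a small Python sketch of how a per-article request URL is put together. It only builds the URL and makes no network call. The helper name and its defaults are assumptions for illustration; the new edit-data endpoints are not shown here, so please refer to the documentation linked above for their exact paths and parameters.

```python
from urllib.parse import quote

# Base path of the existing Pageview API per-article endpoint.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/per-article"


def per_article_url(project, article, start, end,
                    access="all-access", agent="all-agents",
                    granularity="daily"):
    """Build a Pageview API URL; start and end are YYYYMMDD strings.

    This helper is hypothetical and exists only for this example.
    """
    # Titles use underscores instead of spaces and must be percent-encoded.
    title = quote(article.replace(" ", "_"), safe="")
    return "/".join([BASE, project, access, agent, title, granularity, start, end])


print(per_article_url("es.wikipedia", "Wikipedia", "20171201", "20171213"))
```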
FAQ:
Why is this an alpha?
There are features that we feel a full-fledged product should have that are
still missing, such as localization. The data-processing pipeline for the
new Wikistats has been rebuilt from scratch (it uses distributed-computing
tools such as Hadoop) and we want to see how it is used before calling it
final. Also, while we aim to update data monthly, it will happen a few days
after the month rolls over because of the amount of data to move and compute.
How about comparing data between two wikis?
You can do it with two tabs but we are aware this UI might not solve all
use cases for the most advanced Wikistats users. We aim to tackle those in
the future.
How do I file bugs?
Use the handy link in the footer:
https://phabricator.wikimedia.org/maniphest/task/edit/?title=Wikistats%20Bug&projectPHIDs=Analytics-Wikistats,Analytics
How do I comment on design?
The consultation on design already happened but we are still watching the
talk page:
https://www.mediawiki.org/wiki/Wikistats_2.0_Design_Project/RequestforFeedback/Round2
[1] https://www.mediawiki.org/wiki/Analytics/Wikistats/DumpReports/Future_per_report
[2] https://wikitech.wikimedia.org/wiki/Talk:Analytics/Systems/Wikistats
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
Dario Taraborelli
Director, Head of Research, Wikimedia Foundation
wikimediafoundation.org • nitens.org • @readermeter
<http://twitter.com/readermeter>
Apologies, a clarification about the time:
This will be at 11:30 AM (PST), 19:30 UTC.
On Mon, Dec 11, 2017 at 4:21 PM, Lani Goto <lgoto(a)wikimedia.org> wrote:
> Hi Everyone,
>
> The next Research Showcase will be live-streamed this Wednesday, December
> 13, 2017 at 11:15 AM (PST) 18:15 UTC.
>
> YouTube stream: https://www.youtube.com/watch?v=OoVwus1Owtk
>
> As usual, you can join the conversation on IRC at #wikimedia-research.
> And, you can watch our past research showcases here.
>
> This month's presentation:
> *The State of the Article Expansion Recommendation System*
> By Leila Zia
> Only 1% of English Wikipedia articles are labeled with quality class Good
> or better, and 37% of the articles are stubs. We are building an article
> expansion recommendation system to change this in Wikipedia, across many
> languages. In this presentation, I will talk with you about our current
> thinking of the vision and direction of the research that can help us build
> such a recommendation system, and share more about one specific area of
> research we have heavily focused on in the past months: building a
> recommendation system that can help editors identify what sections to add
> to an already existing article. I present some of the challenges we faced,
> the methods we devised or used to overcome them, and the result of the
> first line of experiments on the quality of such recommendations (teaser:
> the results are really promising. The precision and recall at 10 is 80%.)
>
>
> --
> Lani Goto
> Project Assistant, Engineering Admin
>
--
Lani Goto
Project Assistant, Engineering Admin