Hi all,
For all Hive users using stat1002/1004, you might have seen a deprecation
warning when you launch the hive client - that claims it's being replaced
with Beeline. The Beeline shell has always been available to use, but it
required supplying a database connection string every time, which was
pretty annoying. We now have a wrapper
<https://github.com/wikimedia/operations-puppet/blob/production/modules/role…>
script
setup to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual, and launching `beeline`.
There is some documentation on this here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Beeline.
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
<http://phabricator.wikimedia.org/tag/analytics>.
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Best,
--Madhu :)
Curious, what percentage of digital assistants (Alexa, Siri, Cortana,
Google) cite Wikipedia when a person asks a question?
Does the current Wikipedia mobile app support voice search?
Are there any reports on this? Thanks in advance!
Sincere regards,
Stella
--
Stella Yu | STELLARESULTS | 415 690 7827
"Chronicling heritage brands and legendary people."
We’re glad to announce the release of an aggregate clickstream dataset extracted from English Wikipedia
http://dx.doi.org/10.6084/m9.figshare.1305770 <http://dx.doi.org/10.6084/m9.figshare.1305770>
This dataset contains counts of (referer, article) pairs aggregated from the HTTP request logs of English Wikipedia. This snapshot captures 22 million (referer, article) pairs from a total of 4 billion requests collected during the month of January 2015.
This data can be used for various purposes:
• determining the most frequent links people click on for a given article
• determining the most common links people followed to an article
• determining how much of the total traffic to an article clicked on a link in that article
• generating a Markov chain over English Wikipedia
We created a page on Meta for feedback and discussion about this release: https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream <https://meta.wikimedia.org/wiki/Research_talk:Wikipedia_clickstream>
Ellery and Dario
Dear colleagues,
We are very excited to announce that Northwestern University and
Northwestern Institute on Complex Systems (NICO
<https://www.nico.northwestern.edu/>) is once again hosting the annual
International
Conference on Computational Social Science from July 12-15, 2018
<http://www.kellogg.northwestern.edu/news-events/conference/ic2s2/2018.aspx>.
Our Call for Abstracts for presentations has just opened, and our chairs
and committee members thought members of your group might be interested.
Below is a link with further information; feel free to forward to others if
interested as well.
https://www.kellogg.northwestern.edu/faculty/uzzi/IC2S2/IC2S
22018-CallForAbstracts.pdf
Sincerely,
Yasmeen Khan
Business Coordinator, NICO
Hi,
This conference could be interesting to some Wiki(p|m)edia researchers on
this list, in particular the session on "data science and machine learning
for development and humanitarian action":
---------------------------
SE04-HUM: “Data science and machine learning for development and
humanitarian action”
Session leader: Robert West, DLAB, EPFL
It is widely anticipated that data science in general and machine learning
in particular, will revolutionize our society as a whole. Due to ever
larger and more fine-grained data sets, as well as advances in computing
hardware and learning algorithms, we are bound to see a whole new world of
opportunities to bring about ground-breaking changes, which could expedite
the development of low- and middle-income countries. This session will look
into promising applications of data science in development and humanitarian
action.
---------------------------
Extended abstract are due on January 5. Please let me know if you have any
questions.
Bob
---------- Forwarded message ----------
From: EPFL Tech4Dev <tech4dev(a)epfl.ch>
Date: Tue, Nov 28, 2017 at 2:12 PM
Subject: TR: (Please Disseminate Through your Networks) Second Call for
Papers (Extended Abstracts): 5th International Conference of the UNESCO
Chair in Technologies for Development,Tech4Dev 2018. 27-29 June 2018,
SwissTech Convention Center, EPFL, Lausanne.
To: Robert West <robert.west(a)epfl.ch>
*Dear Colleagues, *
*Are you interested in the development of innovative technology solutions
to advance inclusive social and economic development in the Global South?*
*The Second call for Papers (Extended Abstracts)* for the *5th
International Conference of the UNESCO Chair in Technologies for
Development* has been officially launched.
*Tech4Dev 2018,* gives you an opportunity to:
Ø Present your research at a unique multidisciplinary Conference focused
on innovative technology for social impact in the Global South.
Ø Network across disciplines and fields of technology, to promote the
development, deployment, adaptation, and scaling of new solutions for the
Global South.
Ø Identify opportunities for collaboration with diverse stakeholders –
academics, students, engineers, entrepreneurs, policymakers, practitioners,
and social scientists- interested in technological innovation in the Global
South.
Ø Participate in the fabulous social event of the conference that will
take place in the Lavaux Vineyards, a UNESCO World Heritage Site.
Ø Build capacity among students and young professionals to engage in
multidisciplinary problem solving for social impact.
*Tech4Dev 2018* invites researchers, students, practitioners, industry or
anyone interested in critical issues in Technologies for Development to
submit proposals for Papers (Extended Abstracts).
<https://cooperation.epfl.ch/2016Tech4Dev/Call2/AbstractGuidelines_1>
Submissions
should emphasize the value of technological innovation while also
acknowledging the limits of technology in generating inclusive social and
economic development.
*Core Thematic Areas:*
• Technologies for *Humanitarian Action*
• *Medical Technologies*
• Science and Technology for *Disaster Risk Reduction*
• Technologies for *Sustainable Access to Energy*
• *ICT for Development*
• Technologies for *Sustainable Habitat and Cities*
*Crosscutting Themes: *
- Strengthening the research-policy nexus in the *implementation of the
SDGs*
- Opportunities and Challenges in *Quality (Rigorous) Impact
Evaluations:* Lessons from the academia and the field
- Development Engineering: *Training Global Engineers*
- *Open science:* an opportunity for the global south?
- Heart Money - the role of venture capitalism in *enabling social
outcomes*
- *Blockchain and the BoP*: a disruptive technology for economic
inclusion?
- *Development Engineering* in the Private Sector
- Building bridges *among global high-tech hubs in the African context*
Proposals for * Papers (Extended Abstracts)* should be submitted through
the conference’s online submission platform
<https://www.conftool.com/Tech4Dev2018>, using the prescribed template
<https://cooperation.epfl.ch/2016Tech4Dev/Call2/AbstractGuidelines_1#Submiss…>
no later than *5 January 2018. Papers (Extended Abstracts) should be
oriented towards the Conference’s Breakout Sessions. You can find the list
of Sessions on the Second Call for Papers Document (Attached to the
present Email). *
Further information, templates and material can be found on the conference
websitehttps://cooperation.epfl.ch/Tech4Dev2018
We would be grateful if you could also help us disseminate the attached
material to all your contacts, networks, etc.
If you could include a link to our website https://cooperation.epfl.ch/Te
ch4Dev2018 that would be much appreciated.
We are looking forward to seeing you in *Lausanne in June 2018!*
With Kind Regards,
*Alfredo Kägi*
UNESCO Chair Conference Coordinator
__________________________________________
Cooperation and Development Center
UNESCO Chair in Technologies for Development
EPFL-ENT-CODEV
Te: +41 (0)21 6935053 <+41%2021%20693%2050%2053>
Email: Tech4Dev(a)epfl.ch
CM 2 - Station 10,
CH-1015 Lausanne, Bureau CM2200
★ Visit our 2018 Conference on Technologies for Develpment website
<http://cooperation.epfl.ch/op/edit/2018Tech4Dev_1>
Interesting and timely CFP... -J
---------- Forwarded message ----------
From: Tiziana Catarci, ACM JDIQ Editor-in-Chief <pubs(a)acm.org>
Date: Fri, Nov 17, 2017 at 8:00 AM
Subject: JDIQ Call for Papers: Special issue on Combating Digital
Misinformation and Disinformation
To: jmorgan(a)wikimedia.org
ACM Journal of Data
and Information Quality
*Special issue on Combating Digital Misinformation and Disinformation*
Guest Editors
Naeemul Hassan, University of Mississippi
Chengkai Li, University of Texas at Arlington
Jun Yang, Duke University
Cong Yu, Google Research
------------------------------
*Context*
Spread of misinformation and disinformation is one of the most serious
challenges facing the news industry, and a threat to democratic societies
worldwide. The problem has reached an unprecedented level via social media,
where contents can be created and disseminated to a large audience with
little to zero cost, and revenues are driven by click-through rates.
Researchers from multiple disciplines have proposed various strategies,
built automated and semi-automated systems, and recommended policy changes
across the media ecosystem. Recently, researchers have also explored how
artificial intelligence techniques, particularly machine learning and
natural language processing, can be leveraged to combat falsehoods online.
In this special issue of JDIQ, we aspire to provide an overview of
innovative research primarily at the intersection of information
credibility, machine learning, and data science, from theory to practice,
with a focus on combating misinformation and disinformation.
*Topics*
Specific topics within the scope of the call include, but are not limited
to, the following:
- Automated question-answering for fact-checking
- Crowdsourced fact-checking
- Data collection, labeling and extraction for fact-checking
- Detection of fake-news spreading social bots
- Knowledge bases for fact-checking
- Models and methods for tracking the propagation and derivation of
online data
- Multi-modal deception detection
- Natural language processing approaches to fact checking
- Role of AI agents in fake news propagation
- Role of metadata and provenance management in assessing veracity of
online information
- Semantic parsing and verification of fake news
- Sustainable fact-checking framework
- Techniques to detect and limit misinformation and disinformation in
social media
- Truth discovery from structured and unstructured data
*Expected contributions:*
We welcome two types of contributions:
- Research manuscripts reporting mature results (up to 25 pages)
- Experience papers that report on lessons learned from addressing
specific issues within the scope of the call. These papers should be of
interest to the broad data quality community. (12+ pages plus an optional
appendix)
*Important dates and timeline:*
Initial submission: April 1, 2018
First review: July 1, 2018
Revised manuscripts: September 1, 2018
Second review: November 1, 2018
Camera-ready manuscripts: January 10, 2019
Publication: April 1, 2019
For further information and author instructions please visit jdiq.acm.org
<https://orange.hosting.lsoft.com/trk/click?ref=znwrbbrs9_6-1808ax3137cfx096…>,
or contact Paolo Missier <paolo.missier(a)newcastle.ac.uk> or Naeemul Hassan
<nhassan(a)olemiss.edu>.
------------------------------
UNSUBSCRIBE
<https://optout.acm.org/unsubscribe.cfm?re=jmorgan@wikimedia.org&rl=CFP> to
stop receiving emails about publishing in ACM journals.
Association for Computing Machinery, Two Penn Plaza, Suite 701, New York,
NY 10121-0701, USA
Copyright 2017, ACM, Inc.
--
Jonathan T. Morgan
Senior Design Researcher
Wikimedia Foundation
User:Jmorgan (WMF) <https://meta.wikimedia.org/wiki/User:Jmorgan_(WMF)>
Hello list!
I have written a short piece on how online communities are using
algorithmic tools to address issues of gender inequality, harassment, and
more in general how to create more inclusive environments online. And since
I know this is a topic has been discussed before here, I thought some of
you may be interested in reading it:
https://theconversation.com/can-online-gaming-ditch-its-sexist-ways-74493
The main topic is the online gaming platform Twitch, which is sadly in the
news for yet another episode of harassment, but I mention Wikipedia and
some of the initiatives to create more inclusive spaces (e.g. Teahouse).
Any feedback is really appreciated. Cheers!
Giovanni
--
Giovanni Luca Ciampaglia <glciampagl(a)gmail.com> ∙ Assistant Research
Scientist
IU Network Science Institute <http://iuni.iu.edu/> ∙ glciampaglia.com
News [image: 🕫]*WWW 2018* ∙ Alternate track on Journalism, Misinformation,
and Fact Checking:
https://www2018.thewebconf.org/call-for-papers/misinformation-cfp/
Possibly of interest to the wiki world!
If you have questions, just let me know.
Bob
---------- Forwarded message ----------
From: EPFL Tech4Dev <tech4dev(a)epfl.ch>
Date: Tue, Nov 14, 2017 at 2:18 PM
Subject: Second Call for Papers (Extended Abstracts): 5th International
Conference of the UNESCO Chair in Technologies for Development,Tech4Dev
2018. 27-29 June 2018, SwissTech Convention Center, EPFL, Lausanne.
To: "caroline(a)spidercenter.org" <
IMCEAINVALID-caroline+40spidercenter+2Eorg(a)intranet.epfl.ch>, "
d.jimenez(a)cgiar.org" <d.jimenez(a)cgiar.org>, Mohajeri Pour Rayeni Nahid <
nahid.mohajeri(a)epfl.ch>, "michele.calvello(a)gmail.com" <
IMCEAINVALID-michele+2Ecalvello+40gmail+2Ecom(a)intranet.epfl.ch>, Scarioni
Beatrice <beatrice.scarioni(a)epfl.ch>, "alan(a)aptivate.org" <alan(a)aptivate.org>,
"karen.sudmeier-rieux(a)unil.ch" <karen.sudmeier-rieux(a)unil.ch>, "
albrecht.ehrensperger(a)cde.unibe.ch" <
IMCEAINVALID-albrecht+2Eehrensperger+40cde+2Eunibe+2Ech(a)intranet.epfl.ch>, "
fk_sud(a)rediffmail.com" <
IMCEAINVALID-fk+5Fsud+40rediffmail+2Ecom(a)intranet.epfl.ch>, "
abhishek.jain(a)ceew.in" <
IMCEAINVALID-abhishek+2Ejain+40ceew+2Ein(a)intranet.epfl.ch>, "
m.leahy(a)endeva.org" <IMCEAINVALID-m+2Eleahy+40endeva+2Eorg(a)intranet.epfl.ch>,
"f.mao(a)bham.ac.uk" <IMCEAINVALID-f+2Emao+40bham+2Eac+2Euk(a)intranet.epfl.ch>,
Tombesi Paolo <paolo.tombesi(a)epfl.ch>, "Giuseppe.Faldi(a)ulb.ac.be" <
Giuseppe.Faldi(a)ulb.ac.be>, "cargalon(a)gmail.com" <cargalon(a)gmail.com>, "
tmadon(a)berkeley.edu" <tmadon(a)berkeley.edu>, "reymound-yaw.buckman(a)airbus.com"
<reymound-yaw.buckman(a)airbus.com>, "s3196258(a)student.rmit.edu.au" <
s3196258(a)student.rmit.edu.au>, Robert West <robert.west(a)epfl.ch>, "
Michel.jaboyedoff(a)unil.ch" <Michel.jaboyedoff(a)unil.ch>, "
jennifer.mckay(a)unisa.edu.au" <jennifer.mckay(a)unisa.edu.au>,
"edgar(a)SPIDERcenter.org" <edgar(a)spidercenter.org>, Pedrazzini Yves <
yves.pedrazzini(a)epfl.ch>, "aniebrasjoseph(a)gmail.com" <
aniebrasjoseph(a)gmail.com>, Nettra PAN <nettra.pan(a)epfl.ch>, "
anna.scolobig(a)usys.ethz.ch" <anna.scolobig(a)usys.ethz.ch>
*Dear Colleagues, *
*Are you interested in the development of innovative technology solutions
to advance inclusive social and economic development in the Global South?*
*The Second call for Papers (Extended Abstracts)* for the *5th
International Conference of the UNESCO Chair in Technologies for
Development* has been officially launched.
*Tech4Dev 2018,* gives you an opportunity to:
Ø Present your research at a unique multidisciplinary Conference focused
on innovative technology for social impact in the Global South.
Ø Network across disciplines and fields of technology, to promote the
development, deployment, adaptation, and scaling of new solutions for the
Global South.
Ø Identify opportunities for collaboration with diverse stakeholders –
academics, students, engineers, entrepreneurs, policymakers, practitioners,
and social scientists- interested in technological innovation in the Global
South.
Ø Participate in the fabulous social event of the conference that will
take place in the Lavaux Vineyards, a UNESCO World Heritage Site.
Ø Build capacity among students and young professionals to engage in
multidisciplinary problem solving for social impact.
*Tech4Dev 2018* invites researchers, students, practitioners, industry or
anyone interested in critical issues in Technologies for Development to
submit proposals for Papers (Extended Abstracts).
<https://cooperation.epfl.ch/2016Tech4Dev/Call2/AbstractGuidelines_1>
Submissions should emphasize the value of technological innovation while
also acknowledging the limits of technology in generating inclusive social
and economic development.
*Core Thematic Areas:*
• Technologies for *Humanitarian Action*
• *Medical Technologies*
• Science and Technology for *Disaster Risk Reduction*
• Technologies for *Sustainable Access to Energy*
• *ICT for Development*
• Technologies for *Sustainable Habitat and Cities*
*Crosscutting Themes: *
- Strengthening the research-policy nexus in the *implementation of the
SDGs*
- Opportunities and Challenges in *Quality (Rigorous) Impact
Evaluations:* Lessons from the academia and the field
- Development Engineering: *Training Global Engineers*
- *Open science:* an opportunity for the global south?
- Heart Money - the role of venture capitalism in *enabling social
outcomes*
- *Blockchain and the BoP*: a disruptive technology for economic
inclusion?
- *Development Engineering* in the Private Sector
- Building bridges *among global high-tech hubs in the African context*
Proposals for * Papers (Extended Abstracts)* should be submitted through
the conference’s online submission platform
<https://www.conftool.com/Tech4Dev2018>, using the prescribed template
<https://cooperation.epfl.ch/2016Tech4Dev/Call2/AbstractGuidelines_1#Submiss…>
no later than *5 January 2018. Papers (Extended Abstracts) should be
oriented towards the Conference’s Breakout Sessions. You can find the list
of Sessions on the Second Call for Papers Document (Attached to the
present Email). *
Further information, templates and material can be found on the conference
website https://cooperation.epfl.ch/Tech4Dev2018.
We would be grateful if you could also help us disseminate the attached
material to all your contacts, networks, etc.
If you could include a link to our website https://cooperation.epfl.ch/
Tech4Dev2018 that would be much appreciated.
We are looking forward to seeing you in *Lausanne in June 2018!*
With Kind Regards,
*Alfredo Kägi*
UNESCO Chair Conference Coordinator
__________________________________________
Cooperation and Development Center
UNESCO Chair in Technologies for Development
EPFL-ENT-CODEV
Te: +41 (0)21 6935053 <+41%2021%20693%2050%2053>
Email: Tech4Dev(a)epfl.ch
CM 2 - Station 10,
CH-1015 Lausanne, Bureau CM2200
★ Visit our 2018 Conference on Technologies for Develpment website
<http://cooperation.epfl.ch/op/edit/2018Tech4Dev_1>
Hi Everyone,
The next Research Showcase will be live-streamed this Wednesday, November
15, 2017 at 11:30 AM (PST) 18:30 UTC.
YouTube stream: https://www.youtube.com/watch?v=nMENRAkeHnQ
As usual, you can join the conversation on IRC at #wikimedia-research. And,
you can watch our past research showcases here
<https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase#November_2017>.
This month's presentation:
Conversation Corpora, Emotional Robots, and Battles with BiasBy *Lucas
Dixon (Google/Jigsaw)*I'll talk about interesting experimental setups for
doing large-scale analysis of conversations in Wikipedia, and what it even
means to grapple with the concept of conversation when one is talking about
revisions on talk pages. I'll also describe challenges with having good
conversations at scale, some of the dreams one might have for AI in the
space, and I'll dig into measuring unintended bias in machine learning and
what one can do to make ML more inclusive. This talk will cover work from
the WikiDetox <https://meta.wikimedia.org/wiki/Research:Detox> project as
well as ongoing research on the nature and impact of harassment in
Wikipedia discussion spaces
<https://meta.wikimedia.org/wiki/Research:Study_of_harassment_and_its_impact> –
part of a collaboration between Jigsaw, Cornell University, and the
Wikimedia Foundation. The ML model training code, datasets, and the
supporting tooling developed as part of this project are openly available.
Many kind regards,
Sarah R. Rodlund
Senior Project Coordinator-Product & Technology, Wikimedia Foundation
srodlund(a)wikimedia.org