Hi all,
If you use Hive on stat1002/1004, you may have seen a deprecation warning
when you launch the hive client, saying that it is being replaced by
Beeline. The Beeline shell has always been available, but it required
supplying a database connection string every time, which was pretty
annoying. We now have a wrapper script
<https://github.com/wikimedia/operations-puppet/blob/production/modules/role…>
set up to make this easier. The old Hive CLI will continue to exist, but we
encourage moving over to Beeline. You can use it by logging into the
stat1002/1004 boxes as usual and launching `beeline`.
There is some documentation on this here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Beeline.
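For context on what the wrapper saves you from typing, here is a minimal
sketch of how a direct Beeline invocation is assembled. It is an
illustration only: the host name `hive-server.example.org`, the port `10000`
(HiveServer2's usual default), the database name and the helper
`beeline_command` are assumptions, not the values the wrapper script
actually uses. `-u`, `-n` and `-e` are Beeline's standard options for the
JDBC URL, the user name and an inline query.

```python
import shlex


def beeline_command(host, port=10000, database="default", user=None, query=None):
    """Build the argument list for a direct (unwrapped) beeline call.

    All defaults here are hypothetical placeholders, not cluster settings.
    """
    # This JDBC connection string is what had to be supplied every time.
    url = f"jdbc:hive2://{host}:{port}/{database}"
    args = ["beeline", "-u", url]
    if user:
        args += ["-n", user]   # -n: user name to connect as
    if query:
        args += ["-e", query]  # -e: run one query and exit
    return args


if __name__ == "__main__":
    cmd = beeline_command(
        "hive-server.example.org", user="alice", query="SHOW DATABASES;"
    )
    # Print a shell-safe version of the command instead of running it.
    print(shlex.join(cmd))
```

With the wrapper in place, none of this is needed: plain `beeline` is enough.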
If you run into any issues using this interface, please ping us on the
Analytics list or #wikimedia-analytics or file a bug on Phabricator
<http://phabricator.wikimedia.org/tag/analytics>.
(If you are wondering stat1004 whaaat - there should be an announcement
coming up about it soon!)
Best,
--Madhu :)
Hello Research, Mobile, and Design colleagues,
In case other people are interested who didn't attend the August Wikimedia
Activities Meeting, there was a design research presentation in the meeting
regarding personas of mobile Wikimedia users: https://youtu.be/yZPZmRQnkXU
On a related note, I would like to learn more about design research,
including about how design research interfaces with analytics and UX
design, and I would like to request that WMF have an office hour on this
topic.
Regards,
Pine
( https://meta.wikimedia.org/wiki/User:Pine )
More changes are coming for dumps, this time for Hungarian Wikipedia
(approximately 436,000 articles) and Arabic Wikipedia (approximately
595,000 articles).
Pine
( https://meta.wikimedia.org/wiki/User:Pine )
---------- Forwarded message ---------
From: Ariel Glenn WMF <ariel(a)wikimedia.org>
Date: Mon, Aug 20, 2018 at 10:27 AM
Subject: [Wikitech-l] huwiki, arwiki to be treated as 'big wikis' and run
parallel jobs
To: Wikipedia Xmldatadumps-l <Xmldatadumps-l(a)lists.wikimedia.org>,
Wikimedia developers <wikitech-l(a)lists.wikimedia.org>
Starting September 1, huwiki and arwiki, which both take several days to
complete the revision history content dumps, will be moved to the 'big
wikis' list, meaning that they will run jobs in parallel as do frwiki,
ptwiki and others now, for a speedup.
Please update your scripts accordingly. Thanks!
Task for this: https://phabricator.wikimedia.org/T202268
Ariel
_______________________________________________
Wikitech-l mailing list
Wikitech-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikitech-l
Hello everyone,
I am writing in the hope that some of you do not only do great research in
the Wikimedia spaces, but are also editors. Particularly, editors of
smaller Wikipedias (i.e. all Wikipedias but English, French, Dutch, German,
Spanish, Italian).
I am a PhD student at the University of Southampton, and we have been
working on supporting editors with automated text generation [1].
We would like to extend the research in this direction by conducting a
series of interviews with editors (either in person or via Skype) to
understand in more detail how we can support the community in the future.
The interviews should take about 30 minutes each and will take place at the
end of August and in September. More information can be found here:
https://github.com/luciekaffee/Announcements/blob/master/Interviews-Partici…
If you would be interested in participating, please let me know. If you
know an editor who might be interested, please connect us! It will
help us a great deal to understand how to support Wikipedia editors better,
particularly those of the smaller Wikipedias.
I am looking forward to hearing from you!
Thanks,
Lucie
[1] Mind the (Language) Gap: Generation of Multilingual Wikipedia Summaries
from Wikidata for ArticlePlaceholders, Kaffee, Elsahar, Vougiouklis et al.,
2018,
https://2018.eswc-conferences.org/wp-content/uploads/2018/02/ESWC2018_paper…
--
Lucie-Aimée Kaffee
Web and Internet Science Group
School of Electronics and Computer Science
University of Southampton
Applications are invited for a postdoctoral position in the EPFL Data
Science Lab in Switzerland, headed by Prof. Robert West. The ideal start is
early 2019. The position will be for a period of one to two years.
Apply here: https://dlab.epfl.ch/2018-08-17-postdoc-position/
If you'd like to spread the word, retweet
https://twitter.com/cervisiarius/status/1031484218868682752
or share
https://www.linkedin.com/pulse/postdoc-position-available-epfl-data-science…
Description
Research in the EPFL Data Science Lab aims to make sense of large amounts
of data. Frequently, the data we analyze is collected on the Web, e.g.,
using server logs, social media, wikis, online news, online games, etc. We
distill heaps of raw data into meaningful insights and useful applications
by developing and applying algorithms and techniques in the field of data
science, broadly construed, at the crossroads of social and information
network analysis, machine learning, computational social science, data
mining, and natural language processing.
The Lab was founded in late 2016 and is currently in a phase of rapid and
enthusiastic growth. Therefore, candidates must be highly motivated to
actively participate in building the Lab and shaping its future direction.
The successful candidate will be an engine, rather than a cog in the
machine. They will advance the visibility of the Lab by advancing their own
visibility. They will lead innovative research projects in a stimulating,
open, and international research environment consisting of many highly
talented and motivated students, embedded in a strong network of academic
and industrial collaborators.
Other benefits include a competitive salary (around CHF 82,000 p.a.), an
extremely well funded national research system, an office next to a
stunning lake and even more stunning mountains, and generous travel support
(in order to see something less stunning once in a while).
There are no teaching requirements (except the occasional stand-in when
Prof. West is traveling).
Time frame
The ideal start date is early 2019. There is, however, some flexibility.
The position will be for a period of one to two years.
Qualifications
Candidates should have completed, or be near completion of, a
PhD with a strong international publication record in areas such as (but
not limited to) machine learning, data mining, social network analysis,
network science, computational social science, natural language processing,
information retrieval, etc. The successful candidate will do innovative and
creative work in the space spanned by these bases, and the potential to do
so counts more than the exact coördinates in this space. Strong programming
skills are required.
About EPFL
EPFL ranks among the world’s top universities in the field of computer
science. It is located in Lausanne, Switzerland, a beautiful and vibrant
city in an Alpine setting on the shores of scenic Lake Geneva, in the very
heart of Europe. English is the main language spoken at EPFL, and Lausanne
is highly international, such that no French language skills are required.
How to apply
Please follow the instructions here:
https://dlab.epfl.ch/2018-08-17-postdoc-position/
Good morning/afternoon/evening everyone,
If you are an editor of the French, Italian or English Wikipedia, and you
are curious about how to contribute to technologies for improving
verifiability of Wikipedia articles, please read on—we need your help!
In the context of the Knowledge Integrity
<https://meta.wikimedia.org/wiki/Knowledge_Integrity> program, we (the
WMF Research team <http://research.wikimedia.org>) are studying ways to
flag unsourced statements needing a citation
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>
using machine learning, with the aim of identifying areas where adding
high-quality citations is particularly urgent or important. Following the
success of the first labeling campaign
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>,
we now need to collect additional, high-quality labeled data regarding
why sentences need citations.
You are invited to participate in a second annotation task
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>.
We used your input from the last experiment to generate a taxonomy of
reasons
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>
why editors add citations. With this taxonomy now embedded in the
interface, the annotation experience will be much faster and more fun.
If you are interested in participating, please go to
http://labels.wmflabs.org/ui/enwiki/ (replace enwiki with itwiki or frwiki
if you speak Italian or French), log in, and from 'Labeling Unsourced
Statements II', request one (or more) worksets. For each task in a
workset, the tool will show you an unsourced sentence in an article and ask
you to annotate it. You can then label the sentence as needing an inline
citation or not, and specify a reason for your choice from a drop-down
menu. If you can't respond, please select 'skip'. You can also
(optionally) add your name on this page
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>
to receive updates about future campaigns and results from this research.
If you have any questions or comments on this project, please let us know by
contacting miriam(a)wikimedia.org or leaving a message on the talk page of
the project
<https://meta.wikimedia.org/wiki/Research_talk:Identification_of_Unsourced_S…>.
Thank you for your time!
Miriam, Jonathan, and Dario
--
Jonathan T. Morgan
Senior Design Researcher
Wikimedia Foundation
User:Jmorgan (WMF) <https://meta.wikimedia.org/wiki/User:Jmorgan_(WMF)>
Four Open-Rank Tenure-Track Faculty Positions
Syracuse University School of Information Studies
Syracuse University's School of Information Studies (The iSchool, ischool.syr.edu) seeks scholars and leaders to fill four open-rank tenure-track faculty positions to start in Fall 2019. Successful candidates will have a productive program of research in an information-related field and be able to contribute to the development of students and courses in our degree programs in information management and technology, data science and data analytics, library and information science (including school media) and information science and technology.
The successful candidates will join our “Faculty of One”: a highly collegial environment that stresses interdisciplinary collaboration amongst our school's faculty and with other members of the university community and beyond. Our research and teaching often adopt a socio-technical approach, recognizing that important problems are not simply technical nor just about people, but rather require both social and technological insights. We seek applicants whose topic areas and skills adopt this philosophy, and who can speak to overlapping areas within the school.
We are particularly seeking applications from researchers whose interests are located in one or more of the following scholarly areas:
* technical, behavioral and/or social approaches to address privacy and security for trustworthy cyberspace
* computational social science
* digital humanities
* big data approaches to exploring important organizational, scientific, social, economic, cultural or political questions
* information and knowledge management with big data
* community-focused librarianship in K-12, academic, special or public libraries
* information literacy, especially ways to increase users’ resilience to misinformation or to privacy and security attacks
* library services, such as youth or reference services
* information organization and retrieval
* human-computer interaction (HCI), user experience and/or user behaviour
* design and evaluation of interactive, social, ubiquitous and/or other emerging computing systems
* designing for marginalized populations
* ethical and policy implications of digital technologies and design
We specifically seek applications from women and from members of groups traditionally underrepresented among scholars in higher education. We are interested in candidates who have the communication skills and cross-cultural abilities to be effective with diverse groups of students, colleagues and community members. Experience mentoring students from marginalized groups is particularly valued.
Rank and experience level of these positions are open: we encourage applications from both junior and senior scholars with a record of achievement appropriate to the rank sought at time of application. A completed Ph.D. in a relevant field of study or the expectation of completion of the Ph.D. by August 2019 is required. The School is committed to professional development for junior faculty, and provides excellent mentoring and support.
Application process
Applications—including 1) a cover letter outlining the applicant’s interests and qualifications and including the rank sought; 2) a current curriculum vitae; 3) short statements describing interests and accomplishments in research and in teaching; and 4) names and contact information of at least three references—can be submitted at https://www.sujobopps.com/postings/76240.
All applications will be held in strict confidence; we will seek references only from finalists. We are pleased to speak with interested applicants before they submit materials.
We will begin screening applicants in October 2018 and continue until the positions are filled, so applications should be received by 7 October 2018 to ensure full consideration. Direct questions to Dr. Kevin Crowston, search committee chair, crowston(a)syr.edu.
About the iSchool at Syracuse University
Located at the center of picturesque Syracuse University, the iSchool prides itself on being a thought leader in both scholarship and instruction. Our faculty have recognized strengths in information retrieval, information management, library programs and services, natural language processing, computational social science, online communities and civic participation, new forms of organization and collaboration, information and communications policy, smart energy systems, digital literacy, information privacy and security, globalization, data science, entrepreneurship, social media, social computing and other areas.
The iSchool has five degree programs and numerous certificate programs, with an enrollment of 31 doctoral students, 873 masters students and 685 undergraduate majors, led by 44 full-time faculty and more than 100 part-time faculty. The iSchool is ranked #4 overall by US News and World Report for library and information science and #2 for information systems. Faculty teach in the classroom and/or prepare and oversee delivery of online courses (with a typical allocation of two courses per semester), and mentor and advise undergraduate, masters and doctoral students.
iSchool faculty members received more than $5M in external research support in the past year. The iSchool hosts seven research centers and laboratories and is recognized as a National Center of Academic Excellence (CAE) in Research and in Information Assurance/Cyber Defense (IA/CD) by the National Security Agency and the Department of Homeland Security.
Kevin Crowston
Associate Dean for Research, Distinguished Professor of Information Science
School of Information Studies
+1 (315) 443.1676
crowston(a)syr.edu
348 Hinds Hall, Syracuse, NY 13244
crowston.syr.edu <http://crowston.syr.edu/>
Syracuse University
Most recent publication: Lee, T. K., Crowston, K., Harandi, M., Østerlund, C. & Miller, G. (2018). Which motives are most effective in recruiting citizen scientists? Results of a field experiment. Journal of Science Communication. doi: 10.22323/2.17010202.
Check out our new research coordination network on Work in the Age of Intelligent Machine: http://waim.network/
Dear Mr. or Ms.,
I thank you for your efforts. We are beginning a new project to enrich Wikidata with large-scale information about the risk factors of diseases. If you are interested in the project, you can join a Skype discussion in two hours, at 3 PM UTC, about the project and the PubMed-based method we will use to enrich Wikidata automatically. My account is csisc1994. So that we know whether there will be participants in this second RiskData meeting, please confirm your participation by replying to this email.
Yours Sincerely,
Houcemeddine Turki
Dear Mr. or Ms.,
I thank you for your efforts. Due to the lack of participants, the Skype meeting has been postponed to 3 PM GMT http://www.timebie.com/std/gmt.php?q=15. The meeting is about converting Wikidata into a large-scale database of risk factors of diseases. You can learn about our method of enriching Wikidata with medical knowledge from PubMed. My account is csisc1994.
Yours Sincerely,
Houcemeddine Turki
Dear Mr. or Ms.,
I thank you for your answer, and for your interest in my research work. Those who cannot attend the meeting at 11 AM GMT can attend a second meeting at 3 PM GMT. If you would like to participate in the second meeting, please let me know and I will send you my username.
Yours Sincerely,
Houcemeddine Turki
________________________________
From: Wikimedia-Medicine <wikimedia-medicine-bounces(a)lists.wikimedia.org> on behalf of Scott MacLeod <sgkmacleod(a)worlduniversityandschool.org>
Sent: Friday, 17 August 2018 18:12
To: Wiki Medicine discussion
Cc: Wikidata technical discussion; Discussion list for the Wikidata project.; wiki-research-l(a)lists.wikimedia.org
Subject: Re: [Wiki-Medicine] Invitation to discuss RiskData:, Wikidata as a high-scale database of risk factors for human diseases
Hi Nancy, Houcemeddine, Wiki Medicine and Wikimedians,
Thanks for your emails.
I thought that you had written 11am PDT and not 11am GMT on Saturday, August 18th, Houcemeddine, so unfortunately I can NOT make this time (http://www.thetimezoneconverter.com/). Please keep me posted if you decide to meet again at a different time. Thank you.
Sincerely, Scott
Scott_WUaS
Dear Mr. Houcemeddine Turki, WikiMedicine, Wikidatans (and Wikimedians),
I would like to participate in this Skype conference (also with regard to WUaS's planned online medical schools, with online teaching hospitals for online clinical care, planned in the official and main languages of each of ~200 countries). Please send me your Skype username off-list. Thank you, Houcemeddine.
...
Here's the beginning of the online Medical School at WUaS in English - https://wiki.worlduniversityandschool.org/wiki/World_University_Medical_Sch… - which will eventually connect with Wikidata as its "backend". WUaS also seeks to emerge out of Stanford Medicine and with OpenCourseWare in multiple languages for our medical schools.
Sincerely, Scott
Scott_WUaS
On Fri, Aug 17, 2018 at 9:56 AM, Nancy Gertrudiz <nancy.gertrudiz(a)gmail.com> wrote:
I am interested. My Skype username is ngertrudiz.
Just to confirm: tomorrow, 17 Aug, 11 AM GMT?
Best,
2018-08-17 10:26 GMT-05:00 Houcemeddine A. Turki <turkiabdelwaheb(a)hotmail.fr>:
Dear Mr. or Ms.,
I thank you for your efforts. As the risk factor property was added to Wikidata last night, we will use it to enrich Wikidata with the risk factors of diseases using an automatic method of bibliometric-enhanced retrieval of biomedical relations. As we would like to have your opinions about this automatic method, we invite you to a Skype discussion tomorrow at 11 AM (GMT). If you would like to participate in this meeting, please reply to this email to confirm your participation, and I will send you my Skype username.
Yours Sincerely,
Houcemeddine Turki
_______________________________________________
Wikimedia-Medicine mailing list
Wikimedia-Medicine(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikimedia-medicine
--
Nancy Gertrudiz
mobile (+52) 5554192839
t: @ngertrudiz
--
- Scott MacLeod - Founder, President & Professor
- World University and School
- http://worlduniversityandschool.org
- 415 480 4577
- http://scottmacleod.com
- CC World University and School - like CC Wikipedia with best STEM-centric CC OpenCourseWare - incorporated as a nonprofit university and school in California, and is a U.S. 501 (c) (3) tax-exempt educational organization.