Hello,
I am finishing my PhD and I think that you could be interested in my latest
major work about the quality of Wikipedia:
https://www.linkedin.com/pulse/standardization-wikipedia-articles-according…
and in a future collaboration.
I would be very grateful for your feedback! Several publications are in
preparation. Let me know if you are interested in following this thread.
Have a nice week,
Ludovic BOCKEN
lbocken(a)gmail.com
www.ludovicbocken.com
Skype: ludovic.bocken
http://www.linkedin.com/in/ludovicbocken
2222 Rue Hochelaga,
Montréal, QC H2K 4N8
+1 (514) 649 0755
Hi everybody,
stat1005 was replaced almost a year ago by stat1007 to allow GPU research
and testing (https://phabricator.wikimedia.org/T148843). After a
long journey we are happy to add stat1005 back in the pool of available
Analytics client hosts. I have updated the documentation in:
https://wikitech.wikimedia.org/wiki/Stat1005
https://wikitech.wikimedia.org/wiki/Analytics/Data_access#Analytics_clients
The host is now a Hadoop client like stat1004, and also offers an AMD GPU (
https://wikitech.wikimedia.org/wiki/Analytics/Systems/Cluster/AMD_GPU). For
the moment we are limiting the access to the GPU to people that explicitly
need it, since it is still a testing environment and we'd like to give some
priority to people that already have projects relying on it. The end goal
is to give access to the GPU by default to everybody, so stay tuned. If you
wish to participate in the testing efforts, please reach out to the
Analytics team! (
https://wikitech.wikimedia.org/wiki/Analytics/Data_access#GPU_usage).
Last but not least, stat1005 is running Debian 10 (Buster) with openjdk-8
instead of the version shipped by Debian (openjdk-11), since the
Hadoop cluster is not ready to migrate yet. Everything seems to be running
fine in our tests, but please report to us anything that looks strange.
Thanks in advance!
Luca (on behalf of the Analytics team)
Dear all,
I thank you for your efforts. As requested by many interested scientists, I have put the full text of the work on Zenodo. It is currently available at https://zenodo.org/record/3461198.
Yours Sincerely,
Houcemeddine Turki (he/him)
Medical Student, Faculty of Medicine of Sfax, University of Sfax, Tunisia
GLAM and Education Coordinator, Wikimedia TN User Group
Member, Wiki Project Med
Member, WikiIndaba Steering Committee
Member, Wikimedia and Library User Group Steering Committee
____________________
+21629499418
-------- Original message --------
From: Houcemeddine Turki <turkiabdelwaheb(a)hotmail.fr>
Date: 2019/09/25 13:12 (GMT+01:00)
To: wiki-research-l(a)lists.wikimedia.org, Paladox via Wikitech-l <wikitech-l(a)lists.wikimedia.org>, wikidata(a)lists.wikimedia.org, wikimedia-medicine(a)lists.wikimedia.org
Subject: Our research paper about Wikidata and Health has been published
Dear all,
I thank you for your efforts. I am honoured to inform you that our latest research paper about Wikidata and Health has been published in the Journal of Biomedical Informatics (IF=2.9). The paper is available at https://doi.org/10.1016/j.jbi.2019.103292. This paper is the first in our series of research publications about the Medical Wikidata project. If you would like to have the paper, please contact me and I will send you the PDF. Our next papers will work on finding methods to improve the coverage and quality of medical information in Wikidata. We are quite sure that our work is important, as it will provide trustworthy reference medical information that can be used by physicians and computer programs to process medical data and enhance the efficiency of health care.
Yours Sincerely,
Houcemeddine Turki (he/him)
Medical Student, Faculty of Medicine of Sfax, University of Sfax, Tunisia
GLAM, Research and Education Coordinator, Wikimedia TN User Group
Member, Wiki Project Med
Member, WikiIndaba Steering Committee
Member, Wikimedia and Library User Group Steering Committee
____________________
+21629499418
The August 2019 issue of the Wikimedia Research Newsletter is out:
https://meta.wikimedia.org/wiki/Research:Newsletter/2019/August
22 recent publications about Wikipedia's gender gaps and potential gender biases, summarized and reviewed in the latest edition of our monthly research newsletter.
In this issue:
1 Female and nonwhite US sociologists less likely to have Wikipedia articles than scholars of similar citation impact
2 "Mapping and Bridging the Gender Gap: An Ethnographic Study of Indian Wikipedians and Their Motivations to Contribute"
3 "Investigating the Gender Pronoun Gap in Wikipedia"
4 Safety and women editors on Wikipedia
5 "Hacking History: Redressing Gender Inequities on Wikipedia Through an Editathon"
6 "'(Weitergeleitet von Journalistin)': The Gendered Presentation of Professions on Wikipedia"
7 "Striking result": No bias against contributions by female editors in quality assessment
8 Other recent publications
  8.1 "Unexpected forms of bias" on Wikipedia favor female over male CEOs
  8.2 English Wikipedia biased against conservative and female topics, at least when compared to US magazines
  8.3 "Gender and deletion on Wikipedia"
  8.4 "Deleted gender wars"
  8.5 "Gender gap through time and space: A journey through Wikipedia biographies via the Wikidata Human Gender Indicator"
  8.6 Special issue of the "Nordic journal for information science and dissemination of culture"
  8.7 "Breastfeeding, Authority, and Genre: Women's Ethos in Wikipedia and Blogs"
  8.8 "Cyberfeminism on Wikipedia: Visibility and deliberation in feminist Wikiprojects"
  8.9 "Women and Wikipedia. Diversifying Editors and Enhancing Content through Library Edit-a-Thons"
  8.10 "Similar Gaps, Different Origins? Women Readers and Editors at Greek Wikipedia"
  8.11 "Writing Women in Mathematics into Wikipedia"
  8.12 "How do students trust Wikipedia? An examination across genders"
  8.13 "Breaking the glass ceiling on Wikipedia"
  8.14 Using Wikipedia for "Analyzing Gender Stereotyping in Bollywood Movies"
  8.15 Contrary to expectations, "no evidence of discrimination of female users based on their usernames"
  8.16 Some informative non-research overview publications
*** 23 recent publications were covered or listed in this issue ***
Masssly and Tilman Bayer
---
Wikimedia Research Newsletter
https://meta.wikimedia.org/wiki/Research:Newsletter/
* Follow us on Twitter: @WikiResearch
* Like us on Facebook: Facebook.com/WikiResearch/
* Receive this newsletter by mail: Research-newsletter Mailing List - Wikimedia
* Subscribe to the RSS feed:
http://blog.wikimedia.org/c/research-2/wikimedia-research-newsletter/feed
Hi everyone,
We’re preparing for the September 2019 research newsletter and looking for contributors. Please take a look at https://etherpad.wikimedia.org/p/WRN201909 and add your name next to any paper you are interested in covering. Our target publication date is 30 September 11:59 UTC. As usual, short notes and one-paragraph reviews are most welcome.
Highlights from this month:
- DBpedia FlexiFusion: The Best of Wikipedia > Wikidata > Your Data
- Improving Neural Question Generation using World Knowledge
- Introduction to Neural Network based Approaches for Question Answering over Knowledge Graphs
- ORES: Lowering Barriers with Participatory Machine Learning in Wikipedia
- The Global Popularity of William Shakespeare in 303 Wikipedias
- The use of collaborative open-access publishing via Wikipedia in university education to embed digital citizenship skills
- Wikidata from a Research Perspective -- A Systematic Mapping Study of Wikidata
- Wikipedia as complementary formative assessment method in University Courses
Masssly and Tilman Bayer
[1] http://meta.wikimedia.org/wiki/Research:Newsletter
[2] https://twitter.com/WikiResearch
Dear Sir or Madam,
On behalf of Emilio Zagheni, I would like to draw your attention to several academic job openings at the Laboratory of Digital and Computational Demography at the Max Planck Institute for Demographic Research. Details of each vacancy and how to apply can be found on the respective pages:
For W2 Research Faculty Position (equivalent to Associate Professor) in the Lab of Digital and Computational Demography
https://www.demogr.mpg.de/en/education_career/jobs_fellowships_1910/w2_rese…
For the Post-Docs/Research Scientists:
https://www.demogr.mpg.de/en/education_career/jobs_fellowships_1910/postdoc…
For the PhD student positions:
https://www.demogr.mpg.de/en/education_career/jobs_fellowships_1910/phd_stu…
For Summer Research Visits:
https://www.demogr.mpg.de/en/education_career/jobs_fellowships_1910/summer_…
Please kindly distribute these vacancies among your fellow researchers and students at your institution.
Thanks so much!
With best wishes,
Antje Gosselck
Max Planck Institute for Demographic Research
Konrad-Zuse-Str. 1
D-18057 Rostock
Germany
http://www.demogr.mpg.de
mailto:gosselck@demogr.mpg.de
Tel. +49 (0) 381 / 2081 108
Fax +49 (0) 381 / 2081 408
Hello everyone,
The next Research Showcase will be live-streamed next Wednesday, September
18, at 9:30 AM PT/16:30 UTC. This will be the new time going forward for
Research Showcases in order to give more access to other timezones.
YouTube stream: https://www.youtube.com/watch?v=fDhAnHrkBks
As usual, you can join the conversation on IRC at #wikimedia-research. You
can also watch our past research showcases here:
https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
Citation Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's
Verifiability
By Miriam Redi, Research, Wikimedia Foundation
Among Wikipedia's core guiding principles, verifiability policies have a
particularly important role. Verifiability requires that information
included in a Wikipedia article be corroborated against reliable secondary
sources. Because of the manual labor needed to curate and fact-check
Wikipedia at scale, however, its contents do not always evenly comply with
these policies. Citations (i.e. references to external sources) may not
conform to verifiability requirements or may be missing altogether,
potentially weakening the reliability of specific topic areas of the free
encyclopedia. In this project
<https://meta.wikimedia.org/wiki/Research:Identification_of_Unsourced_Statem…>,
we aimed to provide an empirical characterization of the reasons why and
how Wikipedia cites external sources to comply with its own verifiability
guidelines. First, we constructed a taxonomy of reasons why inline
citations are required by collecting labeled data from editors of multiple
Wikipedia language editions. We then collected a large-scale crowdsourced
dataset of Wikipedia sentences annotated with categories derived from this
taxonomy. Finally, we designed and evaluated algorithmic models to
determine if a statement requires a citation, and to predict the citation
reason based on our taxonomy. We evaluated the robustness of such models
across different classes of Wikipedia articles of varying quality, as well
as on an additional dataset of claims annotated for fact-checking purposes.
Redi, M., Fetahu, B., Morgan, J., & Taraborelli, D. (2019, May). Citation
Needed: A Taxonomy and Algorithmic Assessment of Wikipedia's Verifiability.
In The World Wide Web Conference (pp. 1567-1578). ACM.
https://arxiv.org/abs/1902.11116
Patrolling on Wikipedia
By Jonathan T. Morgan, Research, Wikimedia Foundation
I will present initial findings from an ongoing research study
<https://meta.wikimedia.org/wiki/Research:Patrolling_on_Wikipedia> of
patrolling workflows on Wikimedia projects. Editors patrol recent pages and
edits to ensure that Wikimedia projects maintain high quality as new
content comes in. Patrollers revert vandalism and review newly-created
articles and article drafts. Patrolling of new pages and edits is vital
work. In addition to making sure that new content conforms to Wikipedia
project policies, patrollers are the first line of defense against
disinformation, copyright infringement, libel and slander, personal
threats, and other forms of vandalism on Wikimedia projects. This research
project is focused on understanding the needs, priorities, and workflows of
editors who patrol new content on Wikimedia projects. The findings of this
research can inform the development of better patrolling tools as well as
non-technological interventions intended to support patrollers and the
activity of patrolling.
--
Janna Layton (she, her)
Administrative Assistant - Product & Technology
Wikimedia Foundation <https://wikimediafoundation.org/>
Down with Python 2! I thought it was already dead!
On Fri, Sep 13, 2019 at 10:15 AM Miriam Redi <mredi(a)wikimedia.org> wrote:
> Thanks Luca!
>
> And yes Diego, that, plus casting iterables to lists :)
> Just in case it's useful:
> http://ptgmedia.pearsoncmg.com/imprint_downloads/informit/promotions/python…
>
> Best,
>
> M
>
> On Fri, Sep 13, 2019 at 3:55 PM Diego Saez-Trumper <diego(a)wikimedia.org>
> wrote:
>
>> Oh! We are getting old!
>> Thanks for the heads up Luca.
>>
>> I would say that 90% of the migration process is to change:
>> print 'x' to print('x')
>>
>> ;)
>> Best!
>>
>>
>> On Fri, Sep 13, 2019 at 11:35 AM Luca Toscano <ltoscano(a)wikimedia.org>
>> wrote:
>>
>> > Hi everybody,
>> >
>> > as https://www.python.org/doc/sunset-python-2/ says Python 2 is finally
>> > going EOL on January 1st. We (as Analytics team) have a lot of packages
>> > deployed on stat/notebook/hadoop hosts via puppet that should be removed,
>> > but before doing so we'd need to know if anybody of you is currently using
>> > a Python-2-only environment to work/research/test/etc... If so, please
>> > comment in the following task so we'll discuss your use case and possibly
>> > find a Python-3 solution: https://phabricator.wikimedia.org/T204737
>> > In the task we are going to add info about common packages that we know
>> > (keras, tensorflow, pytorch, etc..) to help you migrate to Python 3 as
>> > quickly and painlessly as possible, so if you are interested please
>> > subscribe to the task.
>> >
>> > Thanks in advance!
>> >
>> > Luca (on behalf of the Analytics team)
--
Aaron Halfaker
Principal Research Scientist
Head of the Scoring Platform team
Wikimedia Foundation
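
The two migration steps mentioned in this thread (print as a function, and
casting iterables to lists) might look roughly like this in Python 3. This is
only a minimal sketch: the function names squares and sorted_keys are made up
for illustration and do not come from any Analytics package or host setup.

```python
def squares(numbers):
    """Return the squares of `numbers` as a real list.

    In Python 3, map() returns a lazy iterator instead of a list, so the
    result is wrapped in list() wherever it is indexed, measured with
    len(), or iterated more than once.
    """
    return list(map(lambda n: n * n, numbers))


def sorted_keys(counts):
    """Return the keys of the dictionary `counts` as a sorted list.

    In Python 3, dict.keys() returns a view object that has no .sort()
    method, so the Python 2 idiom `keys = d.keys(); keys.sort()` fails.
    sorted() builds a new list from the view instead.
    """
    return sorted(counts.keys())


if __name__ == "__main__":
    # Python 2 statement form was: print 'x'
    print("x")
    print(squares(range(4)))
    print(sorted_keys({"b": 2, "a": 1}))
```

The same pattern applies to zip(), filter(), dict.values() and dict.items():
code that only loops over the result once usually needs no change, while code
that indexes or reuses the result needs the explicit list() call.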
Hi everybody,
As https://www.python.org/doc/sunset-python-2/ says, Python 2 is finally
going EOL on January 1st. We (the Analytics team) have a lot of packages
deployed on stat/notebook/hadoop hosts via puppet that should be removed,
but before doing so we need to know whether any of you is currently using
a Python-2-only environment to work, research, test, etc. If so, please
comment on the following task so that we can discuss your use case and
possibly find a Python 3 solution: https://phabricator.wikimedia.org/T204737
In the task we are going to add info about common packages that we know
(keras, tensorflow, pytorch, etc.) to help you migrate to Python 3 as
quickly and painlessly as possible, so if you are interested please
subscribe to the task.
Thanks in advance!
Luca (on behalf of the Analytics team)