Hi all,
The next Research Showcase, focused on Gender and Equity on Wikipedia, will
be live-streamed Wednesday, March 15, at 9:30 AM PST / 16:30 UTC. Find your
local time here <https://zonestamp.toolforge.org/1678897840>.
YouTube stream: https://www.youtube.com/watch?v=lw4MzJgDIzo
You can join the conversation on IRC at #wikimedia-research. You can also
watch our past research showcases here:
https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
This month's presentations:
Men Are elected, women are marriedː events gender bias on Wikipedia
By *Jiao Sun, University of Southern California*Human activities can be
seen as sequences of events, which are crucial to understanding societies.
Disproportional event distribution for different demographic groups can
manifest and amplify social stereotypes, and potentially jeopardize the
ability of members in some groups to pursue certain goals. In this paper,
we present the first event-centric study of gender biases in a Wikipedia
corpus. To facilitate the study, we curate a corpus of career and personal
life descriptions with demographic information consisting of 7,854
fragments from 10,412 celebrities. Then we detect events with a
state-of-the-art event detection model, calibrate the results using
strategically generated templates, and extract events that have asymmetric
associations with genders. Our study discovers that the Wikipedia pages
tend to intermingle personal life events with professional events for
females but not for males, which calls for the awareness of the Wikipedia
community to formalize guidelines and train the editors to mind the
implicit biases that contributors carry. Our work also lays the foundation
for future works on quantifying and discovering event biases at the corpus
level.
- Paperː Sun, J. & Peng, N. (2021). Men Are Elected, Women Are Married:
Events Gender Bias on Wikipedia. Proceedings of the 59th Annual Meeting of
the Association for Computational Linguistics and the 11th International
Conference on Natural Language Processing, 350-360.
<https://aclanthology.org/2021.acl-short.45.pdf>
Twitter reacts to absence of women on Wikipediaː a mixed-methods analysis
of #VisibleWikiWomen campaignBy *Sneh Gupta, Guru Gobind Singh Indraprastha
University*Digital gender divide (DGD) is visible in access, participation,
representation, and biases against women embedded in Wikipedia, the largest
digital reservoir of co-created content. This article examined the content
of #VisibleWikiWomen, a global digital advocacy campaign aimed at
encouraging inclusion of women voices in the global technology conversation
and improving digital sustainability of feminist data on Wikipedia. In a
mixed-methods study, Sentiment Analysis followed by a Feminist Critical
Discourse Analysis of the campaign tweets reveals how digital gender divide
manifested in the public response. An overwhelming majority of tweets
expressed positive sentiment towards the objective of the campaign. An
inductive reading of the coded tweets (n = 1067) generated five themes:
Feminist Activism, Invisibility & Marginalization of Women, Technology for
Women Empowerment, Gendered Knowledge Inequity, and Power Dynamics in the
Digital Sphere. Twitter discourse presented many agitated digital users
calling out the epistemic injustice on Wikipedia that goes beyond the
invisibility of women. Their tweets reveal that they want an equal social
platform inclusive of women of color and varied identities currently absent
in the Wikipedia universe. Extracting ideas, values, and themes from new
media campaigns holds unparalleled potential in the diffusion of
interventions and messages on a larger scale.
- Paperː Gupta, S., & Trehan, K. (2022). Twitter reacts to absence of
women on Wikipedia: a mixed-methods analysis of #VisibleWikiWomen campaign.
Media Asia, 49(2), 130-154.
<https://www.researchgate.net/publication/356909618_Twitter_reacts_to_absenc…>
Warm regards,
Emily
--
Emily Lescak (she / her)
Senior Research Community Officer
The Wikimedia Foundation
Hi all,
The Wikimedia Research Showcase [0] is almost ten years old and we're using
this upcoming birthday as an opportunity to step back and reflect on the
past, celebrate the contributions by our speakers and many of you who have
participated in the discussions, and plan for its future. We invite you to
complete this brief survey [1] to share with us your perspectives on the
Showcase's value, ideas for improvement, and potential topics and speakers
we can feature in the future. We anticipate that the survey will take less
than 10 minutes to complete. We appreciate your response by March 31, 2023.
Please review our privacy statement [2] prior to completing the survey.
Thank you for your time.
Sincerely,
Emily Lescak and Leila Zia
[0] https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
[1]
https://docs.google.com/forms/d/e/1FAIpQLSfdi3kZ9xLO5XX3HQx1AlCchf4Sd7Mchhl…
[2]
https://foundation.wikimedia.org/wiki/Legal:Research_Team_Event_Feedback_Su…
--
Emily Lescak (she / her)
Senior Research Community Officer
The Wikimedia Foundation
====
SEMANTiCS - 19th International Conference on Semantic Systems
Leipzig, Germany
Workshops and Tutorials
September 20 - 22, 2023
https://2023-eu.semantics.cc/page/cfp_ws
====
SEMANTiCS 2023 is a major venue for research and industrial innovation
and features a workshop and tutorial program addressing the diverse
practical interests of its audience. This program is intended to offer a
rich diversity of topics to conference attendees and local participants
seeking to pick up new skills and stay up-to-date regarding the latest
developments in the community. We encourage submissions of proposals on
all topics in the general areas of SEMANTiCS 2023 and proposals bridging
or introducing new perspectives in these areas. Workshops and tutorials
may incorporate panel discussions, lightning talks, meetings, networking
or hands-on sessions, hackathons and other practical formats where
applicable. Rooms for business or project meetings are available upon
request as well.
=Important Dates for Workshops=
* Proposals WS *Extended* Deadline: March 15, 2023 (11:59 pm, Hawaii time)
* Notification of Acceptance: March 22, 2023 (11:59 pm, Hawaii time)
=Important Dates for Tutorials (and other meetings, e.g. seminars,
show-cases, etc., without call for papers)=
* Proposals Tutorial Deadline: June 06, 2023 (11:59 pm, Hawaii time)
* Notification of Acceptance: June 20, 2023 (11:59 pm, Hawaii time)
Submission via Easychair on https://easychair.org/conferences/?conf=sem23
=Scope & Goals=
Workshops and tutorials at SEMANTiCS 2023 allow your organisation or
project to advance and promote your topics and gain increased
visibility. The workshops and tutorials will be announced on the
SEMANTiCS website and they will be seen by all participants. SEMANTiCS
2023 workshops and tutorials can be incubators for industrial and
scientific communities that form and share a particular research and
development agenda. They provide a forum for presenting contributions
and findings to a diverse and knowledgeable community.
Furthermore, the event can be used as a dissemination activity in the
scope of large research projects or as a closed format for
research/commercial project consortia meetings.
=Setup and Requirements=
SEMANTiCS 2023 workshops and tutorials may be either half or full day
long. Workshops and tutorials take place on the days before and/or after
the main SEMANTiCS 2023 EU conference (20th, 21st, and/or 22nd of
September 2023). Details will be communicated on time.
Organizers of workshops and tutorials will be granted three free tickets
(only for the workshop & tutorial day) for organization purposes or
keynotes. Participants of workshops and tutorials will be charged a
marginal fee to cover the basic costs.
Workshop and tutorials proposals must include the following information:
* outline of the themes and goals of the event, including a title and a
brief abstract (less than 200 words) intended for the SEMANTiCS 2023 website
* a statement addressing why the event is important, why the event is
timely, how it is relevant to SEMANTiCS 2023 and the field of semantic
web. For the tutorials, why the presenters are qualified for a
high-quality introduction of the topic
* related workshops and conferences, i.e., specifying if this is a
continuation of a workshop series or is a new workshop to address an
emerging issue. Please provide information about past versions of this
workshop and other related workshops (including URLs and
submission/acceptance counts, if available).
* a statement addressing the quality assurance criterion that will be
used by the event organizers to select the papers for the workshops and
the presenters for the tutorials (e.g., peer review or review/evaluation
by event organizers). If a peer review process is chosen as a quality
assurance criterion for the workshops, the organizers will be
responsible for their own reviewing process. Workshop organizers will be
responsible also for their own publicity (e.g., website, timelines and
call for papers) and proceedings production.
* structure of the event and plans for generating and stimulating
discussion; how will the interaction be organized in case of a hybrid event
* desired minimum and maximum number of event participants, expected
number of participants, and (in case of previously held events) number
of registered attendees and web site for previous editions of the event
* a description of the intended audience and the expected learning outcomes
* desired prerequisite knowledge of the audience
* proposed duration of the event (i.e., half or full day), different
sessions if applicable (final time slot will be assigned in accordance
with the SEMANTiCS program)
* any equipment, room capacity, or other logistic constraints
* full contact information of all organizers of the event and main
contact person; a brief description of each organizer's background,
including relevant past experience in organizing events
Proposals for workshop and tutorial proposals must be submitted via
Easychair: https://easychair.org/my/conference?conf=sem23
=Review and Evaluation Criteria=
Workshop and tutorial proposals will be reviewed by the SEMANTiCS 2023
Workshop Chairs, as well as by the SEMANTiCS 2023 organizing committee,
according to the following criteria:
* The potential to advance the state of semantic web research and practice
* The quality assurance criterion proposed by the organizers to select
high-quality papers for workshops and presenters for tutorials
* The organizers' experience and ability to lead a successful event
* Timeliness and expected interest in the event topics
* The balance and synergy between all SEMANTiCS 2023 events
=Topics of interest include (but are not limited to)=
* Web Semantics & Linked (Open) Data
* Enterprise Knowledge Graphs, Graph Data Management and Deep Semantics
* Machine Learning & Deep Learning Techniques
* Semantic Information Management & Knowledge Integration
* Terminology, Thesaurus & Ontology Management
* Data Mining and Knowledge Discovery
* Reasoning, Rules and Policies
* Natural Language Processing and Computational Linguistics
* Social and Human aspects of Semantic Web
* Data Quality Management and Assurance
* Explainable Artificial Intelligence
* Semantics in Data Science
* Semantics of Blockchain & Distributed Ledger Technologies
* Trust, Data Privacy, and Security with Semantic Technologies
* Economics of Data, Data Services and Data Ecosystems
* Applications of Semantic Web technologies in domains such as law,
medicine, life sciences, digital humanities, mobility and smart cities, etc.
We especially invite contributions that illustrate the applicability of
the topics mentioned above for industrial purposes and/or illustrate the
business relevance of their contribution for specific industries.
Workshop proposals on emerging themes for the topics listed above are
encouraged.
In case you have additional questions concerning the submission process,
please do not hesitate to contact us via Easychair.
We are looking forward to your contribution!
Jennifer D’Souza - jennifer.dsouza(a)tib.eu
Anisa Rula - anisa.rula(a)unibs.it
Workshop & Tutorial Chairs
Hi All,
for an applied research work, I am working on extracting links from the
Wikipedia corpus.
I've been using in the past the XML streams, but not I was hoping to speed
up and handle better the situation by parsing the sql tables.
However, I am stuck on this:
I could not find a way to filter the relevant links.
I can only filter by namespace apparently, while I want to only keep the
links that were mentioned in the main text, still namespace 0, but not
belonging to the infoboxes and navboxes menu.
How could I do that?
Is there any information that a link belongs to a menu or to the main
content, beyond the namespace?
Thanks All for your help,
L.