Hello all users of Airflow,
We need to upgrade Airflow on all of our Airflow instances, so I'm
scheduling a maintenance window for tomorrow, Wednesday November 29th at
10:30 UTC, and I expect the work to take no more than 30 minutes.
I will pause all active DAGs on all Airflow instances prior to the work,
allow some time for running tasks to complete, then resume the DAGs
afterwards.
Naturally, you are also free to pause your own DAGs prior to the
maintenance and resume them afterwards, should you wish to minimize the
risk of disruption.
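For anyone who would rather script the pause and resume than click through the UI, here is a minimal sketch. It assumes an Airflow 2 instance with the stable REST API enabled, which may not hold for every instance. The base URL, the DAG name, and the helper `build_pause_request` are hypothetical, and authentication is left out because it depends on how your instance is configured. The helper only builds the request; it does not send it. The `airflow dags pause <dag_id>` and `airflow dags unpause <dag_id>` CLI commands are an alternative where you have shell access.

```python
import json
import urllib.parse
import urllib.request

# Hypothetical base URL; substitute the address of your own Airflow instance.
AIRFLOW_BASE_URL = "http://localhost:8080"


def build_pause_request(dag_id, paused, base_url=AIRFLOW_BASE_URL):
    """Build (but do not send) a PATCH request that sets a DAG's paused state.

    Targets the Airflow 2 stable REST API endpoint /api/v1/dags/{dag_id};
    update_mask restricts the change to the is_paused field.
    """
    quoted_id = urllib.parse.quote(dag_id, safe="")
    url = f"{base_url}/api/v1/dags/{quoted_id}?update_mask=is_paused"
    body = json.dumps({"is_paused": paused}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="PATCH",
        headers={"Content-Type": "application/json"},
    )


if __name__ == "__main__":
    # "example_dag" is a placeholder DAG id.
    request = build_pause_request("example_dag", paused=True)
    print(request.get_method(), request.full_url)
    # Sending the request needs credentials for your instance (omitted here):
    # urllib.request.urlopen(request)
```

Calling the same helper with `paused=False` builds the request that resumes the DAG after the maintenance.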
Please do let me know if there is anything specific that you would like
me to check, either before or after this maintenance.
Kind regards,
Ben
--
*Ben Tullis* (he/him)
Senior Site Reliability Engineer
Wikimedia Foundation <https://wikimediafoundation.org/>
Hi all,
The next Research Showcase will be live-streamed on Wednesday, November 15,
at 9:30 AM PST / 17:30 UTC. Find your local time here
<https://zonestamp.toolforge.org/1700069400>. This showcase will focus on
*Bibliometrics*, just in time for the GLAM Wiki conference happening this
week in Montevideo.
YouTube stream: https://www.youtube.com/watch?v=IxNa6vgMCDY. As usual, you
can join the conversation in the YouTube chat as soon as the showcase goes
live.
This month's presentations:
Gender and country biases in Wikipedia citations to scholarly publications
By *Chaoqun Ni, University of Wisconsin-Madison*
Ensuring Wikipedia cites
scholarly publications based on quality and relevancy without biases is
critical to credible and fair knowledge dissemination. We investigate
gender- and country-based biases in Wikipedia citation practices using
linked data from the Web of Science and a Wikipedia citation dataset. Using
coarsened exact matching, we show that publications by women are cited less
by Wikipedia than expected, and publications by women are less likely to be
cited than those by men. Scholarly publications by authors affiliated with
non-Anglosphere countries are also disadvantaged in getting cited by
Wikipedia, compared with those by authors affiliated with Anglosphere
countries. The level of gender- or country-based inequalities varies by
research field, and the gender-country intersectional bias is prominent in
math-intensive STEM fields. To ensure the credibility and equality of
knowledge presentation, Wikipedia should consider strategies and guidelines
to cite scholarly publications independent of the gender and country of
authors.

Exploring Social Attention Dynamics through Wikipedia
By *Wenceslao Arroyo-Machado, Universidad de Granada*
The untapped potential of Wikipedia
as a mirror of society's evolving interests and concerns is explored.
Recognizing Wikipedia as a vast, interactive repository of human knowledge,
the investigation focuses on how patterns of edits, views, and discussions
within Wikipedia articles, as well as their features, can serve as
real-time indicators of public interest and engagement. Key findings reveal
that Wikipedia is not just an information source but a reflection of
collective concerns, capturing significant trends and shifts in societal
focus. Additionally, it allows for the highlighting of both local and
international interests. These implications are far-reaching, offering
valuable insights for the Wikipedia community, academic researchers,
policymakers, and the general public. Understanding the dynamics of public
engagement on Wikipedia can inform content strategies, shape research
agendas, and guide public policy, while also providing a deeper
appreciation of the impact and significance of contributions made by the
global Wikipedia community.
You can also watch our past research showcases here:
https://www.mediawiki.org/wiki/Wikimedia_Research/Showcase
Best,
Kinneret
--
Kinneret Gordon
Lead Research Community Officer
Wikimedia Foundation <https://wikimediafoundation.org/>
Hello,
We have to carry out some scheduled maintenance that will require a
brief period of disruption for our analytics_meta
<https://wikitech.wikimedia.org/wiki/Data_Engineering/Systems/Analytics_Meta>
MariaDB service, whilst it is moved to a new primary host
<https://phabricator.wikimedia.org/T284150>. This will affect Hive, both
Druid clusters, Superset, Hue, and DataHub.
I plan to do this work tomorrow morning, starting shortly after 11:00
UTC, and I expect the change to take no more than about 20 minutes,
during which time you might find that the above services are disrupted.
Our production pipelines that write to HDFS and Hive will be paused while the
work is being carried out.
I will likely also put HDFS briefly into Safe Mode, which prevents write
access, whilst I reconfigure and restart Hive.
If you could plan to work your own tasks and pipelines around this
maintenance window, I would be grateful. Please do get in touch if you
have any questions, or if this maintenance plan will cause you any specific
inconvenience. If you think that you will be adversely affected, I can
see whether it is possible to reschedule or find another workaround for you.
Kind regards,
Ben
--
*Ben Tullis* (he/him)
Senior Site Reliability Engineer
Wikimedia Foundation <https://wikimediafoundation.org/>