Hi all,
Join the Research Team at the Wikimedia Foundation [1] for its monthly
office hours this Tuesday, 2021-08-03, at 16:00-17:00 UTC (9am PT/6pm
CEST).
To participate, join the video call via this link [2]. There is no set
agenda: feel free to add your item to the list of topics in the etherpad
[3] (you can also do this after you join the meeting), or simply come and
hang out. More detailed information (e.g. about how to attend) can be
found here [4].
Through these office hours, we aim to make ourselves more available to
answer some of the research-related questions that you as Wikimedia
volunteer editors, organizers, affiliates, staff, and researchers face in
your projects and initiatives. Some example cases we hope to be able to
support you in:
- You have a specific research-related question that you suspect you
  should be able to answer with the publicly available data, and you don’t
  know how to find an answer for it, or you just need some more help with
  it. For example, how can I compute the ratio of anonymous to registered
  editors in my wiki?
- You run into repetitive or very manual work as part of your Wikimedia
  contributions and you wish to find out if there are ways to use machines
  to improve your workflows. These types of questions can be harder to
  answer during an office hour. However, discussing them can help us
  understand your challenges better, and we may find ways to work with
  each other to support you in addressing them in the future.
- You want to learn what the Research team at the Wikimedia Foundation
  does and how we can potentially support you. Specifically for
  affiliates: if you are interested in building relationships with the
  academic institutions in your country, we would love to talk with you
  and learn more. We have a series of programs that aim to expand the
  network of Wikimedia researchers globally, and we would love to
  collaborate more closely with those of you interested in this space.
- You want to talk with us about one of our existing programs [5].
Hope to see many of you,
Martin on behalf of the WMF Research Team
[1] https://research.wikimedia.org
[2] https://meet.jit.si/WMF-Research-Office-Hours
[3] https://etherpad.wikimedia.org/p/Research-Analytics-Office-hours
[4] https://www.mediawiki.org/wiki/Wikimedia_Research/Office_hours
[5] https://research.wikimedia.org/projects.html
--
Martin Gerlach
Research Scientist
Wikimedia Foundation
[Cross-posting from the Wikidata chat]
Hi everyone,
Following some feedback from Azertus (thanks!), I collected statistics on
the most frequent Web domains that occur in Discogs [1] and MusicBrainz
[2]. Some of them look like candidates for identifier property creation,
while others stem from a failed match against known properties, mainly
due to inconsistencies in the URL match pattern (P8966), format as a
regular expression (P1793), and formatter URL (P1630) values.
You can have a look at them here [3].
It would be great to gather thoughts on the next steps.
Two main questions:
1. Should we go for a property proposal for each of the candidates?
2. What's the best way to fix the URL match pattern (P8966), format as a
regular expression (P1793), and formatter URL (P1630) values, so that
next time we can convert URLs to proper identifiers?
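To make question 2 concrete, here is a minimal sketch of how the three
property values could work together when converting a URL to an
identifier. It is not soweego's actual code. The function names and the
Discogs-style example values are hypothetical (they are not copied from
Wikidata), and the sketch assumes that the URL match pattern captures the
identifier in its first group.

```python
import re
from typing import Optional


def url_to_identifier(url: str, url_match_pattern: str,
                      format_regex: str) -> Optional[str]:
    """Return the identifier found in `url`, or None if there is no valid match.

    Assumption: the URL match pattern captures the identifier in group 1.
    """
    match = re.search(url_match_pattern, url)
    if match is None:
        return None  # The URL does not match this property's pattern.
    identifier = match.group(1)
    # Reject captures that violate the property's declared identifier format.
    if re.fullmatch(format_regex, identifier) is None:
        return None
    return identifier


def identifier_to_url(identifier: str, formatter_url: str) -> str:
    """Build a URL by substituting the identifier for the `$1` placeholder."""
    return formatter_url.replace("$1", identifier)


# Illustrative values only, NOT the ones stored in Wikidata.
URL_MATCH_PATTERN = r"^https?://(?:www\.)?discogs\.com/artist/(\d+)"
FORMAT_REGEX = r"[1-9]\d*"
FORMATTER_URL = "https://www.discogs.com/artist/$1"

identifier = url_to_identifier(
    "https://www.discogs.com/artist/12345-Example",
    URL_MATCH_PATTERN,
    FORMAT_REGEX,
)
if identifier is not None:
    print(identifier, identifier_to_url(identifier, FORMATTER_URL))
```

Under these assumptions, a URL that fails either step would be reported
as a failed match, which is the symptom that inconsistent values among
the three properties would produce.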
Cheers,
Marco
[1] https://www.discogs.com/
[2] https://musicbrainz.org/
[3] https://meta.wikimedia.org/wiki/Grants:Project/Hjfocs/soweego_2/Timeline#Ju…