Greetings, everyone –
Over the summer and into the fall the LD4 Wikidata Affinity Grouphttps://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group will be offering a series of Wikidata Working Hourshttps://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group/Wikidata_Working_Hours/Wikidata_Working_Hour_Summer-Fall_Project_2023 to give folks an opportunity to try out various Wikidata-related skills and tools by assembling a data set of diverse library and information science (LIS) materials (articles, conference proceedings, books) and adding it to Wikidata. Wikidata Working Hours provide hands-on Wikidata experience in a supportive space. We hope you will join us if you are interested in learning more about Wikidata, exploring LIS literature, and have been looking for a fun Wikidata project to contribute to.
The fourth Wikidata Working Hour in the series will cover the introduction of the Wikimedia PAWS environment as an option for data gathering and processing for your Wikidata project needs. We will spend the majority of the time looking at web scraping for article data using Python and the Beautiful Soup package for parsing. This includes looking at the scraping results of some pages and discussing what makes a page a good candidate for web scraping. We will also use this as an opportunity to think about data models and how to make your data approachable for both machines and humans. Lastly, we will fill out the skeleton of a web scraper with the appropriate information to scrape a page ourselves and then appreciate how this process can be scaled to scrape multiple pages while running a single script.
Date and time: Friday, September 29th, 2023 at 10:00am PT / 1:00pm ET / 17:00 UTC / 6:00pm WAT / 7:00pm CEST (Time zone converterhttps://zonestamp.toolforge.org/1696006800)
Zoom link to join: https://stanford.zoom.us/j/94752745101?pwd=amk1OG9XYTRaT2xGd3gzZ3I2U1pwZz09
Password: 368617
Event page: https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Gro...
Subsequent Working Hours will cover adding authors and publishers manually into Wikidata, batch editing using OpenRefine, batch editing using the LINCS tool, using the Author Disambiguator tool, and analyzing and visualizing data with SPARQL and Scholia.
This session will be recorded and the recording shared on the event page.
To be sure to receive announcements about future Wikidata Working hours, subscribe to the ld4-wikidata Google Grouphttps://groups.google.com/d/forum/ld4-wikidata.
Other ways to follow what’s going on with the Affinity Group:
Ld4-wikidata Google group: https://groups.google.com/d/forum/ld4-wikidatahttps://groups.google.com/d/forum/ld4-wikidata
#wikidata channel on LD4 Slack: https://join.slack.com/t/ld4/shared_invite/zt-23u674xs5-_E7sS0SWJwsBNsNvrILo...
Notes in public LD4 Wikidata Affinity Group folder: https://drive.google.com/drive/folders/1JwTulCABs0TkGQDVSnYbIYEb7bC-j4-nhttps://drive.google.com/drive/folders/1JwTulCABs0TkGQDVSnYbIYEb7bC-j4-n
WikiProject page: https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Gro...https://www.wikidata.org/wiki/Wikidata:WikiProject_LD4_Wikidata_Affinity_Group
Hilary Thorsen Resource Sharing Librarian Stanford Libraries thorsenh@stanford.edu 650-285-9429