Monthly dumps are used for the following: 1) Checkwiki - helps clean up syntax and other errors in the source code for several languages and wiki types - pages-articles - https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Check_Wikipedia 2) TemplateParameters - displays template parameter usage, uses TemplateData to validate parameter name usage, for several languages and wiki types - pages-articles - https://bambots.brucemyers.com/TemplateParam.php 3) Wikidata Class Browser - class tree with statistics and common class properties - pages-articles - https://bambots.brucemyers.com/WikidataClasses.php 4) Wikidata NavelGazer - user editing statistics - stub-meta-history, change_tag.sql - https://bambots.brucemyers.com/NavelGazer.php
On 10/8/24 11:59 AM, Bryan Davis wrote:
I was asked recently what I knew about the types of tools that use data from the https://dumps.wikimedia.org/ project. I had to admit that I really didn't know of many tools off the top of my head that relied on dumps. Most of the use cases I have heard about are for research topics like looking at word frequencies and sentence complexity, or machine learning things that consume some or all of the wiki corpus.
Do you run a tool that needs data from Dumps to do its job? I would love to hear some stories about how this data helps folks advance the work of the movement.
Bryan