[pywikibot] Re: Testing against live wikis

26 Jun 2023


      Hi folks,
Some thoughts on the following from someone who worked on Pywikibot 
tests a /very/ long time (>10 years...) ago:
On 26/06/2023 00:47, Roy Smith wrote:
...
I'm concerned that we're writing tests which depend on knowing the 
current contents of a live production wiki.
On 15/05/2023 21:11, YiFei Zhu wrote:
...
This way, if a unit test fails, you know
it's Pywikibot's code that is to blame and not something like a
MediaWiki change or the environment setup.
First of all: tests that randomly fail, and tests that require some 
not-so-well documented set of steps to run at all are /a pain/. So 
there's certainly room for improvement there.
On thing to keep in mind is that Pywikibot is a bit weird, as there is 
only a limited set of things it does /on itself/. The value of Pywikibot 
lies in the interaction with Mediawiki, and by and large /with the 
specific version of Mediawiki deployed on Wikimedia wikis (including all 
the weird extensions)/. And because of that, there is a massive value in 
tests that test /the integration with Wikimedia wikis/: it's great if 
mocked API calls work, but if it breaks against the real API, Pywikibot 
is not doing its job for the user. Unfortunately that does mean the 
/contents/ of the wiki now are a dependency for the test as well.
The alternative is mocking bits of Pywikbot: either the API requests or 
the Site object itself. The former is something I've played with in the 
past by recording (caching) the requests sent to the actual site. 
Although that works, there are two main issues: you regularly need to 
update the 'logged' requests (as some may be missing etc), and you run 
the risk of leaking private data. The second method sounds more 
appealing, but in practice, you'll be mocking /every/ method. In effect, 
at that point you're re-implementing the Mediawiki API in Python. Might 
as well talk to the real one then?
With Docker containers being a fairly standard thing (although still a 
pain under Windows) spinning up a test container with a well-known MW 
version and well-known content might be the best of both worlds. A 
stable development environment while still being able to run the tests 
against both a 'latest' MW container and a live site (but that can then 
just be done on the CI server rather than trying to run all local tests 
on a live site?).
In the end, all approaches have their own share of pain. Live testing 
requires a fiddly live-site-setup, mocking is brittle, and docker.... 
has its own steep learning curve. The only true advice I can give here 
is 'choose your battles' and 'document everything you can, and then 
some' :-)
Cheers,
Merlijn

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

[pywikibot] Re: Testing against live wikis