jayvdb added a comment.
To me, 'step' feels like it is breaking a batch into non-overlapping subsets,
which isn't strictly true if each 'step' is a new random sequence, especially if
each batch contains only unique items (which means the server algorithm is slightly
reducing the randomness whenever a duplicate appears).
If we look at a very small wiki, the underlying generator doesn't repeat if the limit
isn't reached.
https://www.molnac.unisa.it/BioTools/mdcons/api.php?action=query&genera…
https://www.molnac.unisa.it/BioTools/mdcons/index.php/Special:ListFiles
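The duplicates concern can be illustrated offline with a toy model. This sketch
assumes each server-side batch is an independent random sample (an assumption for
illustration, not the actual MediaWiki algorithm); `fetch_random_batch` and
`collect` are hypothetical names, not pywikibot or API functions:

```python
import random

def fetch_random_batch(pages, size, rng):
    # Hypothetical stand-in for one list=random API call: each batch
    # is unique internally, but batches are drawn independently.
    return rng.sample(pages, size)

def collect(pages, total, batch_size, rng):
    # Emulate a client assembling `total` titles from several 'steps'.
    results = []
    while len(results) < total:
        results.extend(fetch_random_batch(pages, batch_size, rng))
    return results

pages = ["Page%d" % i for i in range(30)]
rng = random.Random(42)
titles = collect(pages, 20, 10, rng)
# Two independent batches of 10 may repeat titles across the batch
# boundary, unlike a single server-side batch of 20 unique titles.
print(len(titles), len(set(titles)))
```

With a small enough wiki the duplicates across batch boundaries become very likely,
which is exactly what a caller cannot distinguish from a single unique batch.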
IMO, in site.randompages we are trying to expose the underlying MediaWiki API, and it
doesn't have continuation. A caller can't know whether a limit of 20 is served as two
batches of 10 by the server algorithm, or as a single batch of 20 (which is unique?).
The only way to have any chance of knowing that is to obtain the API limit from
paraminfo, and use it.
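A minimal sketch of pulling the limit out of a paraminfo response (the query would be
something like action=paraminfo&modules=query+random; the response below is a
trimmed, made-up example, and the numeric limits are illustrative, not the live
values):

```python
def random_limit(paraminfo):
    # Find the 'limit' parameter of the random module and return its
    # normal and high (bot/sysop) maximums.
    module = paraminfo["paraminfo"]["modules"][0]
    for param in module["parameters"]:
        if param["name"] == "limit":
            return param["limit"], param["highlimit"]
    return None

# Trimmed, hypothetical paraminfo response for list=random.
sample = {
    "paraminfo": {
        "modules": [{
            "name": "random",
            "parameters": [
                {"name": "namespace", "type": "namespace"},
                {"name": "limit", "type": "limit",
                 "limit": 10, "highlimit": 20},
            ],
        }]
    }
}

print(random_limit(sample))  # (10, 20)
```

If the requested limit is at or below the value paraminfo reports, the caller at
least knows the server could have produced the result as one batch.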
However, I don't know the underlying randomness algorithm well enough to speak with
much authority about that, or how many of our users want to 'see' the underlying
randomness versus being happy with any randomness that has slight oddities introduced
by multiple disjoint batches.
Whatever we do, we need to update the docstring to explain what we are doing, in case
the caller cares.
TASK DETAIL
https://phabricator.wikimedia.org/T84944
To: jayvdb
Cc: gerritbot, valhallasw, jayvdb, Aklapper, Mpaa, pywikipedia-bugs