On Wed, 24 Feb 2021 at 20:04, Samat <samat78(a)gmail.com> wrote:
> 1) I don't understand "overall size rank". Based on the definitions, it is
> the "count of unique devices which visited the wiki during that month".
> What does this have to do with the size of the wiki?
Thank you very much for noticing this! It looks like we accidentally mixed
up some of the definitions. I've restored the correct definition of overall
size rank, which is "a composite ranking of wikis by monthly active editors
and monthly unique devices (produced by taking the geometric mean of those
two values)". I checked all of the other definitions and they should be
correct.
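As a rough illustration of that definition, here is a minimal Python sketch of a geometric-mean ranking. The wiki names and numbers are made up, and the function names are hypothetical; this is not the code from our repository, only one way the calculation could look.

```python
import math

def size_score(monthly_active_editors, monthly_unique_devices):
    """Geometric mean of the two size metrics (hypothetical helper)."""
    return math.sqrt(monthly_active_editors * monthly_unique_devices)

def overall_size_rank(wikis):
    """Map each wiki name to its rank by size score, where 1 is the largest."""
    ordered = sorted(wikis, key=lambda name: size_score(*wikis[name]), reverse=True)
    return {name: rank for rank, name in enumerate(ordered, start=1)}

# Made-up values: (monthly active editors, monthly unique devices)
example = {
    "wiki_a": (100, 1_000_000),
    "wiki_b": (400, 90_000),
    "wiki_c": (10, 4_000_000),
}
ranks = overall_size_rank(example)
```

In this made-up data, wiki_c has the most unique devices and wiki_b the most active editors, yet wiki_a ranks first because the geometric mean rewards being reasonably large on both measures.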
> 2) I miss the *size of the database* (main namespace/content pages),
> which together with the number of content pages would give an impression of
> the mean size of the articles (I know this is not perfect, but better than
> nothing).
> 3) Many more wishes about additional metrics :D
Size of the database is a good suggestion, but sadly my team doesn't have
time to add any more metrics right now. If you'd like to note down your
suggestion for the future, we have been keeping a list at
mediawiki.org/wiki/Product_Analytics/Wiki_comparison_suggestions
<https://www.mediawiki.org/wiki/Product_Analytics/Wiki_comparison_suggestions>.
(We'd also be thrilled to help others expand the dataset directly, but this
is likely to be difficult since our code
<https://github.com/wikimedia-research/wiki-segmentation/tree/master/data-collection>
currently relies on production data access
<https://wikitech.wikimedia.org/wiki/Analytics/Data_access>, which is only
available to staff of the Wikimedia Foundation or Wikimedia Deutschland or
official research collaborators
<https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations>.)
--
Neil Shah-Quinn
senior data scientist, Product Analytics
<https://www.mediawiki.org/wiki/Product_Analytics>
Wikimedia Foundation <https://wikimediafoundation.org/>