Hey stat1005|6 users!
The underlying host currently providing all of your dumps and datasets
needs over NFS (at /mnt/data) is being replaced soon. All datasets will
continue to be accessible on the stat boxes at the current path, but there
will be a transition time of a few hours. During that time, you may
encounter stale data or the files may simply be inaccessible. Please
schedule your work accordingly.
Dates: The migration is scheduled for April 2nd starting at 14:30 UTC, and
is expected to last a few hours.
Thanks! We'll send more updates closer to the migration date. If you have
any questions, just let us know.
Madhumitha Viswanathan & Ariel Glenn
The 1k limit is a hard one: only the top 1000 articles for a given day get
loaded into the database. I have added the folks from the Analytics team to
this thread; they may be able to help you, as they generate and expose the
data.
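
In the meantime, one possible workaround is to pull the daily top list for
many days and merge the results on your side. Below is a minimal sketch of
that idea, not an official client. The URL layout in `top_url` and the
response shape (`items` -> `articles` -> `article`/`views`) are my
assumptions about the "metrics/pageviews/top" endpoint, and the helper names
are hypothetical, so please check them against the API documentation before
relying on this.

```python
from collections import defaultdict
from datetime import date

# Assumed base path of the pageviews "top" endpoint; verify before use.
BASE = "https://wikimedia.org/api/rest_v1/metrics/pageviews/top"


def top_url(project: str, access: str, day: date) -> str:
    """Build the (assumed) URL of the daily top list for one project."""
    return f"{BASE}/{project}/{access}/{day:%Y/%m/%d}"


def merge_daily_tops(responses):
    """Sum views per article across several daily top-list responses.

    Each response is expected to be the decoded JSON body of one daily
    request. Returns (article, total_views) pairs, most viewed first,
    with ties broken alphabetically by title.
    """
    totals = defaultdict(int)
    for resp in responses:
        for item in resp.get("items", []):
            for entry in item.get("articles", []):
                totals[entry["article"]] += entry["views"]
    return sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))
```

Note that an article only contributes views on the days it made that day's
top 1000, so the merged totals are lower bounds. The union over many days can
exceed 1000 distinct titles, but it will not be a true top-100k ranking.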
Marko Obrovac, PhD
Senior Services Engineer
On 30 March 2018 at 16:59, Srdjan Grubor <srdjan(a)endlessm.com> wrote:
> I asked this on IRC but didn't get any replies so I'm following it up this
> I have a question about the newer metrics REST v1 API: is there a way to
> specify how many top articles to pull from https://wikimedia.org/api/
> access_year_month_day or is 1k hardcoded? Old metrics data was available
> that had the most viewed pages but that disappeared with the change to the
> new API.
> The reason I ask is because we (https://endlessos.com) are trying to
> rebuild our stale encyclopedia apps for offline usage but are space-limited
> and would only like to include the most likely pages that would be looked
> at that can fit within a size envelope that varies with the device in
> question (up to 100k article limit probably) but the new API doesn't
> provide us with the tools to figure out the rankings cleanly (other than
> rate-limiting on our side and checking every single article's metric
> endpoint for counts).
> So the main question is: do we have a way to get this data out with the
> current API? If this data is not available, can the "
> metrics/pageviews/top" API be augmented to maybe have a `skip` and/or `
> limit` params like other similar services that have this type of
> Srdjan Grubor | +1.314.540.8328 | Endless
> Services mailing list