Hi everybody,
On Monday 21st we'd like to reboot all stat100x hosts for Linux kernel
upgrades at around 9 AM CET. This means that all the notebooks and various
activities running on those nodes will be stopped for a brief amount of
time. To repay your patience, two things will be added:
- A shared kerberos credential cache with notebooks. This practically means
that you will only be required to kinit once (either after doing ssh to
stat100x or in a Jupyter notebook), and the credentials will be shared (no
more double kinit etc..). It is already "live" on stat1004 if you want to
test it! Since the new shared credential will have a new location on disk,
all kerberos sessions will be destroyed and you'll have to kinit again when
the reboots are completed. More details in
https://phabricator.wikimedia.org/T255262.
- A new endpoint for Hive called 'analytics-hive.eqiad.wmnet', that should
replace hive jdbc/metastore configs hardcoding an-coord1001.eqiad.wmnet
(and allow us to failover transparently if needed without requesting job
restarts etc..). The side effect of this is that all hive-related tools
will change configs (transparently for external users). If you have any
script that points directly to hive via JDBC (for example a Python script
using PyHive etc..) please update it with the new endpoint.
If this schedule impacts your work, please ping me via email/IRC/etc.. and
I'll try to reschedule accordingly :)
Thanks!
Luca (on behalf of Analytics / Data Engineering)
Hi all,
On Tuesday, December 22, from 15-16 UTC (10-11am EST, 7-8am PST),
superset.wikimedia.org will be offline to upgrade the hardware and add
caching as part of https://phabricator.wikimedia.org/T268219.
When the upgrade is complete, by default, charts will be cached for 12
hours. As you can see in the following screenshot, you can view whether a
chart is cached from the overflow menu, and you'll have the option to force
refresh it.
[image: Screen Shot 2020-12-16 at 11.40.51 AM.png]
The time that any given chart will be cached is configurable via the "edit
chart" menu item. For example, set the cache timeout to 3600 seconds for
data to cache for an hour.
[image: image.png]
Reply to this email or reach out to razzi or the #wikimedia-analytics IRC
channel if you have any questions or concerns about this migration. As
always, the maintenance schedule can be viewed here
<https://wikitech.wikimedia.org/wiki/Analytics/Systems/Maintenance_Schedule>
.
Regards,
Razzi
Hi everybody,
The Analytics team is trying to simplify the access request process to the
stat100x clients to avoid, as much as possible, confusion for the user
requesting access or for the SRE reviewing the access request. The
following is happening:
* analytics-users and researchers POSIX group are being deprecated in
https://phabricator.wikimedia.org/T269150 and
https://phabricator.wikimedia.org/T268801. They are used only by few users
and they are not needed anymore nowadays. To be clear, we are not trying to
deprecate the Research team, we love them :)
* analytics-privatedata-users becomes the standard POSIX group to access
the stat100x hosts and the Hadoop cluster. A user will be able to require
only membership to the group (granting access to the stat100x hosts plus
some PII data like the one on Mariadb Wiki-replicas etc..) or also to
request the additional Kerberos account, to have access to Hadoop's PII
data too (and compute power).
The main idea is to shift the focus of the user requesting access to the
fact that they will be exposed to PII data in some form, so careful steps
will need to be taken (see
https://wikitech.wikimedia.org/wiki/Analytics/Data_access#User_responsibili…
).
As always, feedback and suggestions are welcome!
Luca (Analytics team)