Hi all you EventLogging users out there!
We will switch EventLogging MySQL ingestion to be based on a schema
whitelist rather than blacklist.
As you know, we currently import EventLogging events into two locations for
analysis: The MySQL ‘log’ database, and the Hive ‘event’ database. MySQL
is not able to handle high volume events. We currently blacklist
any schemas that we know have high volumes from being ingested into MySQL.
This can cause problems when a new high volume schema is deployed, as it
requires knowledge and communication from the schema owners to the
Analytics team, and it requires an Analytics Operations engineer to make a
Puppet commit to blacklist the schema. To address this problem, we will
switch the EventLogging MySQL schema blacklist to a whitelist. All schemas
that are actively being ingested into MySQL today will be whitelisted. In
the future, if you want an event schema to be ingested into MySQL, you’ll
need to ask the Analytics team to whitelist it.
Hive has been working for EventLogging analysis for a while now. It has
almost all of the schemas that MySQL does, plus the high volume ones. One
day in the (distant?) future, we’d like to decommission MySQL storage of
events. (Don’t worry yet, MySQL decommissioning has a lot of blockers and
this work is not planned.) By not ingesting events into MySQL by default,
we hope to encourage more users to switch to Hive.
This switch to a whitelist will happen this week. If you are deploying a
new schema and expect it to show up in MySQL, let us know so we can
-Andrew + the Analytics Engineering team