On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev smalyshev@wikimedia.org wrote:
Hi!
1. Is there a unique key for the query log? The log I am referring to is the wdqs_extract table from the Hive database wmf. We would like to be able to permanently link our own computed data with the log entry we computed it from.
I think you can use hostname+sequence (from https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest, assuming those are preserved in wdqs_extract) as a key.
Adrian, you can also consider adding other fields to Stas' recommendation when creating the key, to be sure about uniqueness: for example, the IP and UA fields (or browser language, if it's relevant in your case), in combination with hostname and sequence. Let us know on this thread what you end up using, so we know the answer for the future. :)
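To make the suggestion concrete, here is a minimal sketch of how such a composite key could be built and checked for collisions once rows have been pulled out of the table. It assumes each row is available as a dict with "hostname" and "sequence" entries; the function names, the optional extra fields, and the sample host names are hypothetical and not part of wdqs_extract or any existing tooling.

```python
import hashlib
from collections import Counter


def make_log_key(hostname, sequence, *extra_fields):
    """Build a stable key for one log row from hostname + sequence.

    extra_fields (for example IP or user agent) are optional; they are
    only mixed into the key when supplied.
    """
    parts = [str(hostname), str(sequence)] + [str(f) for f in extra_fields]
    # Join with a separator that is unlikely to occur inside the fields,
    # then hash so the key has a fixed length.
    raw = "\x1f".join(parts)
    return hashlib.sha1(raw.encode("utf-8")).hexdigest()


def find_duplicate_keys(rows):
    """Return the keys that occur more than once in rows.

    rows is an iterable of dicts with "hostname" and "sequence" entries
    (an assumption about how the data was exported).
    """
    counts = Counter(make_log_key(r["hostname"], r["sequence"]) for r in rows)
    return [key for key, n in counts.items() if n > 1]


if __name__ == "__main__":
    # Made-up sample rows; the host names are placeholders.
    sample = [
        {"hostname": "host-a", "sequence": 1},
        {"hostname": "host-a", "sequence": 2},
        {"hostname": "host-b", "sequence": 1},
    ]
    print(find_duplicate_keys(sample))
```

If find_duplicate_keys returns an empty list on a representative sample, hostname+sequence alone is probably enough; otherwise the extra fields can be passed to make_log_key as additional arguments.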
Best, Leila
-- Stas Malyshev smalyshev@wikimedia.org
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics