On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev <smalyshev(a)wikimedia.org>
wrote:
Hi!
1. Is there a unique key for the query log?
The log I am refering to
is the *wdqs_extract* table**from
the hive database wmf.**We would like to be able to
permanently link our own computed data with the log entry we
computed it from.
I think you can use hostname+sequence (from
https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest, assuming
those are preserved in wdqs_extract) as a key.
Adrian, you can also consider adding other fields to Stas' recommendation
to create the key, to be sure about uniqueness. For example, IP and UA
fields, in combination with hostname and sequence (or browser language, if
it's relevant in your case). Let us know what you end up using on this
thread, so we know the answer for the future. :)
Best,
Leila
--
Stas Malyshev
smalyshev(a)wikimedia.org
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics