On Tue, Jan 3, 2017 at 9:30 AM, Stas Malyshev <smalyshev@wikimedia.org> wrote:
Hi!

>     1. Is there a unique key for the query log? The log I am referring to
>     is the wdqs_extract table from
>     the Hive database wmf. We would like to be able to
>     permanently link our own computed data with the log entry we
>     computed it from.

I think you can use hostname+sequence (from
https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest, assuming
those are preserved in wdqs_extract) as a key.

Adrian, to be sure the key is unique, you can also consider adding other fields to Stas' recommendation. For example, you could combine the IP and UA (user agent) fields with hostname and sequence (and browser language, if it's relevant in your case). Let us know on this thread what you end up using, so we have the answer for the future. :)
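
In case it helps, here is a rough sketch of how such a key could be built once the rows are out of Hive. It is only an illustration, not something we run in production: the function name make_log_key, the sample values, and the choice to hash the joined fields with SHA-256 are all my own assumptions. The only things taken from this thread are the hostname and sequence fields and the idea of optionally adding more fields.

```python
import hashlib


def make_log_key(hostname, sequence, *extra_fields):
    """Return a stable hex key for one log entry (hypothetical helper).

    hostname and sequence are the fields suggested in this thread;
    extra_fields are optional additions (e.g. IP, user agent).
    """
    parts = [str(hostname), str(sequence)] + [str(f) for f in extra_fields]
    # Join with the ASCII unit separator so that ("ab", "c") and ("a", "bc")
    # cannot collapse into the same string before hashing.
    raw = "\x1f".join(parts)
    # Hashing is an assumption of this sketch: it gives a fixed-length key
    # that does not expose the raw field values.
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()


# Made-up example values, for illustration only.
basic_key = make_log_key("cp1001.example.org", 123456789)
extended_key = make_log_key(
    "cp1001.example.org", 123456789, "192.0.2.1", "ExampleAgent/1.0"
)
```

The same inputs always give the same key, so you can recompute it later to join your derived data back to the log entry, as long as the underlying fields are still present in the table.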

Best,
Leila



--
Stas Malyshev
smalyshev@wikimedia.org
