Lars,
I am not sure we have the data you are looking for. The data we get from searches is only available for 60 days or less while it is processed, and it is deleted after that. Aggregated pageview data is kept long term; search data is not.
So, what would be the process to request access to the raw data and what would be the conditions for such access?
Access to raw data is normally restricted to research projects. You could perhaps request a one-time query but, as I was saying, the data you are looking for is not available long term.
You can read about data access here: https://meta.wikimedia.org/wiki/Research:FAQ
Thanks,
Nuria
On Sun, Sep 4, 2016 at 4:38 AM, Lars Noodén <lars.nooden@gmail.com> wrote:
Thanks, Dan and Nuria, for the responses.
I see that the 'webrequest' table [1], with its current schema, has a field holding the raw header, which contains a superset of the data I am looking for with regard to the Wikibook:
    referer    string    Referer header of request
but I don't think I would be able to propose a generic database query that would produce sufficiently sanitized data. At this point, I'm looking for only the search strings.
I'm also not familiar enough with the contents of uri_path or uri_query to know which one would restrict the search to specific Wikibooks.
So, what would be the process to request access to the raw data and what would be the conditions for such access? If I were to pursue that, as far as a general-interest research project goes, the referred search terms could be grouped by Featured Book (plus the one non-featured book I am aiming for). There are about 200 English-language Featured Books [2] at the moment.
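To illustrate the grouping described above, here is a minimal Python sketch, not a query against the real table. It assumes rows have already been reduced to (uri_path, referer) pairs, that book pages live under /wiki/<Book_title>/..., and that the referring search engine passes the search string in a "q" query parameter. None of these assumptions is confirmed by the schema page, and the function names and sample rows are made up for the example.

```python
from collections import defaultdict
from urllib.parse import parse_qs, unquote, urlsplit


def search_terms(referer):
    """Return the search string carried in a referer URL, or None.

    Assumption: the search engine passes the terms in a 'q' parameter.
    """
    values = parse_qs(urlsplit(referer).query).get("q")
    return values[0] if values else None


def book_title(uri_path):
    """Return the book part of a path such as /wiki/Book_title/Chapter.

    Assumption: the first path segment after /wiki/ names the book.
    """
    prefix = "/wiki/"
    if not uri_path.startswith(prefix):
        return None
    first_segment = uri_path[len(prefix):].split("/", 1)[0]
    return unquote(first_segment).replace("_", " ") or None


def group_terms_by_book(rows, books):
    """Collect search strings per book, keeping only the listed books."""
    grouped = defaultdict(list)
    for uri_path, referer in rows:
        title = book_title(uri_path)
        terms = search_terms(referer)
        if title in books and terms:
            grouped[title].append(terms)
    return dict(grouped)


# Hypothetical sample rows; real data would come from the referer and
# uri_path fields of the webrequest table.
rows = [
    ("/wiki/Example_Book/Chapter_1", "https://search.example/search?q=how+to+foo&hl=en"),
    ("/wiki/Example_Book", "https://search.example/?q=bar"),
    ("/wiki/Other_Book", "https://search.example/?q=baz"),
    ("/wiki/Example_Book", "https://en.wikibooks.org/wiki/Main_Page"),
]
result = group_terms_by_book(rows, {"Example Book"})
print(result)
```

Only the extracted search strings and the book titles are kept; the rest of the referer is discarded, which is the kind of sanitizing a one-time query would need to do.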
Regards, Lars
[1] https://wikitech.wikimedia.org/wiki/Analytics/Data/Webrequest#Current_Schema
[2] https://en.wikibooks.org/wiki/Wikibooks:Featured_books#Featured_books
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics