Hi Lars,
This is Leila from WMF Research. Recently, we have been receiving a lot of requests about search queries. Here is a response we've given to another researcher few days ago, FYI, and hopefully it will be helpful.
Best, Leila
------------------
As you well know, access to the data you're asking for is not straightforward, and it's a topic that resurfaces every few months, as the editor community is also very interested in it. See for example a recent discussion in here https://lists.wikimedia.org/pipermail/wikimedia-l/2016-July/084745.html.
There are a few things the Research team (that I'm a member of) needs to know before we can say more:
* We need a proposal from you and your collaborators of your project explaining what the project is, a short description of the methodology you're proposing or approaches you want to try, and how the project can contribute to Wikimedia Foundation's mission/plans and/or Wikimedia/Wikipedia community. If there is something in our annual plan https://meta.wikimedia.org/wiki/Wikimedia_Foundation_Annual_Plan/2016-2017/Final that catches your eyes as a potential alignment, please bring that up to us in your proposal.
You can create a page at https://meta.wikimedia.org/wiki/Research for your project and share a link with us. Note that the proposal shouldn't be long. See for example the proposal for this research https://meta.wikimedia.org/wiki/Research:Increasing_article_coverage (search for "Proposal" in the page).
* If you are under time constraints, please be explicit about it in your proposal. Looking at the current list of our collaborations https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations#Current_list_of_formal_collaborators, and knowing that there are few more in the process, you may have to wait for some time before one of us can work with you to make it happen, of course if your proposal is passed by the team.
* To learn more about our formal collaborations, which is the way such access to data can be made possible, please read here https://www.mediawiki.org/wiki/Wikimedia_Research/Formal_collaborations.
Leila Zia Senior Research Scientist Wikimedia Foundation
On Mon, Sep 5, 2016 at 11:19 AM, Nuria Ruiz nuria@wikimedia.org wrote:
By the way, what about alternate, external methods such as subscribing that particular wikibook to Google Search Console?
Our privacy policy prevent us from sending data to third party , so sending analytics data to google is not allowed.
Thanks,
Nuria
On Sun, Sep 4, 2016 at 11:26 PM, Lars Noodén lars.nooden@gmail.com wrote:
On 09/05/2016 07:36 AM, Nuria Ruiz wrote:
Lars,
I am not sure we have at the data you are looking for, the data we get
from
searches is only available for 60 days or less while it gets processed
and
deleted after that. Agreggated pageview data is kept long term, search
data
is not.
Even the most recent 30 to 60 days worth would help. The pageview data shows what is used but gives no hint about why.
So, what would be the process to request access to the raw data and
what would
be the conditions for such access? Access to raw data is normally restricted to research projects. You can perhaps do a request for a 1 time query but, as I was saying, the data
you
are looking for is not available long term.
I've made a request in phabricator, if I understand the request procedure properly.
https://phabricator.wikimedia.org/T144714
You can read about data access here: https://meta.wikimedia.org/wiki/Research:FAQ
Thanks. I'm wading through that one and the nearby pages.
Thanks,
Nuria
By the way, what about alternate, external methods such as subscribing that particular wikibook to Google Search Console? If it is allowed, I might try it to see if it is possible and what it yields.
Regards, Lars
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics