On Mon, Jul 27, 2015 at 2:04 PM, Trey Jones <tjones(a)wikimedia.org> wrote:
My original sample was a 100K sample from zero-results
queries to enwiki
on 7/24. Today I looked at similar samples from 7/10 and 7/17 (since there
is a weekly pattern to traffic) and from 7/22 to compare.
All of the patterns I detected are still present, in approximately the
same volume (give or take a factor of 2), except for the
('"<TITLE>"',
'<AUTHOR(S)>') pattern.
I've started looking at a 500K sample from 7/24 across all wikis. I'll
have more results tomorrow, but right now it's already clear that someone
is spamming useless DOI searches across wikis—and it's 9% of the wiki
zero-results queries.
—Trey
Very interesting. i wonder if they ever get results for the doi searches
(for example some of the references here have doi's:
https://en.wikipedia.org/wiki/DNA). If they are searching specifically for
doi's of specific reference materials, i wish we had a better way to let
them query that (perhaps wikidata eventually, i wonder how their support is
for putting reference material into wikidata).