On Mon, Jul 27, 2015 at 2:04 PM, Trey Jones <tjones@wikimedia.org> wrote:
My original sample was a 100K sample from zero-results queries to enwiki on 7/24. Today I looked at similar samples from 7/10 and 7/17 (since there is a weekly pattern to traffic) and from 7/22 to compare.

All of the patterns I detected are still present, in approximately the same volume (give or take a factor of 2), except for the ('"<TITLE>"', '<AUTHOR(S)>') pattern.

I've started looking at a 500K sample from 7/24 across all wikis. I'll have more results tomorrow, but right now it's already clear that someone is spamming useless DOI searches across wikis—and it's 9% of the wiki zero-results queries.

—Trey


Very interesting. i wonder if they ever get results for the doi searches (for example some of the references here have doi's: https://en.wikipedia.org/wiki/DNA). If they are searching specifically for doi's of specific reference materials, i wish we had a better way to let them query that (perhaps wikidata eventually, i wonder how their support is for putting reference material into wikidata).