We would store it locally like we do with country and continent lookup list, and could manually vet whether cities are > say 100,000 people)


I'm not sure that would always provide the safety we're looking for.  Because police work, by a nefarious agent in a city with 100,000 people, would quite easily lead to the identity of a specific editor.

 

As for latitude/longitude, again, these should be rounded on purpose.

If we round on  0.5 degree, this gives a latitudinal resolution of around 55 km or 30 mi at the equator, and 22 km or 12 mile at the arctic circle.

 

(Again state or region lookup might be too costly to lookup anyway, but that is another matter)


Unfortunately, I think 30 miles would not provide enough anonymity in China because in some 30 mile areas there may only be a few small villages.  Also unfortunately 30 miles would not provide the accuracy that James needs to capture Washington D.C. activity, because any log line would show up in Maryland, Virginia, and D.C. simultaneously.

I think we have to turn this request on its head a little bit and think about the people who are going to be potentially identified.  We somehow have to get their permission to analyze this data.  If you look at any other geo-analysis being performed by Apple, Google, etc. this is not unusual - they always ask permission from the end user being tracked.  We could ask permission in the same way, but maybe find a way to be less creepy than the typical Google approach.