Hmm... expand the regex, to make sure you're accounting for most of the
possible rationales? There are a number of word frequency calculators out
there that will show you the most common words in block rev edit comments.
Some will even exclude common stopwords
<http://en.wikipedia.org/wiki/Stop_words>.
Bonus points if you make a pretty wordle <http://www.wordle.net/> out of
your results and upload it to commons.
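
If you'd rather not rely on an online calculator, here is a minimal sketch of the same idea in Python. The stopword list is deliberately tiny, the sample comments are made up, and the name top_words is just something I picked for illustration, so treat it as a starting point rather than a finished tool.

```python
import re
from collections import Counter

# Small illustrative stopword list (an assumption); a real run would use a fuller one.
STOPWORDS = {"a", "an", "and", "the", "of", "to", "in", "for", "on", "is", "by", "with"}

def top_words(comments, n=10, stopwords=STOPWORDS):
    """Return the n most common non-stopword tokens across all comments."""
    counts = Counter()
    for comment in comments:
        # Lowercase, then keep runs of letters and apostrophes as tokens.
        tokens = re.findall(r"[a-z']+", comment.lower())
        counts.update(t for t in tokens if t not in stopwords)
    return counts.most_common(n)

# Hypothetical block log comments, invented for this example.
sample = [
    "Vandalism-only account",
    "Spam / advertising-only account",
    "Persistent vandalism",
    "Block evasion: sockpuppet of a blocked user",
]
print(top_words(sample, n=3))
```

Words that rank highly but aren't covered by the current regex would be the obvious candidates for expanding it.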
- J
On Tue, Apr 30, 2013 at 4:01 PM, Oliver Keyes <okeyes(a)wikimedia.org> wrote:
Good point. Feedback on future things to look at/how to turn this into
something that might have a productive, rather than informative, outcome?
On 30 April 2013 23:56, Jonathan Morgan <jmorgan(a)wikimedia.org> wrote:
Nice work, Oliver.
Minor feedback: I'd remove the 2013 data from the first 2 graphs. It's
easy for people to miss the fact that those data are for < 1 year, even
though you explain that in the text (and given it should be obvious to
someone reading this in April...).
However, what jumps out to someone who's skimming is "whoa, sudden drop
in [interesting phenomenon foo] during 2013--wtf?"
- J
On Tue, Apr 30, 2013 at 1:53 PM, Aaron Halfaker <aaron.halfaker(a)gmail.com> wrote:
Very cool!
Is there an easy way we could detect some false-positives? I'm
imagining blocks that were quickly reversed.
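
Roughly what I have in mind, as a sketch only: pair each block with the next unblock of the same user and flag the pair if the gap is short. The (user, action, timestamp) tuple shape, the one-hour window and the function name are all assumptions on my part, not the real log schema or a tested threshold.

```python
from datetime import datetime, timedelta

def quickly_reversed(events, window=timedelta(hours=1)):
    """Return (user, blocked_at, unblocked_at) for blocks undone within `window`.

    `events` is an iterable of (user, action, timestamp) tuples, where action
    is "block" or "unblock". This shape is assumed for illustration.
    """
    open_blocks = {}  # user -> timestamp of the most recent unmatched block
    reversed_blocks = []
    for user, action, ts in sorted(events, key=lambda e: e[2]):
        if action == "block":
            open_blocks[user] = ts
        elif action == "unblock" and user in open_blocks:
            blocked_at = open_blocks.pop(user)
            if ts - blocked_at <= window:
                reversed_blocks.append((user, blocked_at, ts))
    return reversed_blocks

# Made-up events for illustration.
events = [
    ("UserA", "block", datetime(2013, 4, 1, 12, 0)),
    ("UserA", "unblock", datetime(2013, 4, 1, 12, 20)),
    ("UserB", "block", datetime(2013, 4, 2, 9, 0)),
    ("UserB", "unblock", datetime(2013, 4, 9, 9, 0)),
]
print(quickly_reversed(events))
```

A quick unblock isn't proof of a false positive, of course, but it would give us a list of cases worth reading by hand.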
-Aaron
On Tue, Apr 30, 2013 at 3:44 PM, Oliver Keyes <okeyes(a)wikimedia.org> wrote:
Per Howie's prompting (this was cool! You should send it out to the
team) some research I did in my spare time -
http://blog.ironholds.org/?p=31
Planning to do a pile of followup work, so any feedback, hypotheses or
requests for info gratefully received.
--
Oliver Keyes
Community Liaison, Product Development
Wikimedia Foundation
_______________________________________________
EE mailing list
EE(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/ee
--
Jonathan T. Morgan
Research Strategist
Wikimedia Foundation