On Wed, Oct 9, 2013 at 6:09 AM, Aaron Schulz aschulz4587@gmail.com wrote:
A smallish wiki with 10ks of pages and the full history and the table data (not just revision/page/*links stuff from dumps) would probably be useful. I'm not sure where the threshold roughly starts though.
We could get a script included in maintenance or at least vagrant to populate some abusefilters/etc. and then generate some fake accounts/prefs/edits/log entries/etc. maybe with selenium? (and the API. but the data won't be complete without edits made with an HTML interface.)
See https://github.com/fzaninotto/Faker or one of the several variants in other languages.
-Jeremy