I'm bumping this question because no one appears to have answered it yet, and I
think it is a good question that deserves an answer. I would like third-party
researchers who are trying to align their work with Wikimedia policies and
practices to get the support that they need to make that happen. I have heard
that WMF staff get requests for research
help on a regular basis, and the requests for assistance that are made on
the Analytics mailing list seem to be answered regularly, so I hope that
the same would happen on Research-l.
On Mon, Jun 4, 2018 at 6:46 AM, Yiqing Hua <yiqingh(a)google.com> wrote:
Lucas (cc'd) and I have been working on tools and corpora that transform
revision dumps into structured conversations; as part of this, we want to
make sure any down-stream services and corpora that we develop respect the
deleted (and suppressed) revisions; namely that we remove any copies we have
of things deleted on Wikipedia.
For that we need a way to:
1. get all revision IDs that were deleted or suppressed (or all non-deleted
and non-suppressed ones)
2. have a way to get new deletions or suppressions so that we can remove any
copies that we have.
What's the right infrastructure/APIs to use for this?
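One possible starting point, offered as a sketch rather than a confirmed answer: the public deletion log can be polled through the MediaWiki Action API (`action=query&list=logevents&letype=delete`). The code below assumes English Wikipedia as the target wiki, and it assumes that revision-deletion events carry the affected revision IDs under `params.ids`; both should be checked against live responses. The function names and the User-Agent string are hypothetical. Suppression (oversight) log entries are not publicly readable, so this approach would not cover requirement 1 for suppressed revisions.

```python
import json
import urllib.parse
import urllib.request

# Assumption: English Wikipedia; swap in the wiki you mirror.
API_URL = "https://en.wikipedia.org/w/api.php"


def build_deletion_log_params(since, cont=None):
    """Build query parameters for public deletion-log events at or after `since`."""
    params = {
        "action": "query",
        "list": "logevents",
        "letype": "delete",       # page deletions, restores, revision deletions
        "ledir": "newer",         # oldest first, starting at lestart
        "lestart": since,         # ISO 8601 timestamp of the last sync
        "leprop": "ids|title|type|timestamp|details",
        "lelimit": "max",
        "format": "json",
        "formatversion": "2",
    }
    if cont:
        # Merge the API's continuation values into the next request.
        params.update(cont)
    return params


def extract_removals(response):
    """Return (revision IDs touched by revision deletion, deleted page titles).

    Assumed response shape: each event has "action" and, for revision
    deletion, a "params" dict with an "ids" list. A "revision" event can also
    make a revision visible again, so a real tool should inspect the new
    visibility flags before purging anything.
    """
    revision_ids, deleted_titles = set(), set()
    for event in response.get("query", {}).get("logevents", []):
        action = event.get("action")
        details = event.get("params", {})
        if action == "revision":
            revision_ids.update(int(i) for i in details.get("ids", []))
        elif action == "delete" and "title" in event:
            deleted_titles.add(event["title"])
    return revision_ids, deleted_titles


def fetch_removals(since):
    """Page through the deletion log and collect everything since `since`."""
    revision_ids, deleted_titles = set(), set()
    cont = None
    while True:
        query = urllib.parse.urlencode(build_deletion_log_params(since, cont))
        request = urllib.request.Request(
            API_URL + "?" + query,
            headers={"User-Agent": "deletion-sync-sketch/0.1 (hypothetical)"},
        )
        with urllib.request.urlopen(request) as resp:
            data = json.load(resp)
        revs, titles = extract_removals(data)
        revision_ids |= revs
        deleted_titles |= titles
        cont = data.get("continue")
        if not cont:
            return revision_ids, deleted_titles
```

Calling `fetch_removals("2018-06-04T00:00:00Z")` on a schedule, and storing the timestamp of the last event seen, would cover requirement 2 for public deletions. Deleted page titles still have to be mapped back to the revision IDs held in the local copy.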
Wiki-research-l mailing list