On 16 September 2010 17:58, Erik Moeller erik@wikimedia.org wrote:
For privacy reasons, we can't back up all data everywhere (e.g. user account information) -- it might be worth thinking about longer term strategies for portability of that data (e.g. a group of unaffiliated entrusted individuals who hold encryption keys). But, for the publicly available dumps, I don't see a list of mirrors prominently linked from http://dumps.wikimedia.org/backup-index.html -- I think starting a page at http://meta.wikimedia.org/wiki/Data_dumps/Mirrors with mirroring instructions (if such a page doesn't already exist somewhere), prominently highlighting it at dumps.wikimedia.org, and spreading the word would be a good start. We are already generating MD5s, so it shouldn't be hard for engaged community members to help with standard/policy setting, verification of mirror status, etc.
Surely dumps would be a natural for the Internet Archive and the Library of Congress.
- d.