On Mon, Dec 13, 2021 at 3:06 PM Federico Leva (Nemo) nemowiki@gmail.com wrote:
Il 13/12/21 20:59, Samuel Klein ha scritto:
- WikiTeam, we love you, what do you need to be more effective at
archiving WM wikis?
Mainly, at least one volunteer willing to run the scripts in my stead so that it doesn't always fall on me. This is either a babysitting or a coding task to make things more reliable so it tends to be a project in the region of dozens or possibly hundreds of hours of work.
Got it -- that's for all the scripts, not just commons?
(You can save some work if you spend a few hundred dollars on good equipment and/or happen to have several TB of fast internet-connected disks few network hops from SFMIX and the WMF.)
Some UC or other uni on Internet2 in the US perhaps...
There's also the problem that people thought it smart to mirror millions of big Internet Archive files on Commons
We could use a dump that didn't include any files that have "source:IA" or an "archived-at" field.
At ArchiveTeam, as a
reference, we consider 2000 $/TB as a cost of an upload to IA.
Good to know, not cheap. Maybe not the right target for something we plan to replace / re-archive regularly.
S.