On Mon, Dec 13, 2021 at 3:06 PM Federico Leva (Nemo) <nemowiki@gmail.com> wrote:
On 13/12/21 20:59, Samuel Klein wrote:
> 1. WikiTeam, we love you, what do you need to be more effective at
> archiving WM wikis?

Mainly, at least one volunteer willing to run the scripts in my stead so
that it doesn't always fall on me. This is either a babysitting task or a
coding task to make things more reliable, so it tends to be a project in
the region of dozens or possibly hundreds of hours of work.

Got it -- that's for all the scripts, not just Commons?
 
(You can save some work if you spend a few hundred dollars on good
equipment and/or happen to have several TB of fast internet-connected
disks a few network hops from SFMIX and the WMF.)

Some UC or other uni on Internet2 in the US perhaps...
 
There's also the problem that people thought it smart to mirror millions
of big Internet Archive files on Commons
 
We could use a dump that didn't include any files that have "source:IA" or an "archived-at" field.
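
A rough sketch of what that filter might look like, assuming each file in the dump comes with a metadata record as a plain dict. The record layout, the "title" key, and the helper names is_ia_mirror and filter_dump are hypothetical; only the "source" value "IA" and the "archived-at" field name come from the sentence above, and real Commons metadata may well be shaped differently.

```python
from typing import Iterable, Iterator


def is_ia_mirror(record: dict) -> bool:
    """Guess whether a file record describes an Internet Archive mirror.

    Assumption: a record is a flat dict; "source" and "archived-at"
    are illustrative field names, not a confirmed Commons schema.
    """
    source = str(record.get("source", "")).strip().lower()
    return source == "ia" or "archived-at" in record


def filter_dump(records: Iterable[dict]) -> Iterator[dict]:
    """Yield only the records that do not look like IA mirrors."""
    return (r for r in records if not is_ia_mirror(r))


# Hypothetical sample records, for illustration only.
sample = [
    {"title": "File:A.pdf", "source": "IA"},
    {"title": "File:B.jpg", "source": "own work"},
    {"title": "File:C.djvu", "archived-at": "placeholder"},
    {"title": "File:D.png"},
]

kept = [r["title"] for r in filter_dump(sample)]
print(kept)
```

With a filter like this the dump job would skip the IA-mirrored files up front instead of downloading and discarding them.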

At ArchiveTeam, as a
reference, we consider 2000 $/TB as the cost of an upload to IA.

Good to know, not cheap. Maybe not the right target for something we plan to replace / re-archive regularly.

S.