[Wikimedia-l] Fire Drill Re: Wikimedia sites not easy to archive (Was Re: Knol is closing tomorrow )

Anthony wikimail at inbox.org
Thu May 17 12:15:26 UTC 2012


On Thu, May 17, 2012 at 8:11 AM, Thomas Dalton <thomas.dalton at gmail.com> wrote:
> On 17 May 2012 12:43, Anthony <wikimail at inbox.org> wrote:
>> In fact, I think someone at WMF should contact Amazon and see if
>> they'll let us conduct the experiment for free, in exchange for us
>> creating the dump for them to host as a public data set
>> (http://aws.amazon.com/publicdatasets/).
>
> What dump are you going to create? You are starting from a dump, why
> can't Amazon just host that?

Because the XML dump is semi-useless - it's compressed in all the
wrong places to use for an actual running system.

Anyway, looking at how the AWS Public Data Sets work, it probably
would be best not to even create a dump, but just put up the running
(object compressed) database.



More information about the Wikimedia-l mailing list