On Thu, May 17, 2012 at 8:11 AM, Thomas Dalton <thomas.dalton(a)gmail.com> wrote:
On 17 May 2012 12:43, Anthony
<wikimail(a)inbox.org> wrote:
In fact, I think someone at WMF should contact
Amazon and see if
they'll let us conduct the experiment for free, in exchange for us
creating the dump for them to host as a public data set
(
http://aws.amazon.com/publicdatasets/).
What dump are you going to create? You are starting from a dump, why
can't Amazon just host that?
Because the XML dump is semi-useless - it's compressed in all the
wrong places to use for an actual running system.
Anyway, looking at how the AWS Public Data Sets work, it probably
would be best not to even create a dump, but just put up the running
(object compressed) database.