On Apr 2, 2004, at 10:14, Ray Saintonge wrote:
Brion Vibber wrote:
On Apr 2, 2004, at 00:53, Jimmy Wales wrote:
It shouldn't run more than once per day at first. I'm not sure what their goals are with respect to how often they would *like* to receive it, but daily is a fine start.
It would take hours just to run a complete dump, which would be the equivalent of a sizeable fraction of our total daily page views. (Best case might be 100ms per page for 240,000 pages =~ 6 hours 40 minutes)
If we're going to run something like this daily, some sort of incremental updates are a must, though we can probably get away with stuffing the saved data per page in a database or such and slurping it back out fairly quickly.
I can't see where the proposal that some have made to have all the projects on one database would help this situation.
I can't see how it would hurt, either. In fact it doesn't seem to have any bearing on this at all.
-- brion vibber (brion @ pobox.com)