On Mon, 21 Mar 2005 06:25:08 +0100, Erik Zachte e.p.zachte@chello.nl wrote:
Finally, new wikistats.
My hiero! Looks fantaastic.
It took a couple of weekends to get the scripts up to date for the new database format.
I hope you're preparing in advance for 1.5. ;-)
Hierarchical category trees per Wikipedia (some are huge!): http://en.wikipedia.org/wikistats/EN/CategoryOverviewIndex.htm
Yes, huge... we need to find a better way of displaying these. There are some interesting tree-mapping packages out there...
Keeping everything within reasonable memory boundaries was more difficult, wherever possible data are written to disk in several bins (e.g. one intermediate file per month history), sorted per bin, then merged before readback.
Erik Zachte
How long does calculation take now? If there were a machine dedicated to stats of various kinds, with its own mirror of the db, could this be done more efficiently as a running-total, updated whenever the db was updated?