[Wikipedia-l] New Wikistats

Sj 2.718281828 at gmail.com
Thu Mar 24 04:06:36 UTC 2005


On Mon, 21 Mar 2005 06:25:08 +0100, Erik Zachte <e.p.zachte at chello.nl> wrote:
> Finally, new wikistats.

My hero!  Looks fantastic.

> It took a couple of weekends to get the scripts up to date for the new
> database format.

I hope you're preparing in advance for 1.5.  ;-)

> Hierarchical category trees per Wikipedia (some are huge!):
> http://en.wikipedia.org/wikistats/EN/CategoryOverviewIndex.htm

Yes, huge... we need to find a better way of displaying these.   There
are some interesting tree-mapping packages out there...
 
> Keeping everything within reasonable memory boundaries was more difficult,
> wherever possible data are written to disk in several bins (e.g. one
> intermediate file per month history), sorted per bin, then merged before
> readback.
> 
> Erik Zachte
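
For anyone following along, here is a rough sketch of the bin, sort,
merge idea as I understand it from the description above. This is only
my guess at the general technique, not the actual wikistats code; the
function name binned_sort, the ".bin" file suffix, and the "first seven
characters are the month" record layout are all made up for illustration.

```python
import heapq
import os


def binned_sort(records, bin_key, workdir):
    """Yield all records in sorted order while keeping memory use small.

    records: iterable of text lines (no trailing newline)
    bin_key: function mapping a line to a bin name (hypothetical: its month)
    workdir: existing directory for the intermediate bin files
    """
    # Pass 1: spill each record to the file for its bin.
    handles = {}
    for line in records:
        name = bin_key(line)
        if name not in handles:
            handles[name] = open(os.path.join(workdir, name + ".bin"), "w")
        handles[name].write(line + "\n")
    for fh in handles.values():
        fh.close()

    # Pass 2: sort each bin separately, so only one bin is in memory at a time.
    paths = []
    for name in handles:
        path = os.path.join(workdir, name + ".bin")
        with open(path) as fh:
            lines = sorted(fh)
        with open(path, "w") as fh:
            fh.writelines(lines)
        paths.append(path)

    # Pass 3: k-way merge of the sorted bins, streaming one line per bin.
    files = [open(p) for p in paths]
    try:
        for line in heapq.merge(*files):
            yield line.rstrip("\n")
    finally:
        for fh in files:
            fh.close()
```

With one bin per month, something like
binned_sort(lines, lambda s: s[:7], "/some/tmp/dir") would stream the
merged result back without ever holding the full history in memory.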

How long does calculation take now?  If there were a machine dedicated
to stats of various kinds, with its own mirror of the db, could this
be done more efficiently as a running-total, updated whenever the db
was updated?
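
To make the running-total idea concrete, here is a minimal sketch of
what I have in mind. Everything in it is an assumption on my part: the
RunningStats class, the event fields "title" and "timestamp", and the
choice of counters are hypothetical and not taken from the wikistats
scripts or the db schema.

```python
from collections import defaultdict


class RunningStats:
    """Keep totals up to date per edit instead of recomputing from a dump."""

    def __init__(self):
        self.edits_per_month = defaultdict(int)  # "YYYY-MM" -> edit count
        self.titles = set()                      # distinct pages seen so far

    def apply(self, event):
        # event is a hypothetical dict, e.g.
        # {"title": "Foo", "timestamp": "2005-03-24T04:06:36Z"}
        month = event["timestamp"][:7]
        self.edits_per_month[month] += 1
        self.titles.add(event["title"])

    def summary(self):
        # Cheap to call at any time: no rescan of the history is needed.
        return {
            "pages": len(self.titles),
            "edits": sum(self.edits_per_month.values()),
        }


stats = RunningStats()
for ev in [
    {"title": "Foo", "timestamp": "2005-03-21T06:25:08Z"},
    {"title": "Bar", "timestamp": "2005-03-24T04:06:36Z"},
    {"title": "Foo", "timestamp": "2005-04-01T00:00:00Z"},
]:
    stats.apply(ev)
```

Each update costs about the same no matter how long the history is,
which is the attraction; the catch is that any statistic that cannot be
updated incrementally would still need a periodic full pass.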

-- 
+sj+


