I'm working on the cleanup right now. I wrote a script for it.
The csv files are cleaned up now. The html files still contain private wiki data.
This week new counts have been generated from recent dumps (as so often, wp:en: is lagging behind). I hope to start the reports job today to produce clean and up-to-date html files. I'll post on wikitech when all is finished.
Some background on what happened: someone created a new wiki and forgot to mark it as confidential. The Wikistats job processes all wikis listed in several *.dblist files in /home/wikipedia/commons, so the new wiki was included automatically. Hence confidential article titles were browsable through the ZeitGeist feature.
(Brion, wikimania2007 is in the wikipedia dblist; I'd expect it in special.dblist. Anyway, all mania dumps are now excluded.)
I'd rather not do this cleanup job a second time. From now on I'll use copies of the dblists and sync updates to those copies manually every now and then.
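To illustrate the idea, here is a minimal Python sketch of selecting wikis from local dblist copies while skipping excluded ones. It is not the actual Wikistats code. It assumes each *.dblist file holds one database name per line; the function names, the exclusion parameters, and the example wiki names in the usage note are hypothetical.

```python
from pathlib import Path


def read_dblist(path):
    """Return the set of database names in one dblist file.

    Assumption: one database name per line; blank lines are ignored.
    """
    names = set()
    for line in Path(path).read_text().splitlines():
        name = line.strip()
        if name:
            names.add(name)
    return names


def wikis_to_process(dblist_dir, excluded_names=(), excluded_substrings=()):
    """Collect names from every *.dblist file in dblist_dir (a local copy),
    then drop excluded wikis.

    excluded_names: exact database names to skip (hypothetical parameter).
    excluded_substrings: skip any name containing one of these, for
    example "mania" (hypothetical parameter).
    """
    selected = set()
    for path in sorted(Path(dblist_dir).glob("*.dblist")):
        selected |= read_dblist(path)
    return sorted(
        name
        for name in selected
        if name not in excluded_names
        and not any(part in name for part in excluded_substrings)
    )
```

For example, wikis_to_process("dblist-copies", excluded_names={"secretwiki"}, excluded_substrings=("mania",)) would return every listed wiki except "secretwiki" and any wiki whose name contains "mania". Because the job reads the local copies rather than the live lists, a newly created wiki is not picked up until the copies are synced by hand.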
Erik Zachte
-----Original Message-----
From: Brion Vibber [mailto:brion@pobox.com]
Sent: Sunday, 19 November 2006 18:43
To: wikipedia-l@Wikimedia.org
Subject: Re: [Wikipedia-l] Wikipedia Statistics
Brion Vibber wrote:
Parker Conrad wrote:
Hi -- there used to be (as of a month or two ago) a very useful website at
included historical Wikipedia growth statistics -- very helpful for those of us who are trying to study the phenomenal growth of this community. Unfortunately, it appears to have been taken down, or perhaps moved. Does anyone know where I can access it?
These pages are down pending removal of some accidentally added private info.
I'll go ahead and try to fix these tonight; hopefully it won't be that hard...
-- brion vibber (brion @ pobox.com)
_______________________________________________
Wikipedia-l mailing list
Wikipedia-l@Wikimedia.org
http://mail.wikipedia.org/mailman/listinfo/wikipedia-l