Hi Erik, I'm cross-posting this message to wikisource-l in case anyone there would like to give some input.

The table at http://stats.wikimedia.org/wikisource/EN/TablesDatabaseWords.htm seems to be inaccurate. Apparently your tool counts only words in the main namespace. That may work for projects like Wikipedia, with their very long discussion pages in the Project: namespace on some subjects (such as deletion requests), but it doesn't work for Wikisource, for two main reasons:

1) Some subdomains have custom namespaces for short biographies and lists of works by author (en, it, pt and others), while others keep these pages in the main namespace (fr, de, es and others). This is a minor issue, since the number of words on those pages is small.

2) Some Wikisources (de, fr and en, according to http://wikisource.org/wiki/Wikisource:ProofreadPage_Statistics ) have a large amount of content in a custom namespace devoted to the ProofreadPage extension ( http://www.mediawiki.org/wiki/Extension:Proofread_Page ). This content is displayed in the main namespace through page transclusion (see http://en.wikisource.org/w/index.php?title=35_Sonnets&action=edit for an example).

Is it possible to include the custom namespaces of all Wikisources in your automated calculation tool?
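
To illustrate what I mean, here is a rough sketch in Python. It is not your tool's actual code: the namespace names, the sample pages and the `count_words` helper are all made up for the example, and I'm assuming the count can be restricted to a configurable set of namespaces per wiki.

```python
import re
from collections import Counter

# Hypothetical sample data: (namespace name, raw page text) pairs.
# Real namespace names differ from one Wikisource subdomain to another.
SAMPLE_PAGES = [
    ("Main", "35 Sonnets"),
    ("Page", "text of the first proofread page"),
    ("Page", "text of the second proofread page"),
    ("Author", "short biography and list of works"),
    ("Project", "a long deletion request discussion"),
]

WORD_RE = re.compile(r"\w+")


def count_words(pages, namespaces):
    """Return a per-namespace word count, limited to the given namespaces."""
    totals = Counter()
    for namespace, text in pages:
        if namespace in namespaces:
            totals[namespace] += len(WORD_RE.findall(text))
    return totals


# Counting the main namespace only misses the transcluded text.
main_only = count_words(SAMPLE_PAGES, {"Main"})

# Adding the per-wiki custom namespaces picks it up.
with_custom = count_words(SAMPLE_PAGES, {"Main", "Page", "Author"})

print(sum(main_only.values()), sum(with_custom.values()))
```

With a per-wiki set of namespaces like this, the Project: discussions stay out of the total while the proofread pages and author pages are counted.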

[[:m:User:555]]