On Sunday 22 September 2002 05:38 am, Robert Graham Merkel wrote:
I'd like to suggest another occur at another milestone. I realise the problems with the article number targets, but they do serve as something useful to point to the media about. If we did, we'd of course have to be careful *not* to claim that "we have X number of articles, hence we're X/100,000 as good as Britannica at this point".
Exactly why I have mentioned several times now that we need to have a more conservative definition of what we let the software detect as articles. There seemed to be some consensus for this yet nothing has happened. I would write the new code myself but I know exactly zippo about php and have zero time to learn it.
We are already at 45,000+ "articles", which is way too close to the "50%" mark for my taste. In order to preserve legitimacy, we should be conservative in our total reporting of these numbers.
Living organisms requires a certain absolute minimum number of genes to be considered to be alive at all (which is around 300, but for practical purposes it is really a bit higher). I believe articles are similar and that it is not really possible to have an encyclopedia article on any subject that is less than 500 bytes (although for practical reasons this number is very often, but not always, too low).
So can we at least add a 500 byte filter to the current article count spec? As has already been suggested, we can also add a line to the stats page that lists these "entries previously detected as being articles but are less than 500 bytes".
If we did have such a spec then there would be more reason than ever for people to make sure sub-500 byte stubs are fleshed-out.
-- Daniel Mayer (aka mav)