[Wikipedia-l] Categories indexed for largest wikipedias

Erik Zachte e.p.zachte at chello.nl
Tue Jan 4 03:51:40 UTC 2005


After rereading my explanation about the article counts, it still seems a
bit vague.
Let me try again:

On a first pass through the database, all categories and their mutual
relations are collected.

On a second pass, a list is compiled for each article of all categories that
are named in the article and of their supercategories, with duplicates
removed. Then every category in that list has its counter incremented by
one, so an article is counted at most once per category.
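
To make the two passes concrete, here is a rough sketch in Python. It is
not the actual script; the function name, the dict-based inputs and the
sample data are all made up for illustration. It also assumes that
"supercategories" means all ancestors (parents, grandparents and so on),
and it guards against category loops.

```python
from collections import Counter


def count_articles_per_category(parents, article_categories):
    """Count articles per category, including supercategories.

    parents: dict mapping a category to its direct supercategories
             (hypothetical result of the first pass).
    article_categories: dict mapping an article title to the categories
             named in that article (hypothetical input to the second pass).
    """
    counts = Counter()
    for categories in article_categories.values():
        # Collect the named categories plus all their ancestors.
        # Using a set removes duplicates and also stops on category loops.
        seen = set()
        stack = list(categories)
        while stack:
            cat = stack.pop()
            if cat in seen:
                continue
            seen.add(cat)
            stack.extend(parents.get(cat, ()))
        # Each category in the deduplicated list is incremented once.
        for cat in seen:
            counts[cat] += 1
    return counts


# Made-up sample data: "Atom" reaches "Science" via two routes,
# but is still counted only once there.
parents = {"Physics": ["Science"], "Chemistry": ["Science"], "Science": []}
articles = {"Atom": ["Physics", "Chemistry"], "Quark": ["Physics"]}
counts = count_articles_per_category(parents, articles)
```

With this sample data "Science" ends up with a count of 2, not 3, because
the duplicate for "Atom" is removed before the counters are incremented.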

Erik Zachte




