In the German wikipedia a list of the used qualifiers in the titles was discussed, and most of the participants think it will be a interesting feature.
I know I have expressed it wrong, therefore an example:
Cell (biology) is a homonym (cell) with a qualifier (biology). To get a proper list of those qualifiers and modify or eliminate wrong ones, a list would be very helpful.
The discussion result was that such a unique list of those qualifiers from titles (table cur) and bl_to (brokenlinks) would make sense.
Unfortunally I can give you only a proper PostgerSQL select statement (only table cur) but possibly someone can transfer this easy to mySQL:
======== WARNING: THIS IS NOT A VALID STATEMENT FOR WIKIPEDIA ========== SELECT DISTINCT substring(cur_title FROM '.+\((.+)\)') AS p FROM cur; ========================= I WARNED YOU ================================= -- "\(" ==> ( (needed for quoting) -- (.+) ==> the () - construct is used to select the part substring will return.
For the titles "foo", "foo (bar)", "foo2 (bar)" and "bar (foo)" the result will be "bar" and "foo".
This should only show the as-is state of these qualifiers! There is no intention for any automated process to enforce them, because no Wikipedian should get an error like "Qualifier not allowed" or so. This page is only for administrational and informational purposes!
Of course additional features, like showing the matched pages and others, would be nice, but there the discussion must go on further, IMHO.
If there are more questions, ask, I will try to answer them.
Smurf