Robert Stojnic schrieb:
Aryeh Gregor wrote:
Right. Supporting category intersection and
search in category with
better UI (we already sort of support it if you know the right magic
terms) is what we should be aiming for here.
Last year, just around this time, we came to the exactly same
conclusion. And similarly like then, there is no shortage of good
opinions on how to do it, but people to actually do the programming.
r.
Wikimedia Germany has contracted Neil Harris to work on implementing deep
category intersection. The goal is basically a rewrite of my sucky CatScan tool.
The result is hopefully fast & generic enough so it can be used as a service
that integrates with the current search infrastructure.
The project has started, there is funding and a project plan. I expect to see
usable results soon. In fact, I hope to present this at the developer meeting in
april (neil, contact me about attending) and discuss the integration into lucene
search.
I agree that full recursive flattening of the current category structure leads
to bad results some times (especially on the english wikipedia, commons is quite
bad too), a depth of 5 however is generally useful. One common use case is
intersecting a content category with a maintenance category, for organizing
editorial work in a wiki project. In that case, at least one category comes from
a template.
Atomic categorization aka tagging however also sucks: the tags are either too
generic (so it's hard to find stuff) or too specific (you never know what to
search for). tags implying/including other tags is very useful. which is exactly
what categories with deep intersection will provide.
-- daniel