-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Hi Andy,
Andy Rabagliati a écrit :
On Mon, 16 Nov 2009, Andy Rabagliati wrote:
I would like to see categories - do they work ?
It depends what do you mean exactly.
In every dump I currently do (http://tmp.kiwix.org/zim/), categories are avoided and I think it will stay like that as long the ZIM format does not support it natively.
So, the ZIM format does not support natively a cat. system but this is a feature request: http://bugs.openzim.org/show_bug.cgi?id=1
... but every ZIM creator can make the choice to integrate categories as HTML pages (but not so trivial IMO).
Searches can be title search, article lede search, full text search.
I understand your problem. In fact, we have many search engine solutions but currently nothing which seems to be really what you need (easy to index a ZIM file and available to build an HTTP server). This is a pity because this would really good and the most complicated part of the code is already there.
Tommi, maybe you can help Andy to work with the openzim search engine? Andy has been working since a long time in South Africa to spread Wikipedia content (offline).
These indexes http://ai.cs.utsa.edu/wikipedia0.7/ seem to have been built using categories.
This dump is one I have build (maybe extract from the ZIM)... but a little bit modified. This a pretty interesting url, would be great to know how the dev. behind have done exactly... maybe you would be able to do the same.
Is that a part of the zim file too ?
Nothing to do with openzim... but he re-uses my work.
Maybe that is good enough ?
Maybe :)
Good enough for search, but not good enough for the page until the category link is on the page itself, so we can easily go from "Gamma ray burst" to other pages in the category Astronomy.
I think you won't have that soon because: * I didn't it (and all WP0.7 content are issue from my original ZIM file) * I know nobody which is currently able to do that cleanly.
Regards Emmanuel