Hi, everyone.
When running Kiwix's indexer on the ZIM file I had created from the Hebrew Wikipedia last week, the Kiwix data directory ran up to a total of 31 items, totalling 2.3 GB. The ZIM file itself is ~300MB. Does this proportion make sense?
Detailed ls output attached.
Thanks in advance,
Asaf Bartov -- Asaf Bartov asaf@forum2.org
Hi Asaf,
Am Sonntag, 5. Juli 2009 schrieb Asaf Bartov:
When running Kiwix's indexer on the ZIM file I had created from the Hebrew Wikipedia last week, the Kiwix data directory ran up to a total of 31 items, totalling 2.3 GB. The ZIM file itself is ~300MB. Does this proportion make sense?
I am not sure about the other files which were created, you only need the ZIM file with the index itself.
For 900'000 articles the ZIM file containing the articles was 1.4 GB, the Index ZIM was 1.0 GB.
So I think 300 MB looks fine.
Greets,
Manuel
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
Hi Asaf Asaf Bartov a écrit :
When running Kiwix's indexer on the ZIM file I had created from the Hebrew Wikipedia last week, the Kiwix data directory ran up to a total of 31 items, totalling 2.3 GB. The ZIM file itself is ~300MB. Does this proportion make sense?
this is possible. Kiwix uses the Xapian search engine which generates pretty big index files.
I have to questions: * Are the search results OK? * Do you have a problem with the size of the index? Do you have a size limit?
They are many open search/index softwares. I choose to use Xapian for many reasons, but this is possible under certain condition to add to Kiwix the support to an another search engine. This should be also possible to make a modified version of the indexer using less disk space (but with less words indexed).
OpenZIM itself provides a search solution, Tommi can explain you more about it. Maybe it would be interesting for you to test it and give us a feedback!
Regards Emmanuel