I have answered to Rotem about the links. I have also open a bug on the Kiwix side: https://sourceforge.net/tracker/?func=detail&aid=2817440&group_id=17...
For the search engine index size, we have to search a solution with a smaller index. Starting with the openzim solution should be good. I will have a look during this week.
Emmanuel
Le lun 06/07/09 15:03, "Asaf Bartov" asaf.bartov@gmail.com a écrit:
Clarification:
This last message was by Rotem, a fellow WM-IL member helping me with the embedding of the Hebrew Wikipedia in the One Computer Per Child project.
He is reporting issues with Kiwix and the ZIM file I created last week.
Regarding size: Size is important, because we intend to add images (the 300MB ZIM file is the complete Hebrew Wikipedia text, but no pictures). We are hoping to have at least 5GB reserved for us in those One Computer Per Child machines we are to install on, but we may be forced to make do with 3GB. So every MB saved from the index, is another MB available for images...
Asaf Bartov Wikimedia Israel
On Mon, Jul 6, 2009 at 3:58 PM, Rotem Simha wrote:
- there are some errors in links of files and special pages
examples קובץ:Nuvola_apps_important.svg [1] link to ויקיפדיה:מיזמי ויקיפדיה/מיזם ערכים ללא תמונות/קטגוריות/ספורטאים איטלקים (wikipedia:wikipedia projects articles without imagescategoriesSports people from Italy) מיוחד:אקראי (Special:Random) > 15 במאי (may 15) מיוחד:שינויים אחרונים (Special:RecentChanges) > 10_באוגוסט
- size is important because we intend to add images
2009/7/6 Send dev-l mailing list submissions to dev-l@openzim.org
To subscribe or unsubscribe via the World Wide Web, visit https://intern.openzim.org/mailman/listinfo/dev-l [2] or, via email, send a message with subject or body help to dev-l-request@openzim.org
You can reach the person managing the list at dev-l-owner@openzim.org
When replying, please edit your Subject line so it is more specific than "Re: Contents of dev-l digest..."
Todays Topics:
1. Kiwix index size (Asaf Bartov) 2. Re: Kiwix index size (Manuel Schneider) 3. Re: Kiwix index size (Emmanuel Engelhart)
Message: 1 Date: Sun, 5 Jul 2009 19:18:57 +0300 From: Asaf Bartov Subject: [openZIM dev-l] Kiwix index size To: dev-l@openzim.org Message-ID: Content-Type: text/plain; charset="iso-8859-1"
Hi, everyone.
When running Kiwixs indexer on the ZIM file I had created from the Hebrew Wikipedia last week, the Kiwix data directory ran up to a total of 31 items, totalling 2.3 GB. The ZIM file itself is ~300MB. Does this proportion make sense?
Detailed ls output attached.
Thanks in advance,
Asaf Bartov
Asaf Bartov
Discussing the future format of the search index is also part of the upcoming developers meeting.
I'd like to encourage you to participate.
Vote for a weekend which will fit for you to have the developers meeting: http://www.doodle.com/xaf4tzpuwk2xf59h
Greets,
Manuel
Am Montag, 6. Juli 2009 15:15:45 schrieb emmanuel@engelhart.org:
I have answered to Rotem about the links. I have also open a bug on the Kiwix side: https://sourceforge.net/tracker/?func=detail&aid=2817440&group_id=17... id=873515
For the search engine index size, we have to search a solution with a smaller index. Starting with the openzim solution should be good. I will have a look during this week.
Emmanuel
Le lun 06/07/09 15:03, "Asaf Bartov" asaf.bartov@gmail.com a écrit:
Clarification:
This last message was by Rotem, a fellow WM-IL member helping me with the embedding of the Hebrew Wikipedia in the One Computer Per Child project.
He is reporting issues with Kiwix and the ZIM file I created last week.
Regarding size: Size is important, because we intend to add images (the 300MB ZIM file is the complete Hebrew Wikipedia text, but no pictures). We are hoping to have at least 5GB reserved for us in those One Computer Per Child machines we are to install on, but we may be forced to make do with 3GB. So every MB saved from the index, is another MB available for images...
Asaf Bartov Wikimedia Israel
On Mon, Jul 6, 2009 at 3:58 PM, Rotem Simha wrote:
- there are some errors in links of files and special pages
examples קובץ:Nuvola_apps_important.svg [1] link to ויקיפדיה:מיזמי ויקיפדיה/מיזם ערכים ללא תמונות/קטגוריות/ספורטאים איטלקים (wikipedia:wikipedia projects articles without imagescategoriesSports people from Italy) מיוחד:אקראי (Special:Random) > 15 במאי (may 15) מיוחד:שינויים אחרונים (Special:RecentChanges) > 10_באוגוסט
- size is important because we intend to add images
2009/7/6 Send dev-l mailing list submissions to dev-l@openzim.org
To subscribe or unsubscribe via the World Wide Web, visit https://intern.openzim.org/mailman/listinfo/dev-l [2] or, via email, send a message with subject or body help to dev-l-request@openzim.org
You can reach the person managing the list at dev-l-owner@openzim.org
When replying, please edit your Subject line so it is more specific than "Re: Contents of dev-l digest..."
Todays Topics:
1. Kiwix index size (Asaf Bartov) 2. Re: Kiwix index size (Manuel Schneider) 3. Re: Kiwix index size (Emmanuel Engelhart)
Message: 1 Date: Sun, 5 Jul 2009 19:18:57 +0300 From: Asaf Bartov Subject: [openZIM dev-l] Kiwix index size To: dev-l@openzim.org Message-ID: Content-Type: text/plain; charset="iso-8859-1"
Hi, everyone.
When running Kiwixs indexer on the ZIM file I had created from the Hebrew Wikipedia last week, the Kiwix data directory ran up to a total of 31 items, totalling 2.3 GB. The ZIM file itself is ~300MB. Does this proportion make sense?
Detailed ls output attached.
Thanks in advance,
Asaf Bartov
Asaf Bartov