About the dead links, few thinks:
* Are you sure the problem is not at the source (HTML files)
* the zimwriter does not check if all links in all HTML pages are OK
* it seems that the libzim returns a bad content if the content does not exist (Tommi can
you confirm?). Should returns nothing or an error code IMO.
Emmanuel
Le lun 06/07/09 14:58, "Rotem Simha" hidroo(a)gmail.com a écrit:
> * there are some errors in links of files and special pages
> examples
> קובץ:Nuvola_apps_important.svg [1] link to
> ויקיפדיה:מיזמי ויקיפדיה/מיזם ערכים
> ללא תמונות/קטגוריות/ספורטאים איטלקים
> (wikipedia:wikipedia projects articles without imagescategoriesSports
> people from Italy)
> מיוחד:אקראי (Special:Random) > 15 במאי (may 15)
> מיוחד:שינויים אחרונים (Special:RecentChanges) >
> 10_באוגוסט
>
> * size is important because we intend to add images
>
> 2009/7/6
> Send dev-l mailing list submissions to
> dev-l(a)openzim.org
>
> To subscribe or unsubscribe via the World Wide Web, visit
>
https://intern.openzim.org/mailman/listinfo/dev-l [2]
> or, via email, send a message with subject or body help to
> dev-l-request(a)openzim.org
>
> You can reach the person managing the list at
> dev-l-owner(a)openzim.org
>
> When replying, please edit your Subject line so it is more specific
> than "Re: Contents of dev-l digest..."
>
> Todays Topics:
>
> 1. Kiwix index size (Asaf Bartov)
> 2. Re: Kiwix index size (Manuel Schneider)
> 3. Re: Kiwix index size (Emmanuel Engelhart)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sun, 5 Jul 2009 19:18:57 +0300
> From: Asaf Bartov
> Subject: [openZIM dev-l] Kiwix index size
> To: dev-l(a)openzim.org
> Message-ID:
>
>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Hi, everyone.
>
> When running Kiwixs indexer on the ZIM file I had created from the
> Hebrew
> Wikipedia last week, the Kiwix data directory ran up to a total of
> 31 items,
> totalling 2.3 GB. The ZIM file itself is ~300MB. Does this
> proportion make
> sense?
>
> Detailed ls output attached.
>
> Thanks in advance,
>
> Asaf Bartov
> --
> Asaf Bartov
>