http://bugs.openzim.org/show_bug.cgi?id=34
Summary: zim files with umlaut in file name do not open
Product: openZIM
Version: unspecified
Platform: All
OS/Version: All
Status: NEW
Severity: normal
Priority: P5
Component: zimlib
AssignedTo: tommi(a)tntnet.org
ReportedBy: cip(a)gmx.at
CC: dev-l(a)openzim.org
Estimated Hours: 0.0
To reproduce the problem:
Rename a ZIM file so that its name contains an umlaut,
e.g. film.zim to filmö.zim.
Open it in Kiwix or WikiOnBoard.
Opening fails.
Error messages:
Kiwix: Unable to load ... filmö.zim. Sind sie sicher, dass dies eine ZIM-Datei
ist? (English: "Are you sure this is a ZIM file?")
WikiOnBoard: Error 86 opening file "... filmö.zim": Illegal byte sequence
(Tested with http://openzim.org/download/zim5/film.zim)
--
Configure bugmail: http://bugs.openzim.org/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug.
http://bugs.openzim.org/show_bug.cgi?id=18
Summary: debian needs an init.d script
Product: openZIM
Version: unspecified
Platform: PC
OS/Version: Linux
Status: NEW
Severity: enhancement
Priority: P5
Component: zimreader
AssignedTo: tommi(a)tntnet.org
ReportedBy: andyr(a)wizzy.com
CC: dev-l(a)openzim.org
Estimated Hours: 0.0
One attached.
Dear all,
Wikimania is approaching, and with it our first developers meeting
of 2011!
The openZIM team is happy to invite you to our first (really)
multinational developers meeting. After three meetings in central
Europe, attended mostly by people from that area, we are now going
to meet at Wikimania.
Several special conferences take place just before Wikimania. The openZIM meeting is on
August 2nd and 3rd - the two days right before Wikimania starts - at
Beit Hecht, part of the Wikimania venue.
Please sign up here and participate in the planning:
* http://wikimania2011.wikimedia.org/wiki/OpenZIM_Developers_Meeting
For dedicated offline people there is still budget left, so we can help
fund your participation in this meeting! Contact me about this.
I'd be happy to see you there!
Manuel
--
Regards
Manuel Schneider
Wikimedia CH - Verein zur Förderung Freien Wissens
Wikimedia CH - Association for the advancement of free knowledge
www.wikimedia.ch
http://bugs.openzim.org/show_bug.cgi?id=26
Summary: CRC check
Product: openZIM
Version: unspecified
Platform: All
OS/Version: All
Status: NEW
Severity: enhancement
Priority: P3
Component: zimlib
AssignedTo: tommi(a)tntnet.org
ReportedBy: emmanuel(a)engelhart.org
CC: dev-l(a)openzim.org
Estimated Hours: 0.0
Downloaded ZIM files are unfortunately not always complete and valid once the
download has finished. Consequently, users may have a bad experience without
any easy way to check whether the file is valid.
It would be great if zimwriter/zimlib provided an easy way to check the
integrity of a ZIM file.
A way to achieve that could be:
* At the end of the ZIM file creation process (once the file is written), compute a
checksum (e.g. MD5 or SHA-1) and append it to the end of the file.
* In zimlib, add a method checkIntegrity() or something similar that
computes the checksum of the file (excluding the stored checksum at the end) and
compares the two.
The hash algorithm should be fast and reliable (md5?).
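The two steps proposed above could be sketched roughly as follows. This is written in Python for brevity (zimlib itself is C++), and it is only an illustration: the function names append_checksum and check_integrity, and the choice of a raw 16-byte MD5 digest as the trailer, are assumptions and not part of zimwriter or zimlib.

```python
import hashlib
import os

DIGEST_SIZE = hashlib.md5().digest_size  # 16 bytes for MD5
CHUNK = 1 << 20  # read the file in 1 MiB chunks


def _md5_of_prefix(path, length):
    """Return the MD5 digest of the first `length` bytes of the file."""
    h = hashlib.md5()
    remaining = length
    with open(path, "rb") as f:
        while remaining > 0:
            block = f.read(min(CHUNK, remaining))
            if not block:
                break
            h.update(block)
            remaining -= len(block)
    return h.digest()


def append_checksum(path):
    """Hypothetical writer step: append the digest of the whole file to its end."""
    digest = _md5_of_prefix(path, os.path.getsize(path))
    with open(path, "ab") as f:
        f.write(digest)


def check_integrity(path):
    """Hypothetical reader step: recompute the digest over everything
    except the trailer and compare it with the stored trailer."""
    size = os.path.getsize(path)
    if size < DIGEST_SIZE:
        return False
    with open(path, "rb") as f:
        f.seek(size - DIGEST_SIZE)
        stored = f.read(DIGEST_SIZE)
    return _md5_of_prefix(path, size - DIGEST_SIZE) == stored
```

A truncated download would then fail check_integrity(), because the last 16 bytes of the truncated file no longer match the digest of the bytes before them.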
cross posting to Offline & Wikimedia-India.
Hi
I'm working on the Wikimedia Foundation's initiatives in India and I'm reaching out for help on a really exciting opportunity. Assam is a state in the North-East of India. The Government has an interesting scheme to give laptops to deserving students leaving secondary school (i.e., completed 10 years of education and about 15-16 year olds.) This scheme is being managed by an organisation called Amtron - who have issued a Tender for the procurement of these laptops.
There are 19,000 laptops that will be distributed in this initiative. (These are on Ubuntu Linux - and very reasonably configured.)
Assam has traditionally had problems with infrastructure, and Internet access is one of them. Someone who is supporting Amtron has asked the Foundation if we can give them an offline version of Wikipedia to pre-load onto these computers. Given that it is for the 15-16 age group, the content needs to be appropriate. They'd like it to cover topics of academic interest (e.g., classical sciences, humanities, literature and accountancy). Ideally we'd also like it to have articles on India (e.g., history, geography, culture, etc.) as well as other areas of general interest (e.g., music, sports, etc.). Currently, everything is required in English only.
While these laptops aren't necessarily going to be in classrooms, they will be with students, so it's safe to assume that other students, friends and relatives would use them too. Given the context of Assam, I think we can easily assume that 10 people would access each computer. That adds up to improving access for nearly 2,00,000 people! I'm really inspired by the potential of this partnership because it allows us huge scale with efficiency in effort.
I understand that Wikipedia for Schools is readying for release sometime in July 2011 - and the timing couldn't be better.
Can you help us out with
a) how you could help on adding the additional articles that this initiative would require? (You could also sign up on the Volunteer Page)
b) how fast this can be given to Amtron? (They are looking for the inputs in July 2011.)
c) any other ideas that you think might be useful?
Many thanks.
Best,
Hisham Mundol
Wikimedia India Programs
skype : hisham.wikimedia
gtalk : hmundol(a)wikimedia.org
twitter : @mundol
Yes, it would be great if we could support this as well.
Hisham Mundol
Wikimedia India Programs
skype : hisham.wikimedia
gtalk : hmundol(a)wikimedia.org
twitter : @mundol
On Jun 23, 2011, at 3:35 PM, Arun Ganesh wrote:
> There's a slightly nuttier scheme brewing in the south, where 900,000 laptops are going to be distributed to students next year [1]. Let me see if I can reach someone in the TN govt regarding this.
> -Arun
>
> [1] http://ibnlive.in.com/news/free-laptops-will-have-tn-logo-burnt-on-chips/16…
>
> --
> j.mp/ArunGanesh
> _______________________________________________
> Wikimediaindia-l mailing list
> Wikimediaindia-l(a)lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wikimediaindia-l
Hi Emmanuel,
As far as I remember, when we defined the metadata we also decided that these attributes are mandatory, while additional metadata (ideally also defined by Dublin Core) may be used.
Manuel Schneider
Sent via mobile phone (+49 170 7740589).
----- Reply message -----
From: emmanuel(a)engelhart.org
To: <dev-l(a)openzim.org>
Subject: [openZIM dev-l] Make Metadatas mandatory?
Date: Thu, Jun 16, 2011 12:38
Hi
With the multiplication of ZIM files and ZIM editors we need ways to
build overviews.
Without the metadata there is no easy way to build these overviews
(you have to do it by hand every time).
This is what I learned by coding the content manager of Kiwix, which
works now.
So I think having ZIM files with metadata is essential, maybe even
more than that.
I think we have to push the usage of this metadata (title,
description, creation date, creator, favicon, language) and I see 3
approaches:
(1) Make this information mandatory
(2) Make a recommendation but nothing blocking
(3) Do nothing and be optimistic
What would be your preferred approach?
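To make the difference between approaches (1) and (2) concrete, here is a minimal sketch in Python. It assumes the metadata has already been read into a plain dict; the key names and the functions missing_metadata and validate are hypothetical and only mirror the list of fields above, not any existing zimlib or Kiwix API.

```python
# Hypothetical key names, mirroring the fields listed in the message above.
REQUIRED_KEYS = ("title", "description", "date", "creator", "favicon", "language")


def missing_metadata(metadata):
    """Return the required keys that are absent or empty in `metadata`."""
    return [key for key in REQUIRED_KEYS if not metadata.get(key)]


def validate(metadata, strict=True):
    """strict=True models approach (1): missing metadata is an error.
    strict=False models approach (2): the caller only gets a list to warn about."""
    missing = missing_metadata(metadata)
    if missing and strict:
        raise ValueError("missing mandatory metadata: " + ", ".join(missing))
    return missing
```

With strict=False a tool could still build its overview and merely report which files are incompletely described.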
Regards
Emmanuel
_______________________________________________
dev-l mailing list
dev-l(a)openzim.org
https://intern.openzim.org/mailman/listinfo/dev-l
Sorry, now correctly cross posted.
Emmanuel
-------- Original Message --------
Subject: WMF XML dump title case problem
Date: Sun, 26 Jun 2011 17:07:19 +0200
From: Emmanuel Engelhart <emmanuel(a)engelhart.org>
To: Mailing list for Wikimedia CH <wikimediach-l(a)lists.wikimedia.org>,
offline-l(a)lists.wikimedia.org
Hi
Titles should be stored in the table "page" with the first letter uppercased.
http://en.wikipedia.org/wiki/Wikipedia:Naming_conventions_%28technical_rest…
Unfortunately, it seems that we have XML dumps (and consequently
mwdumper generated SQL) containing titles with a first letter lowercased.
For example:
$ wget http://download.wikimedia.org/mywiktionary/20110617/mywiktionary-20110617-p…
$ bzip2 -d -c mywiktionary-20110617-pages-articles.xml.bz2 | grep "<title>" | grep tationery | more
<title>stationery</title>
<title>stationery shop</title>
Is that a bug?
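To list every affected title rather than grepping for a single word, a small script along these lines could be used. It is a sketch in Python: the function names are made up for illustration, it assumes each <title> element sits on one line as in the output above, and it does not treat namespace prefixes specially.

```python
import bz2
import re

TITLE_RE = re.compile(r"<title>(.*?)</title>")


def lowercase_titles(lines):
    """Yield page titles whose first character is a lowercase letter."""
    for line in lines:
        match = TITLE_RE.search(line)
        if match:
            title = match.group(1)
            if title[:1].islower():
                yield title


def scan_dump(path):
    """Stream a bzip2-compressed XML dump and yield the lowercase-initial titles."""
    with bz2.open(path, "rt", encoding="utf-8") as dump:
        yield from lowercase_titles(dump)
```

Calling scan_dump("mywiktionary-20110617-pages-articles.xml.bz2") would then iterate over the same titles as the grep pipeline, without the restriction to "tationery".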
Regards
Emmanuel
Hi all,
Which encoding is used for article data, metadata, category data, ...,
and for the title and URL strings in the directory?
I could not find any documentation on this.
The simplest option for zim viewers would be to define that everything is
encoded in UTF-8. This should work with
all languages.
The other option would be to define the encoding either for the complete zim file
(e.g. in the metadata), or on a per-article basis (HTML tag in the header).
It would make sense to restrict the possible encodings to a small subset, as
otherwise readers would not be compatible with all zim files.
If per-article encoding is to be supported, it would be necessary to
specify the encoding of the directory entries separately.
The disadvantage of this approach is the higher complexity for the reader, in
particular with the per-article approach. Furthermore, the specification
becomes more complex (for example, it needs to define which encoding is used if no
encoding is specified in an article/metadata).
I'd prefer to simply define that everything is UTF-8, but I am not sure whether this
has drawbacks I am not aware of.
In any case, I think it is very important that we define something about encoding,
because otherwise we cannot reliably support zim files
in all languages.
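If everything were defined to be UTF-8, a reader or writer could check that rule very cheaply. The following Python sketch shows the idea; decode_utf8 is a hypothetical helper for illustration, not an existing zimlib function.

```python
def decode_utf8(raw):
    """Decode `raw` bytes as strict UTF-8; return None if they are not valid UTF-8."""
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        return None
```

For example, the title "filmö" encoded as UTF-8 decodes cleanly, while the same title encoded as Latin-1 (bytes ending in 0xF6) is rejected, so a file that violates the rule would be detected instead of being displayed with garbled characters.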
Best regards,
Christian