[openZIM dev-l] wide character filenames support - Offline-l

17 Nov 2010


      Hi,
I have discovered an new bug on Kiwix for Windows:
 Kiwix is not able to open filenames containing accents, something 
 containing "wikipédia" for example.
This is a Windows specific issue, if I do the same on GNU/Linux, it 
 works.
I think the explanation is that ext4 uses UTF8 as charset and NTFS, 
 UTF16... and a code point in UTF16 needs 2 bytes although in UTF8, it 
 needs only one.
 As the method to open a ZIM file takes a char* as argument (so base on 
 1 byte) it works with UTF8.
This is a pretty new problematic for me and I simply try to understand 
 what need to be fixed.
 So feel free to give here a feedback.
For example, this is not clear if we need absolutely an (additional) 
 method accepting wide chars to open such files in the zimlib?
 This is also not clear for me if this is possible to build a generic 
 and portable solution here?
Regards
 Emmanuel