On Thu, 16 Dec 2010 02:21:34 +1100, Andrew Dunbar hippytrail@gmail.com wrote:
I've long been interested in offline tools that make use of WikiMedia information, particularly the English Wiktionary.
I've recently come across a tool which can provide random access to a bzip2 archive without decompressing it and I would like to make use of it in my tools but I can't get it to compile and/or function with any free Windows compiler I have access to. It works fine on the *nix boxes I have tried but my personal machine is a Windows XP netbook.
The tool is "seek-bzip2" by James Taylor and is available here: http://bitbucket.org/james_taylor/seek-bzip2
- The free Borland compiler won't compile it due to missing (Unix?)
header files
- lcc compiles it but it always fails with error "unexpected EOF"
- mingw compiles it if the -m64 option is removed from the Makefile
but it then has the same behaviour as the lcc build.
My C experience is now quite stale and my 64-bit programming experience negligible.
(I'm also interested in hearing from other people working on offline tools for dump files, wikitext parsing, or Wiktionary)
Have a look to the openZim project, we have launched exactly for this purpose: * free software * supported by WMF * compiles on many systems * extremly fast * use of LZMA2 (better than bzip2) * though primarily for Wikipedia content * used in many other software (like Kiwix for example) * ...
Emmanuel