The big advantage of pbzip2 (if it's the program I've been looking at) is the unzip speed. I can't keep everything unzipped, and some utilities can't read bz2 files. Every time I unzip pages-articles I am in for a long wait before I get an error...
LZMA also offers much faster unzipping at the cost of a little more zipping time. Since many unzips occur and only one zip, this seems like a good deal.
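For anyone who wants to check the unzip-speed claim on their own machine, here is a rough sketch using Python's standard bz2 and lzma modules. The sample data and the helper name time_decompress are made up for illustration; a real test should use a slice of an actual dump, and the results will vary with data and hardware.

```python
import bz2
import lzma
import time


def time_decompress(decompress, blob, repeats=3):
    """Return the best wall-clock time in seconds over `repeats` runs."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        decompress(blob)
        best = min(best, time.perf_counter() - start)
    return best


# Hypothetical stand-in for dump content: repetitive XML-like text.
sample = (
    b"<page><title>Example</title>"
    b"<text>Lorem ipsum dolor sit amet.</text></page>\n"
) * 20000

# Compress once with each format (default settings for both).
bz2_blob = bz2.compress(sample)
xz_blob = lzma.compress(sample)

# Time decompression only, since that is the operation repeated most often.
bz2_seconds = time_decompress(bz2.decompress, bz2_blob)
xz_seconds = time_decompress(lzma.decompress, xz_blob)

print(f"bz2:  {len(bz2_blob)} compressed bytes, {bz2_seconds:.4f} s to decompress")
print(f"lzma: {len(xz_blob)} compressed bytes, {xz_seconds:.4f} s to decompress")
```

This only measures in-memory decompression of synthetic data, so treat the numbers as a starting point rather than a verdict.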
On 28/01/2012 23:11, Richard Jelinek wrote:
On Sat, Jan 28, 2012 at 11:35:56PM +0100, Platonides wrote:
On 28/01/12 00:38, Richard Jelinek wrote:
So to sum up: It's a no-lose, two-win situation if you migrate to pbzip2. And that just because pbunzip2 is slightly buggy. Isn't that interesting? :-)
Note that pbzip2 files are usually larger. And with our dump sizes, a small percentage increase could be "a lot". :)
Strange, I thought I was the pedantic one. ;-)
For the German Wikipedia archive as of 2012-01-16 it's
2556353564 (bzip2) vs. 2557360903 (pbzip2)
that seems to be 0.039% more; I would call that a small percentage.
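The arithmetic behind that figure can be checked with a few lines of Python. The two sizes are the ones quoted above (presumably in bytes, which is an assumption); the variable names are just for illustration.

```python
# Archive sizes quoted above for the German Wikipedia dump of 2012-01-16
# (assumed to be in bytes).
bzip2_size = 2556353564
pbzip2_size = 2557360903

# Absolute and relative overhead of the pbzip2 file.
extra_bytes = pbzip2_size - bzip2_size
percent = 100 * extra_bytes / bzip2_size

print(f"{extra_bytes} more, i.e. {percent:.3f}% larger")
```

So the difference is about 1 MB on a file of roughly 2.5 GB.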
regards,