Re: [WikiEN-l] Wikipedia in a box

14 Aug 2007


      On 8/14/07, Anthony wikimail@inbox.org wrote:
...
On 8/14/07, David Gerard dgerard@gmail.com wrote:
...
http://it.slashdot.org/article.pl?sid=07/08/13/1939231
bzip2recover.  Genius.  I've been wanting to do something like this
for a long time, and the one thing standing in my way was that I
couldn't figure out how to do the random access bit.
It appears to have a few flaws, though:
http://slashdot.org/comments.pl?sid=268617&cid=20222493
Basically, the approach of splitting the database into 900 kB chunks
means that you may end up splitting the XML headers between chunks,
making indexing miss a few articles.
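
To make the boundary problem concrete, here is a minimal sketch in Python. It is not the code from the Slashdot article: it works on already-decompressed text, uses a tiny chunk size in place of 900 kB, and the function names (split_chunks, index_naive, index_with_carry) are made up for illustration. It assumes each article is introduced by a <title>...</title> element on a single line, and it only collects titles rather than building a real title-to-chunk index.

```python
import re

# Assumption: each article has a single-line <title>...</title> element.
TITLE_RE = re.compile(r"<title>(.*?)</title>")
OPEN_TAG = "<title>"


def split_chunks(text, size):
    """Cut text into fixed-size pieces, ignoring where XML tags fall."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def index_naive(chunks):
    """Scan every chunk on its own; a title cut by a boundary is missed."""
    found = []
    for chunk in chunks:
        found.extend(m.group(1) for m in TITLE_RE.finditer(chunk))
    return found


def index_with_carry(chunks):
    """Scan chunks in order, carrying any unfinished title into the next one."""
    found = []
    carry = ""
    for chunk in chunks:
        buf = carry + chunk
        last_end = 0
        for m in TITLE_RE.finditer(buf):
            found.append(m.group(1))
            last_end = m.end()
        # An opening tag with no closing tag yet: keep everything from it.
        start = buf.find(OPEN_TAG, last_end)
        if start == -1:
            # Otherwise keep just enough characters to hold a partial "<title>".
            start = max(last_end, len(buf) - (len(OPEN_TAG) - 1))
        carry = buf[start:]
    return found
```

With three small pages and a chunk size chosen so that the boundaries land inside the second and third titles, index_naive finds only the first title, while index_with_carry finds all three. A real indexer would also need to record which chunk each title starts in, which this sketch leaves out.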
