On sab, 2003-02-08 at 13:03, Magnus Manske wrote:
It seems that in each database dump, we have the
"search indexed"
article text as well, which contains the same text as the article, but
without special chars.
Can we not dump that field next time? It would reduce the file size (and
download time) by about 50%!
Already done!
The most recent revision of the software splits the search index fields
into a separate table (so the main tables can optionally use InnoDB
table type), with the side effect that the current revision dumps are
decreased in size by about 30%. (Not quite 50%; there's a lot of
redundancy between the text and the indexable text.) Here's the Dutch
dumps before and after:
4477364 Feb 2 08:17 20030202_cur_table.sql.bz2
2786244 Feb 4 08:52 20030204_cur_table.sql.bz2
Currently only English, Dutch, Esperanto, Polish, and Meta are on that
revision; I'll be upgrading the rest today, but I still haven't gotten
language file revisions for most languages, so some new bits of the
interface will appear in English.
-- brion vibber (brion @
pobox.com)