Re: [Wikitech-l] dump format

6 Jun 2005


      elwp@gmx.de wrote:
...
Not only history blobs can benefit from splitting revision
texts into sections and sorting them. The sizes of XML exported
pages (with complete page histories) can also be reduced.
This is the current structure (only relevant tags):
<page>
  <revision><text>text0</text></revision>
  <revision><text>text1</text></revision>
</page>
This would be the new structure:
<page>
  <section>sectiontext0</section>
  <section>sectiontext1</section>
  <section>sectiontext2</section>
  <revision><text type="sectionlist">0 1</text></revision>
  <revision><text type="sectionlist">0 2</text></revision>
</page>
Can you show that this does significantly better than gzip? Certainly it
won't simplify dump processing.
-- brion vibber

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

2003

2002

Re: [Wikitech-l] dump format