Re: [Foundation-l] Printed on Demand Books from Wikipedia articles

13 Jul 2006

Anthony schrieb:
...
  On 7/13/06, Volker Haas
&lt;volker.haas(a)brainbot.com&gt; wrote:

  Hi Robert

 Robert Scott Horning wrote:

  I am curious how you plan to compile these books
in a format that looks
 good on paper and not strictly as a web format.  While this can be
 automated to an extent, I do think there are some issues that come from
 trying to move content from a web format to a printed page, and not all
 of these can be completely automated.
         As you have pointed out, an automatic conversion of html (or mediawiki
 markup) into latex can not be automated in a way that the book is
 absolutely perfect.
 But right now we are pretty satisfied with the results - even though we
 are still working on improvments of the conversion.
 The conversion is done with a parser we developed from scratch - as
 mentioned in a previous post. The mediawiki markup is translated
 into an internal representation which then gets transformed to latex (to
 be more precise "context") - this is the hard part. It is slightly easier to
 transform the internal representation back to html since some css-style
 information can be maintained.

       As I've mentioned before, I'd love to see a Wikimedia project which
 does exactly this.  Then the latex could be edited collaboratively to
 make things more "absolutely perfect".  It's nice to see it's at least
 somewhat possible, though I'd say the quality of the previews right
 now is fairly low.
    My wiki2xml converter is in the subversion repository. It is a set of
scripts to convert MediaWiki markup into XML, and from there into other
formats, including plain text, HTML, DocBook, and ODT
(OpenDocument/OpenOffice format). OpenOffice is also open source and can
generate PDFs natively.

I admit that wiki2xml is currently less advanced than the impressive
pediapress software, as it does not render <math> tags, does not make a
list of figures, etc. which is due to me being busy, and being too lazy
merging in patches by others ;-)

Magnus

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

2005

2004

Re: [Foundation-l] Printed on Demand Books from Wikipedia articles