On 12/25/05, Magnus Manske magnus.manske@web.de wrote:
Gregory Maxwell wrote:
On 12/25/05, Magnus Manske magnus.manske@web.de wrote:
For practical purposes, we should consider offereing a dump that contains
- articles
- templates (which are needed for many articles)
- a list of all author names
That would contain everything needed to generate GFDL-compliant, complete articles, but still be much smaller than the full dumps.
We could put this information in the "articles" dump (slightly larger download for everyone) or create a new dump type (more memory and processor power needed to create and offer it).
That is likely not enough, because you couldn't discover the principal authors that way.
I'm thinking about third party publications here; the WikiReaders and DVDs, for example. They cannot fulfil the GFDL with the current article dump, as that lists only the last author. However, with a list of all authors, they can just list all of them, which is enough under the GFDL. For a work consisting of several articles, they have to name each author only once in the book/DVD.
So, for what a thrid publishing party needs:
- articles only : Incomplete text, not GFDL-compliant; may not be
published in large numbers
- articles and templates: complete text, not GFDL-compliant; may not be
published in large numbers
- articles, templates, and authors: complete text, GFDL-compliant; may
be published in large numbers
The already have that in the full dumps, granted it includes other data as well but if you're going to publish something it's hardly a showstopper to download the full dump and extract the info you need from that.