Re: [Wikisource-l] Scanned texts

4 Jan 2008

...

 On en.ws, we do need to increase the percentage of
 texts with page
 scans.  Any suggestions on how to achieve this?

 --
 John

The biggest stumbling block at en.WS is that we
already had so many texts before the Page: feature. 
Many of the most commonly available files already
exist under a seperate edition.  I started to do some
Shakespeare (Romeo and Juliet) for example.  I found a
nice djvu file (my preference for working with) and
uploaded to Commons.  Soon I realized the text was
completely different.  I started adding it as a
seperate title but it is abandoned for the momoment. 
The original idea of being able to check the endless
IP revisions to Shakespeare was my primary motivation
and it turned out to be worthless for that.  

So if we really want to prioritize scans, on en.WS we
are naturally going to end up overwriting existing
texts.  Which is not necessarily a problem so long as
we go about it carefully.  Both in regard to peoples
attachment to past work and to ensure we are not
replacing texts with inferior editions or replacing
highly proofed text with OCR. 

The path of least resistance, which I always favor,
would be to start with a quality collection of files
of promenant texts.  I like .djvu files because you do
not need to upload individual pages to Commons.  Does
anyone have any suggestions about an existing
collection of scans available that we might want to
start with?

Birgitte SB

____________________________________________________________________________________
Never miss a thing.  Make Yahoo your home page. 
http://www.yahoo.com/r/hs

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Re: [Wikisource-l] Scanned texts