On en.ws, we do need to increase the percentage of
texts with page
scans. Any suggestions on how to achieve this?
--
John
The biggest stumbling block at en.WS is that we
already had so many texts before the Page: feature.
Many of the most commonly available files already
exist under a seperate edition. I started to do some
Shakespeare (Romeo and Juliet) for example. I found a
nice djvu file (my preference for working with) and
uploaded to Commons. Soon I realized the text was
completely different. I started adding it as a
seperate title but it is abandoned for the momoment.
The original idea of being able to check the endless
IP revisions to Shakespeare was my primary motivation
and it turned out to be worthless for that.
So if we really want to prioritize scans, on en.WS we
are naturally going to end up overwriting existing
texts. Which is not necessarily a problem so long as
we go about it carefully. Both in regard to peoples
attachment to past work and to ensure we are not
replacing texts with inferior editions or replacing
highly proofed text with OCR.
The path of least resistance, which I always favor,
would be to start with a quality collection of files
of promenant texts. I like .djvu files because you do
not need to upload individual pages to Commons. Does
anyone have any suggestions about an existing
collection of scans available that we might want to
start with?
Birgitte SB
____________________________________________________________________________________
Never miss a thing. Make Yahoo your home page.
http://www.yahoo.com/r/hs