On Aug 6, 2012, at 3:31 AM, Andrea Zanni <zanni.andrea84(a)gmail.com> wrote:
I'm
currently working with the BHL on a two-month, unrelated metadata project, part of which
is making sure that BHL's illustration metadata can be easily synced with content in
the Commons (see
http://commons.wikimedia.org/wiki/Template:Information_Art_of_Life for
more details). So I don't know anything about BHL's plans for Wikisource, but let
me know if I can help!
Personally, I'd love to see more (annotated!) biodiversity texts in Wikisource, such
as
http://en.wikisource.org/wiki/The_Salticidae_(Spiders)_of_Panama/Zygoballus -- these
species descriptions are the formal definition of a new species, and the BHL has been a
huge help in making these definitions available, online and for free, to taxonomists
everywhere (to say nothing of tons of gorgeous illustrations! [1]). However, their
transcription and indexing are largely automated, via OCR and text matching. Moving these
essential resources into Wikisource, where transcriptions and indexing could be improved
by hand, would be awesome!
To be honest I don't think they knew what Wikisource did before Wikimania. So I doubt
there is anything so firm as "plans". I know Aubrey and I, at least, spoke with
them. They are definately interested in the platform at Wikisource, but they want to
re-integrate the corrections made back into their collection. This is something that is a
problem with djvu files that we do not yet have an answer for.
Hi Gaurav,
I agree with Birgitte :-)
The BHL were very intrested in what we do, as were the guys from NARA.
The GLAM is a huge and crucial "dimension" ofr Wikisource future, I tried
(very, very badly) to list some things here.
<snip>
Another BIG issue is that we have a small userbase,
and I someone comes to us and says "here, you can have a gazillion books",
we won't proofread them untile the end of the times.
I think this is something who went "wrong" with the Gallica partnership
(by far the greatest Wikisource-GLAM collaboration)(I'm not blaming anyone, the
fr.source community did great, but the books were just too many).
Is this something we can work together on?
Reaching a critical mass of users is crucial for sister projects,
and if some specific community found some solutions it would be great to share.
I think this is a very important issue. I don't know any tried and true
When I spoke the BHL troika, I particularly described Wikisource as a platform. I also
explained to them that if they had any issues with working on our platform, they could
optionally install MediaWiki and Proofread page on their own server as it was all open
source. I did not wish to describe Wikisource as a *service* for proofreading and
transcription. We are really not capable of being such a service. We *are* capable of
providing the basic platform for transcription, of providing some guidance for presenting
texts with very limited tech support, and also of handling the interwebs aspect (not only
managing the servers and bandwidth but handling the user accounts, the privacy issues, the
spambots, the drawing of lines that must be drawn somewhere, and the finding of space for
the people who will inevitably show up at their project only to discover they would
actually rather be doing something else entirely with it all).
I think sometimes we downplay too much what we are capable of offering, because we would
wish to be able to offer more. Don't underestimate how valuable every piece of what we
offer is. Just WMF and our community set processes taking responsibility for the privacy
issues (and doing it competently by Internet standards) is an immense burden off of an
institution! Many of these things are not issues anyone would necessarily know to think
about before starting a crowd-sourcing project. But we should remember that they were (and
still are) hard issues for the WM movement to figure out. We now have won some of the
prize of age, the ability to draw on our own experience, and that of others within the
movement, in order to spare those that partner with us from some of the problems of
naïveté. This is meaningful. Even though we can never proofread all their documents with
the efforts of just our small community, what we can do for them is meaningful.
NARA with their dashboard, directing people from their community straight into the Page:
namespace of documents they want transcribed and proofread, is to my mind the way forward.
I would like see this dashboard concept developed further, so we can show other interested
institutions how to set dashboards up for their communities. So they might direct people
they are connected with toward working directly with institutional texts on Wikisource
without anyone getting too lost in in the wiki. I believe that we can be a platform for
this sort work by these institutions, that we can offer institutional supporters a low
barrier way to contribute directly to the mission they support, that we can help
facilitate the institutional "followers" becoming a real community that does
real work within the areas where our missions overlap. I don't believe we can, in good
faith, accept a large donation of institutional digital files from institutions which
expect the donation to be the beginning and end of their involvement with Wikisource.
Birgitte SB