Arne Wossink, 15/10/2015 12:18:
Wikimedia Nederland has recently approached by several institutions that
would like to do uploads of source material. Wikisource would be the
preferred platform for this as the material would be searchable (which
it wouldn't be if it was only uploaded as pdf to Commons).
I would like to know if there have been previous projects involving
large uploads by institutions, and if there's any documentation on how
to proceed with these.
DjVU (and PDF?) files with a text layer *are* searchable in Commons since CirrusSearch was enabled (September 2014). Of course the search is only as good as the text: with poor OCR, it will be poor.
The main points are the same as for all batch uploads, see https://commons.wikimedia.org/wiki/Commons:Guide_to_batch_uploading ; as for actually creating pages in Wikisource, that's another matter, you must consider what the goals are and have a good plan.
BEIC uploaded about 1000 books in 2015 and will upload more in the future. We only created Index pages which we considered necessary and we didn't touch namespace 0. See pointers at https://it.wikipedia.org/wiki/Progetto:GLAM/BEIC/2015-07 (search "Wikisource") and https://it.wikisource.org/wiki/Wikisource:Collaborazioni/BEIC .
Nemo
_______________________________________________
Wikisource-l mailing list
Wikisource-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l