Perhaps all from you already know this, but I only recently discovered that
pdf2djvu converts a *searchable pdf* into a *searchable djvu* (t.i. uploads
anything from pdf to djvu, active links and metadata too) and I like to
share my "discover". Conversion is extremely simple. Unluckily, we use only
a little bit of djvu text data - usually only the whole, unmapped text, the
only exception being hOCR tool by Phe, that outputs mapped text.
Alex
2015-10-15 13:45 GMT+02:00 billinghurst <billinghurstwiki(a)gmail.com>om>:
Also to note that User:Dominic was a wikimedian in
residence with NARA in
the States and had a large number of files uploaded, and components of
transcription project for those uploads.. They have their own template at
Commons, so you should be able to dig them up.
Regards, Billinghurst
On Thu, Oct 15, 2015 at 9:18 PM Arne Wossink <wossink(a)wikimedia.nl> wrote:
Hi all,
Wikimedia Nederland has recently approached by several institutions that
would like to do uploads of source material. Wikisource would be the
preferred platform for this as the material would be searchable (which it
wouldn't be if it was only uploaded as pdf to Commons).
I would like to know if there have been previous projects involving large
uploads by institutions, and if there's any documentation on how to proceed
with these.
Thanks!
Arne Wossink
Projectleider / Project Lead Wikimedia Nederland
Tel. +31 (0)6 11000505
*Postadres*:
* Bezoekadres:*
Postbus 167 Mariaplaats 3
3500 AD Utrecht Utrecht
_______________________________________________
Wikisource-l mailing list
Wikisource-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l
_______________________________________________
Wikisource-l mailing list
Wikisource-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikisource-l