Yes, at present the IA jp2.zip files are the source for all derived files and for the OCR. All the derived files are homologous, i.e. the relative coordinates of any element inside the images are identical even if the image size varies. This means that a mapping of elements (images or text) can be transferred to any derived file.

Just an example: when a user crops an image from a djvu file with Danmichaelo's excellent CropTool, the coordinates of the crop could be used to crop the high-resolution jp2 or jpg image, or to get the coordinates of any piece of text mapped by the OCR.
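
A minimal sketch of that idea, assuming the crop is given as a pixel box (x, y, width, height) and that both images show the same page at different sizes. The function name and the sample sizes are made up for illustration; they are not taken from CropTool or from IA.

```python
def scale_crop(box, src_size, dst_size):
    """Map a pixel crop box (x, y, w, h) from one image size to another.

    Works only because the derived images are homologous: relative
    positions are the same, so each axis is scaled by the size ratio.
    """
    x, y, w, h = box
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    sx = dst_w / src_w  # horizontal scale factor
    sy = dst_h / src_h  # vertical scale factor
    return (round(x * sx), round(y * sy), round(w * sx), round(h * sy))


# Hypothetical sizes: a 1000x1500 djvu page and its 4000x6000 jp2 original.
djvu_box = (100, 200, 300, 150)
jp2_box = scale_crop(djvu_box, (1000, 1500), (4000, 6000))
print(jp2_box)
```

The same scaling, run in the other direction, would map an OCR word box from the jp2 down to the djvu page.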

Alex

2017-01-27 0:53 GMT+01:00 Sam Wilson <sam@samwilson.id.au>:

Good to know, thanks!

So, we just stick with jp2.zip

And I love the IA magic :)

On Fri, 27 Jan 2017, at 07:40 AM, Andrea Zanni wrote:

AFAIK, IA always produces the jp2 files itself.

I suggest that GLAMs upload zipped folders of JPEGs,

so IA can work its magic and produce a book viewer and a PDF as well as the jp2.

On Fri, Jan 27, 2017 at 12:10 AM, Sam Wilson <sam@samwilson.id.au> wrote:

On Thu, 26 Jan 2017, at 06:35 PM, Andrea Zanni wrote:

The problem for me is that librarians and other people who are genuinely interested in Wikisource and IA

don't understand why

* they upload a good scan on IA

* see a good book on IA, via the viewer

* get a horrible djvu on Wikisource.

This is the issue we should try to solve, otherwise we will lose a potentially important ally, as well as content and a new userbase.

Aubrey

Definitely!

On a related note: most (all?) IA-scanned books have e.g. *_jp2.zip files containing all the original scan images, but is there any standard for user-uploaded books? Like your librarians above, I assume they're uploading individual jpg/png files? Do these get combined into a single zip? I'm thinking that they don't, and that ia-upload needs to provide the option of using any of the following sources:

.djvu
_jp2.zip (there's also _jpg.zip and _raw_jp2.zip, but I guess we don't need to use them?)
*.jpg + *.jp2 + *.png (i.e. use all images in the item, apart from _cover_image.jpg)
.pdf

Sound complete? Or are there other ways?
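
A rough sketch of how the four options above could be sorted out from an item's file list. This is only an illustration: the function name and the sample filenames are hypothetical, and it does not describe how ia-upload actually works.

```python
def pick_sources(filenames):
    """Group an item's files by the candidate source types listed above."""
    # Skip the cover image, as suggested above.
    names = [n for n in filenames if not n.endswith("_cover_image.jpg")]
    return {
        "djvu": [n for n in names if n.endswith(".djvu")],
        # Take _jp2.zip but leave out _raw_jp2.zip.
        "jp2_zip": [
            n for n in names
            if n.endswith("_jp2.zip") and not n.endswith("_raw_jp2.zip")
        ],
        "images": [n for n in names if n.lower().endswith((".jpg", ".jp2", ".png"))],
        "pdf": [n for n in names if n.endswith(".pdf")],
    }


# Hypothetical file list for one item.
files = [
    "book.djvu", "book_jp2.zip", "book_raw_jp2.zip",
    "p1.jpg", "p2.png", "book_cover_image.jpg", "book.pdf",
]
sources = pick_sources(files)
print(sources)
```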

_______________________________________________

Wikisource-l mailing list

Wikisource-l@lists.wikimedia.org

https://lists.wikimedia.org/mailman/listinfo/wikisource-l
