Re: [Wikisource-l] OCR as a service?

12 Jul 2015

On Sun, Jul 12, 2015 at 11:25 AM, Asaf Bartov &lt;abartov(a)wikimedia.org&gt; wrote:

...
  On Sat, Jul 11, 2015 at 9:59 AM, Andrea Zanni
&lt;zanni.andrea84(a)gmail.com&gt;
 wrote:

  uh, that sounds very interesting.
 Right now, we mainly use OCR from djvu from Internet Archive (that means
 ABBYY Finereader, which is very nice).

 Yes, the output is generally good.  But as far as I can tell, the
 archive's Open Library API does not offer a way to retrieve the OCR output
 programmatically, and certainly not for an arbitrary page rather than the
 whole item.  What I'm working on requires the ability to OCR a single page
 on demand.

 True. I've recently met Giovanni, a new (italian) guy who's now working
with
Internet Archive and Open Library.
We discussed about a number of possible parnerships/projects, this is
definitely one to bring it up.

But if we manage to do it directly in the Wikimedia world it's even better.

Aubrey

...

 _______________________________________________
 Wikisource-l mailing list
 Wikisource-l(a)lists.wikimedia.org
 https://lists.wikimedia.org/mailman/listinfo/wikisource-l

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

2010

2009

2008

2007

2006

Re: [Wikisource-l] OCR as a service?