On 23 June 2010 21:31, Mariano Cecowski <marianocecowski(a)yahoo.com.ar> wrote:
--- El mié 23-jun-10, Michael Peel
> I always think than not using reCaptcha is a
shame, as it's
> a nice way to get people to proofread text in a reasonably
> efficient way. It would be really nice if someone could
> create something similar that proofreads OCR'd text from
> Wikisource... <hint, hint>.
And how do you decide that what was entered is wrong
It turns out that having several randomly-selected people check a
given recaptcha is very accurate indeed.
"But if a computer can't read such a CAPTCHA, how does the system know
the correct answer to the puzzle? Here's how: Each new word that
cannot be read correctly by OCR is given to a user in conjunction with
another word for which the answer is already known. The user is then
asked to read both words. If they solve the one for which the answer
is known, the system assumes their answer is correct for the new one.
The system then gives the new image to a number of other people to
determine, with higher confidence, whether the original answer was
Your question is similar to "But if anyone can edit Wikipedia, how do
you know what's entered will be accurate?"