[Foundation-l] English language dominationism is striking again

David Gerard dgerard at gmail.com
Thu Jun 24 12:12:17 UTC 2010

On 23 June 2010 21:31, Mariano Cecowski <marianocecowski at yahoo.com.ar> wrote:
> --- El mié 23-jun-10, Michael Peel <email at mikepeel.net> escribió:

>> I always think than not using reCaptcha is a shame, as it's
>> a nice way to get people to proofread text in a reasonably
>> efficient way. It would be really nice if someone could
>> create something similar that proofreads OCR'd text from
>> Wikisource... <hint, hint>.

> And how do you decide that what was entered is wrong or right?

It turns out that having several randomly-selected people check a
given recaptcha is very accurate indeed.


"But if a computer can't read such a CAPTCHA, how does the system know
the correct answer to the puzzle? Here's how: Each new word that
cannot be read correctly by OCR is given to a user in conjunction with
another word for which the answer is already known. The user is then
asked to read both words. If they solve the one for which the answer
is known, the system assumes their answer is correct for the new one.
The system then gives the new image to a number of other people to
determine, with higher confidence, whether the original answer was

Your question is similar to "But if anyone can edit Wikipedia, how do
you know what's entered will be accurate?"

- d.

More information about the wikimedia-l mailing list