• Subscribe

At Science Node, we need your help. We have ALMOST reached our fund-raising goal. In order to maintain our independence as a source of unbiased news and information, we don’t take money from big tech corporations. That’s why we’re asking our readers to help us raise the final $10,000 we need to meet our budget for the year. Donate now to Science Node's Gofundme campaign. Thank you!

iSGTW Link of the week - reCAPTCHA: stop spam, read books, beat the bots

Link of the week - reCAPTCHA: stop spam, read books, beat the bots


About 60 million CAPTCHAs are solved by humans around the world every day. In each case, roughly ten seconds of human time are being spent, which means more than 150,000 hours are spent every day on this task.
Images courtesy of reCAPTCHA

reCAPTCHA is free program that protects you from spam while furthering the fine goal of digitizing physical libraries, one word at a time.

CAPTCHAs (for Completely Automated Public Turing tests to tell Computers and Humans Apart) are used on many Web sites to distinguish between legitimate human users and automated "bots" that trawl the web to generate spam.

Using the ability of human users to decipher distorted text, CAPTCHAs prevent bots from navigating to protected Web sites.

Turning anti-spam in to a force for good

About 60 million CAPTCHAs are solved by humans around the world every day. In each case, roughly ten seconds of human time are being spent, which means more than 150,000 hours are spent every day on this task.

reCAPTCHA harnesses this power by channeling the effort spent solving CAPTCHAs into effort spent "reading" books.

To the rescue of Optical Character Recognition

Many pre-computer age books are being digitized to archive human knowledge and improve accessibility. Pages from such books are photographically scanned and these images are transformed into text using Optical Character Recognition or OCR.

The problem is that OCR is not perfect: when it bumps into a difficult word, reCAPTCHA is called tp the rescue.

reCAPTCHA transforms words that cannot be read by OCR in to CAPTCHAs for humans to decipher on the Web.

Each new word that cannot be read correctly by OCR is given to a human user along with another word for which the answer is already known.

The user is then asked to read and enter both words.

If they correctly enter the word that is already known, the system assumes their answer for the new word is also correct. The system then gives the new image to a number of other people to determine, with higher confidence, whether the original answer was correct.

If you run a Web site that suffers from problems with spam, you can put reCAPTCHA on your site. Suffer from email spam? Try mailhide, which allows you to safely post your email address on the web.


Join the conversation

Do you have story ideas or something to contribute? Let us know!

Copyright © 2019 Science Node ™  |  Privacy Notice  |  Sitemap

Disclaimer: While Science Node ™ does its best to provide complete and up-to-date information, it does not warrant that the information is error-free and disclaims all liability with respect to results from the use of the information.

Republish

We encourage you to republish this article online and in print, it’s free under our creative commons attribution license, but please follow some simple guidelines:
  1. You have to credit our authors.
  2. You have to credit ScienceNode.org — where possible include our logo with a link back to the original article.
  3. You can simply run the first few lines of the article and then add: “Read the full article on ScienceNode.org” containing a link back to the original article.
  4. The easiest way to get the article on your site is to embed the code below.