yvi: woman showing her biceps, text: "We can DW it" (Dreamwidth)
[personal profile] yvi posting in [community profile] dwrocks
... because they use http://recaptcha.net/

To archive human knowledge and to make information more accessible to the world, multiple projects are currently digitizing physical books that were written before the computer age. The book pages are being photographically scanned, and then transformed into text using "Optical Character Recognition" (OCR). The transformation into text is useful because scanning a book produces images, which are difficult to store on small devices, expensive to download, and cannot be searched. The problem is that OCR is not perfect.

reCAPTCHA improves the process of digitizing books by sending words that cannot be read by computers to the Web in the form of CAPTCHAs for humans to decipher. More specifically, each word that cannot be read correctly by OCR is placed on an image and used as a CAPTCHA. This is possible because most OCR programs alert you when a word cannot be read correctly.


[Yes, this is very random]

Profile

dwrocks: Imagination Unlimited (Default)
Dreamwidth Rocks

August 2014

S M T W T F S
     12
3456789
1011121314 1516
17181920212223
24252627282930
31      

Style Credit

Expand Cut Tags

No cut tags