Testing Google’s OCR – a sibilant intake of breath

Previously, I briefly mentioned the optical character recognition (OCR) technology within Google Docs. I decided to test it in the relatively challenging circumstance of converting photographs of pages from a book into text:

As you can see, the image to text conversion isn’t perfect. Indeed, it doesn’t work terribly well in the conditions to which I subjected it. Substantial strings of text are missing, and there are many errors.

Probably, the system would work better if the pages had been perfectly flat and evenly illuminated, and if my camera had been perfectly parallel to the page.

Author: Milan

In the spring of 2005, I graduated from the University of British Columbia with a degree in International Relations and a general focus in the area of environmental politics. Between 2005 and 2007 I completed an M.Phil in IR at Wadham College, Oxford. I worked for five years for the Canadian federal government, including completing the Accelerated Economist Training Program, and then completed a PhD in Political Science at the University of Toronto in 2023. View all posts by Milan

One thought on “Testing Google’s OCR”

I am surprised it works so poorly. How did they get so much into Google Books?

Author: Milan

One thought on “Testing Google’s OCR”

Leave a Reply