Text and bitmapped images are two different kinds of animals. Text can be typed, edited, copied, pasted, deleted, and processed. Images, however, are a bunch of pixels in a grid that combine in the right way to convey some sort of information: they resemble a photo, an illustration, or rendered text. So where can the two meet?
Optical character recognition (OCR) was the name we gave to extracting text from images. But the term has gone out of favor as software increasingly tries to identify text in an image automatically and make it searchable and, often, available for copying.
If you are trying to access text in images you have, whether documents, photos, or forms, you have many options available. That includes PDFs made up of scanned images that have no text layer. You may already have a free account or paid subscription to one of the services below or own the software.
In researching this article, I tested a range of images and documents, and the results proved fairly consistent for each service or app. For a side-by-side comparison that shows the differences starkly, I ran each one against the same legibly typeset magazine copy from a 1920s Popular Mechanics article (about comic-strip production) and copied out the recognized text. You can see the figures below with each app or service noted. You probably won’t be performing text extraction against 1920s magazine articles (maybe so, if you’re like me!), but the slightly degraded source text and the quality of the scan put the services and software to a more substantial test than pristine rendered typography would.
My testing used the public beta of macOS Monterey. Apple notes in a footnote on the macOS Monterey preview website that an M1 is required, so it doesn’t appear that Live Text will be available on Intel Macs.
PDFpen and macOS Monterey’s Live Text performed extremely accurately. OneNote, once Microsoft had performed its delayed recognition, was quite close to those two as well. Evernote shows matches within the text as you type a search term, and it appeared to rival Monterey and PDFpen. All four were overwhelmingly better than Acrobat and Google Docs, which had embarrassingly poor results.
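Those judgments were made by eye. If you want to put a number on a comparison of your own, here is a minimal Python sketch that computes a character error rate: the count of single-character edits needed to turn the recognized text into a hand-typed reference, divided by the reference length. This is not how the tests in this article were scored, and the function names `edit_distance` and `character_error_rate` are made up for illustration.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: fewest insertions, deletions, and
    substitutions needed to turn string a into string b."""
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def character_error_rate(reference: str, recognized: str) -> float:
    """Edits per reference character; 0.0 means a perfect match.
    Runs of whitespace are collapsed so line breaks don't count as errors."""
    ref = " ".join(reference.split())
    rec = " ".join(recognized.split())
    if not ref:
        raise ValueError("reference text must not be empty")
    return edit_distance(ref, rec) / len(ref)
```

To use it, type out a short passage from your scan by hand as the reference, paste in what each app copied out as the recognized text, and compare the rates. Lower is better.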
macOS Monterey Live Text in Safari and Photos
In the upcoming release of macOS 12 Monterey (as well as in iOS 15 and iPadOS 15), text in images is recognized automatically on web pages in Safari and in the Photos app when you’re viewing an image. You can select and copy that text. The feature requires Apple’s neural engine, available in M1 Apple silicon Macs and in iPhones and iPads with an A12 Bionic chip or later, which first appeared in some iPhones in 2018 and some iPads in 2019. You can test this out using the public beta. It does an excellent job.
Adobe Acrobat Pro DC
Opening a PDF within Acrobat Pro DC typically starts text recognition automatically. When it’s complete, you can select any range of text to copy. OCR within Acrobat is part of a full Creative Cloud subscription ($52.59 to $79.49 per month), and Adobe offers Acrobat-specific plans as well (from $14.99 to $24.99 per month). The results, however, aren’t good.
Evernote
Evernote performs OCR on any image or PDF with embedded images imported into the service or captured via a mobile device’s camera. This makes the text fully searchable, but it bafflingly doesn’t let you copy recognized text. (An exported PDF will require the text layer added, however.) The free tier allows searching text in images; the paid tier ($7.99 per month) is required for searching within PDFs, whether they include text or the text is extracted by OCR.
Google Drive and Google Docs
Google offers this at both free and paid tiers. Upload the PDF or image to Google Drive, either via Google Drive on your desktop or in a web browser, then open the file in Google Docs. This action imports the image or PDF and pastes the extracted text, with some formatting, below it. As you can see, the service didn’t perform well at all.
OneNote
OneNote automatically checks any image pasted into a OneNote page for text. Control-click the image and select Copy Text from Picture. However, Microsoft notes, “The OCR Text recognition process is a very complex one that uses Microsoft online services and therefore can take a few minutes for simple pictures and up to hours for complex ones before the Copy Text from Picture command is available when you Control-click the picture.” Given that Apple, Google, and third-party apps can perform OCR instantly, perhaps OneNote is lagging, though the results are very good. OneNote is part of Microsoft 365 subscriptions.
PDFpen
PDFpen is an excellent app for working with PDFs. To convert text in PDFpen, choose Edit > OCR Page or hold down Option and choose Edit > OCR Document. If there are existing OCR text layers, you have to clear them first via Edit > Clear OCR Layer in Page/Document. PDFpen comes in regular ($79.95) and Pro ($129.95) versions. The job it did on my test was impressive.
Ask Mac 911
We’ve compiled a list of the questions we get asked most frequently, along with answers and links to columns: read our super FAQ to see if your question is covered. If not, we’re always looking for new problems to solve! Email yours to [email protected], including screen captures as appropriate and whether you want your full name used. Not every question will be answered, we don’t reply to email, and we cannot provide direct troubleshooting advice.
July 20, 2021