- Support Home
- Software and Drivers
- Contact Us
- Business Support
- My HP Account
HP multifunction printers (MFPs) with Flow technology include an embedded optical character recognition (OCR) feature. OCR provides the ability to convert printed pages into text that can be edited or searched using a computer.
The OCR options are available for the following scan features:
Scan to USB Drive
Scan to Job Storage
Scan to Network Folder
Scan to Email
Scan to SharePoint©
When setting up a scan job, select an OCR file format from the File Type and Resolution menu on the printer control panel.
The following OCR file types are available:
Text (OCR): With this format, the OCR conversion process ignores all graphics. It analyzes the text characters, converts them to text, and then saves them in an unformatted, ASCII-encoded text file with the extension ".txt."
Unicode Text (OCR): With this format, the OCR conversion process ignores all graphics. It analyzes the text characters, converts them to text, and then saves them in a text-only Unicode file with the extension ".txt." Unicode files provide increased cross-platform compatibility.
The icons for Text and Unicode text files are identical. Open the file in an enhanced text editor such as Notepad ++ to view the encoding method.
RTF (OCR): This is a Rich Text Format file that contains both text and graphics. The color, type style, and typeface are preserved as much as possible.
Searchable PDF (OCR): This file type creates a PDF image of the document with a hidden text layer. This method preserves the exact look of the original file, but enables searching for text within the PDF. The Searchable PDF format preserves embedded video, audio, and hyperlinks in the PDF.
Searchable PDF/A (OCR): Like the Searchable PDF format, the Searchable PDF/A file creates an image of the original document with a hidden text layer. PDF/A files are intended for long-term archiving, and cannot rely on any plug-ins to the PDF viewer or any external references that might not be available when the PDF is viewed from an archive. PDF/A files cannot include embedded video, audio, or any external resources, such as hyperlinks. PDF/A files cannot be encrypted.
The icons for PDF and PDF/A files are identical. When viewing PDF/A in Adobe© Acrobat Reader a banner at the top of the window appears with the message: "You are viewing this document in PDF/A mode."
HTML (OCR): This option creates a self-contained Web page of the scan. The result is a .zip file that contains a folder with the images and HTML code necessary to display the page. Extract the .zip file, and then double-click the filename.htm file to view the document in a Web browser.
CSV (OCR): Comma separated value (CSV) files are text files where each entry or line is separated by a comma character. Use CSV files to import data into databases or spreadsheet programs, such as Microsoft Excel.