Friday, August 31, 2012

Convert A Jpeg Photograph To Ocr

To convert a JPEG photograph of a document to Optical Character Recognition (OCR), an interim conversion step is necessary. The JPEG is first converted to a Portable Document Format (PDF) document, and then the document is scanned with an Optical Character Recognition engine. Adobe Acrobat (full version) includes features to accomplish both tasks, with the ability to take the image from JPEG to PDF, and then convert the image characters in the document to rendered text using Adobe Acrobat's OCR engine.


Instructions


Convert JPEG to PDF


1. Click the Windows "Start" button and select Adobe Acrobat from the Programs list. The Adobe Acrobat application will launch.


2. Click the "File" option from the top navigation bar in Acrobat.


3. Select "Open" from the menu.


4. Click the downward pointing arrow on the "Files of Type" drop-down box and select "All Files (*.*)" to show files other than PDFs.


5. Navigate to and select the JPEG file. Click the file to load into the Acrobat application.


6. Click the "File" option from the top navigation bar, and then select "Save As..."


7. Select the "Adobe PDF (*.pdf)" file type from the "Save as Type" drop-down box.


8. Click the "Save" button. The JPEG is now converted to a PDF document with the file extension of .pdf.


Run OCR on the PDF


9. Click the "Document" option from the top navigation bar.


10. Select "Recognize Text Using OCR" from the context menu, and then click the "Start" link. The OCR engine will launch and the OCR dialog box will appear.


11. Click the radio button in front of "Current Page" and click the "Edit" button. Three selection boxes will appear.


12. Select the language of the document in the first drop-down box.


13. Select "Formatted Text and Graphics" from the second drop-down box.


14. Select a DPI option from the third drop-down box. This selection will determine the quality of the graphics in the converted PDF.


15. Click "OK" to save and close the OCR options, and then click "OK" again to start the OCR engine. The OCR engine will proceed through the document and convert each image character that is recognized, into rendered text. If the OCR engine encounters ambiguous characters, a dialog box will appear and the user will be required to type the character, word or phrase for clarification.


16. Enter characters, words or phrases for each ambiguous character, and click "OK" after each entry. Continue until the end of the document is reached. A notification will appear when the OCR process has finished.


17.Click "OK" to close the OCR engine.


18. Click the "File" option from the top navigation bar, and select "Save" to save the finished document. The JPEG has now been converted to OCR rendered text for searching and accessibility.







Tags: option from, Adobe Acrobat, from navigation, option from navigation, will appear, Click File option, File option