Jump to Content | Jump to Navigation
Jump to Content | Jump to Navigation
Sponsor


Law Practice TODAY

Technology Tips sponsored by Adobe Systems, Inc.

Batch OCR using Adobe Acrobat Professional

by Rick Borstein, Adobe Systems, Inc.

March 2006

Have you ever received a PDF file that did not contain searchable text? You may know that you can use Acrobat’s OCR (Optical Character Recognition) to add an invisible layer of searchable text on top of the file. This allows you to select, copy and search text on a paper document. Great!

What do you do when you have hundreds of TIFFs and Image-only PDFs file that you need to search for a big case? Working with these documents one at a time is not efficient.

If you have Acrobat Professional, you can batch OCR and let you computer do the work for you.

Batch Processing to the Rescue

There are two steps to follow:

  1. Set up a Batch Sequence
  2. Run a Batch Sequence

Set up a Batch Sequence

Scan your documents locally or send to a PC where Acrobat Pro is installed.

If you have the capability, scan directly to PDF or to an MTIFF (multi-page TIFF). These formats allow all of the pages of a document to be maintained as a single file.

In Acrobat Professional, choose Advanced Batch Processing

Click the New Sequence button.

Image1

Give the sequence a name.

Click Select Commands

Image2

Choose Recognize Text Using OCR and click the Add button.

Image3

Double-click the Recognize Text using OCR text (right side of the window) to set OCR Options.

Set Downsample Images to 300 dpi. Click OK

[Image4

Click OK again to get back to the main window.

Click Output Options.

Image5

Note: Output options allows you specify where the OCR’d files should be written. I suggest writing them to a local drive and copying later to a network store.

Enable PDF Optimizer and Do not overwrite existing files.

Click the Settings Button.

IMage6

Adjust the settings to make the smallest possible files, especially for Black and White (monochrome) files:

Click OK. Give the revised settings a name such as “B&W Lossy”.

Run a Batch Sequence

Now, all you need to do is to run the batch sequence.

  1. Place all the files you wish to process in a single folders on your hard drive.
  2. Choose Advanced à Batch Processing
  3. Select the sequence to run
  4. Click OK
  5. Select the folder to process
  6. Click the Select button.
  7. Select the Output Folder

That’s it! Sit back and enjoy a cup a coffee as Acrobat does the work for you.

Want to know more about Adobe Acrobat for the legal profession?

Legal Landing page on Adobe.com

The Acrolaw Blog

Back to Top


Rick Borstein is Adobe's Business Development Manager for the Legal Market. He has over 18 years experience in the software industry and is an Adobe Certified Expert for Acrobat.