top of page

Tuesday Tips for Legal Professionals: OCR Your PDFs Without Wrecking Print Quality

  • Feb 17
  • 4 min read

Quick overview (for skim readers): When you OCR a PDF, you can maintain high quality. Bad print comes from compression, not from OCR itself.


To stay safe:

  • Treat OCR and “Reduce File Size” as two separate steps.

  • Scan at 300 dpi for anything that will be printed and keep that resolution.

  • In your OCR tool, turn off downsampling and avoid “Smallest file size” presets.

  • Keep two versions:

    • Master – High Quality (for print)

    • E-file / Email Copy – Compressed (for upload only)

  • Never send the compressed version to your printer as the print master.


Do that, and you get searchable PDFs that still look sharp in appeal books, factums, and BOAs.

1. What OCR actually does (and what ruins quality)


When you “OCR a PDF,” two things can happen:

  1. The software looks at each page image and creates an invisible text layer on top so you can search, select, and copy text.

  2. Depending on the settings, it may also reprocess the page images:

    • Downsample them to a lower dpi

    • Use heavier compression (more JPEG artifacts)

    • Try to “optimize” for small file size

OCR itself does not have to change the page layout at all. The quality issues stem from the additional optimization/compression settings.


So the goal is simple: Add the text layer. Leave the image layer alone.


2. Start with a good scan (this decides how far you can go)


If your source is paper, your scan settings set the quality ceiling.


For legal work that will be printed in court books:

  • Resolution:

    • Use 300 dpi as your default for text-heavy documents.

    • 200 dpi is usually okay for purely text, but 300 dpi is safer for stamps, signatures, and Bates.

  • Colour mode:

    • Black & white or greyscale for clean text-only pages.

    • Colour for:

      • Poor-quality faxes

      • Photos, maps, diagrams

      • Documents with coloured highlights, seals, or small annotations

  • Straight pages, full edges:

    • Make sure nothing is cut off at margins.

    • Keep pages straight so text doesn’t look skewed once bound.


Once you have a solid 300 dpi scan, you can OCR with confidence and still print well.

3. How to OCR without lowering resolution


Every tool looks different, but the principles are the same. When you run OCR:

  1. Choose a mode that keeps the original image

    • If you see options like:

      • “Searchable Image (Exact)” vs “ClearScan” or “Editable text and images”

    • Pick the one that says Exact or Keep original appearance.

  2. Turn off downsampling

    • Look for an advanced or “Images” section.

    • If it says things like:

      • “Downsample to 150 dpi / 200 dpi”

    • Either:

      • Set Do not downsample, or

      • Keep it at 300 dpi.

  3. Avoid “Smallest file size / High compression” presets

    • Anything marketed as:

      • “Optimize for web.”

      • “Smallest file size”

    • will usually hammer the images.

  4. Don’t combine OCR and reduce-size in one move

    • Do OCR only first.

    • Save that as your high-quality master.

    • If you need a smaller file for e-filing or email, make a separate compressed copy.


Result: Your master PDF looks just like the scan, but now it’s searchable and court-ready.

4. Why do you need two versions: Master vs E-file copy


This is where most legal PDF problems start: people OCR, then hit “Reduce File Size,” and send that shrunk version to the printer.


For court work, get into this habit:

  • Master – High Quality

    • 300 dpi

    • OCR applied

    • Minimal or no extra compression

    • Used for:

      • Appeal books

      • Books of authorities

      • Factums with embedded exhibits

      • Any print master going to your printer

  • E-file / Email Copy – Compressed

    • Same content and pagination as the master

    • File-size reduction applied just enough to meet limits

    • Used for:

      • Court e-filing portals

      • Emailing to other parties if size is a problem


Name them clearly in your folders, for example:

  • Appeal_Record_Vol1_MASTER_300dpi_OCR.pdf

  • Appeal_Record_Vol1_EFILE_Optimized.pdf


And make it a rule in your team:


The printer always gets the MASTER file, not the e-file copy.

5. What can go wrong if you over-compress an OCR’d PDF


This is what we see from the print side when OCR + heavy compression goes wrong:

  • Soft text

    • Text looks slightly blurred at normal reading distance, especially in smaller font sizes and footnotes.

  • Fuzzy Bates/page numbers

    • The numbers you rely on for citations are the first to suffer from aggressive compression.

  • Muddy stamps and signatures

    • Court stamps, seals, and signatures lose sharp edges and can start to look “blocky.”

  • Illegible fine detail

    • Maps, engineering drawings, medical images, or complex tables lose the line detail you need.


These problems are nearly impossible to fix at print time without going back to a better source.

6. Quick checklist before you send files to your printer


For any OCR’d PDF that will be a print master for appeal books, BOAs, factums, or records:

  •  Was the original scan done at 300 dpi (or better)?

  •  Is the OCR’d PDF still sharp at 100% zoom on screen?

  •  Did you avoid using “Smallest file size” / super-compressed settings on the master?

  •  Do you have a clearly named MASTER version separate from any compressed e-file copy?

  •  Are you sending the MASTER to your printer?


If all those are yes, your chances of a clean print job go way up.

7. How CETTEC fits into this


From our side at CETTEC Printing, OCR and resolution show up in very practical way:

  • We can check your PDFs for obvious quality problems before running large print jobs.


E-filing, searchable PDFs, and high-quality print don’t have to be in conflict. If your team builds a simple habit around good scans, careful OCR, and master vs e-file copies, you’ll see fewer fuzzy exhibits and far fewer “this looked fine on screen” surprises when the books come back from print.

Recent Posts

See All
bottom of page