Tuesday Tips for Legal Professionals: OCR Your PDFs Without Wrecking Print Quality
- Feb 17
- 4 min read
Quick overview (for skim readers): When you OCR a PDF, you can maintain high quality. Bad print comes from compression, not from OCR itself.
To stay safe:
Treat OCR and “Reduce File Size” as two separate steps.
Scan at 300 dpi for anything that will be printed and keep that resolution.
In your OCR tool, turn off downsampling and avoid “Smallest file size” presets.
Keep two versions:
Master – High Quality (for print)
E-file / Email Copy – Compressed (for upload only)
Never send the compressed version to your printer as the print master.
Do that, and you get searchable PDFs that still look sharp in appeal books, factums, and BOAs.
1. What OCR actually does (and what ruins quality)
When you “OCR a PDF,” two things can happen:
The software looks at each page image and creates an invisible text layer on top so you can search, select, and copy text.
Depending on the settings, it may also reprocess the page images:
Downsample them to a lower dpi
Use heavier compression (more JPEG artifacts)
Try to “optimize” for small file size
OCR itself does not have to change the page layout at all. The quality issues stem from the additional optimization/compression settings.
So the goal is simple: Add the text layer. Leave the image layer alone.
2. Start with a good scan (this decides how far you can go)
If your source is paper, your scan settings set the quality ceiling.
For legal work that will be printed in court books:
Resolution:
Use 300 dpi as your default for text-heavy documents.
200 dpi is usually okay for purely text, but 300 dpi is safer for stamps, signatures, and Bates.
Colour mode:
Black & white or greyscale for clean text-only pages.
Colour for:
Poor-quality faxes
Photos, maps, diagrams
Documents with coloured highlights, seals, or small annotations
Straight pages, full edges:
Make sure nothing is cut off at margins.
Keep pages straight so text doesn’t look skewed once bound.
Once you have a solid 300 dpi scan, you can OCR with confidence and still print well.
3. How to OCR without lowering resolution
Every tool looks different, but the principles are the same. When you run OCR:
Choose a mode that keeps the original image
If you see options like:
“Searchable Image (Exact)” vs “ClearScan” or “Editable text and images”
Pick the one that says Exact or Keep original appearance.
Turn off downsampling
Look for an advanced or “Images” section.
If it says things like:
“Downsample to 150 dpi / 200 dpi”
Either:
Set Do not downsample, or
Keep it at 300 dpi.
Avoid “Smallest file size / High compression” presets
Anything marketed as:
“Optimize for web.”
“Smallest file size”
will usually hammer the images.
Don’t combine OCR and reduce-size in one move
Do OCR only first.
Save that as your high-quality master.
If you need a smaller file for e-filing or email, make a separate compressed copy.
Result: Your master PDF looks just like the scan, but now it’s searchable and court-ready.
4. Why do you need two versions: Master vs E-file copy
This is where most legal PDF problems start: people OCR, then hit “Reduce File Size,” and send that shrunk version to the printer.
For court work, get into this habit:
Master – High Quality
300 dpi
OCR applied
Minimal or no extra compression
Used for:
Appeal books
Books of authorities
Factums with embedded exhibits
Any print master going to your printer
E-file / Email Copy – Compressed
Same content and pagination as the master
File-size reduction applied just enough to meet limits
Used for:
Court e-filing portals
Emailing to other parties if size is a problem
Name them clearly in your folders, for example:
Appeal_Record_Vol1_MASTER_300dpi_OCR.pdf
Appeal_Record_Vol1_EFILE_Optimized.pdf
And make it a rule in your team:
The printer always gets the MASTER file, not the e-file copy.
5. What can go wrong if you over-compress an OCR’d PDF
This is what we see from the print side when OCR + heavy compression goes wrong:
Soft text
Text looks slightly blurred at normal reading distance, especially in smaller font sizes and footnotes.
Fuzzy Bates/page numbers
The numbers you rely on for citations are the first to suffer from aggressive compression.
Muddy stamps and signatures
Court stamps, seals, and signatures lose sharp edges and can start to look “blocky.”
Illegible fine detail
Maps, engineering drawings, medical images, or complex tables lose the line detail you need.
These problems are nearly impossible to fix at print time without going back to a better source.
6. Quick checklist before you send files to your printer
For any OCR’d PDF that will be a print master for appeal books, BOAs, factums, or records:
Was the original scan done at 300 dpi (or better)?
Is the OCR’d PDF still sharp at 100% zoom on screen?
Did you avoid using “Smallest file size” / super-compressed settings on the master?
Do you have a clearly named MASTER version separate from any compressed e-file copy?
Are you sending the MASTER to your printer?
If all those are yes, your chances of a clean print job go way up.
7. How CETTEC fits into this
From our side at CETTEC Printing, OCR and resolution show up in very practical way:
We can check your PDFs for obvious quality problems before running large print jobs.
E-filing, searchable PDFs, and high-quality print don’t have to be in conflict. If your team builds a simple habit around good scans, careful OCR, and master vs e-file copies, you’ll see fewer fuzzy exhibits and far fewer “this looked fine on screen” surprises when the books come back from print.
