Why Is My PDF So Large? 5 Causes and How to Fix Each One
By the PdfToolbox team · June 11, 2026
A two-page document should be a few hundred kilobytes. So why is yours 20 MB? PDFs balloon for a handful of predictable reasons — and once you know which one you’re dealing with, the fix is usually quick.
1. It’s a scan
The most common cause by far. A scanner doesn’t store text — it stores a photograph of every page, often at 300–600 DPI. Ten scanned pages can easily outweigh a thousand pages of real text.
Fix: re-encode the page images. Run the file through Compress PDF with Strong mode, which re-encodes each page as an optimized image. Scans routinely shrink by 70–90%.
2. Oversized photos pasted into the document
A phone photo is 3–10 MB. Paste four of them into a report and export to PDF, and the full originals usually ride along — even if they’re displayed two centimetres wide.
Fix: compression re-encodes embedded images at a sensible size. If you’re building the document from photos in the first place, JPG to PDF keeps things lean from the start.
3. Embedded fonts — multiplied
Each embedded font adds anywhere from a few KB to over a megabyte (CJK and decorative fonts are the heavy ones). Documents assembled from multiple sources can carry a dozen full font files, many of them duplicates.
Fix: Light compression rewrites the file’s internal structure and removes redundant objects without touching quality — text stays crisp and selectable.
4. Pages nobody needs
Cover sheets, blank scanner pages, the 40 appendix pages after the 2 pages you actually need to send.
Fix: delete the dead weight, or extract just the pages you need into a new file. Fewer pages is the one size reduction that costs nothing in quality.
5. Revision history and leftover data
PDFs support incremental saving: some editors append every edit to the end of the file instead of rewriting it, so the file grows with every save — and old, “deleted” content is still in there.
Fix: any full rewrite of the file discards the accumulated history. Light compression does exactly that as a side effect.
Find your culprit in 10 seconds
Can you select the text in your PDF? If you can’t, it’s a scan — expect big savings from Strong mode. If you can, the weight is in images, fonts or history — start with Light mode and keep your text selectable. Either way, you can check the result instantly: the compressor runs entirely in your browser, so there’s no upload, no queue and no file-size cap.
Facing a hard limit like “max 100 KB” on an application portal? There’s a dedicated guide and tool for that.