← All guides

What is a PDF File? (And Why Everyone Uses It)

pdfexplainerbasics

PDF stands for “Portable Document Format.” That technical name actually tells you most of what you need to know about why PDFs exist: they’re documents that travel between computers without breaking.

Here’s the plain-English version of what a PDF is, why it’s different from a Word document, and why nearly every formal document you receive comes as a PDF.

The problem PDFs solve

Imagine you write a beautiful document in Microsoft Word — careful fonts, precise spacing, a logo at the top, a footer at the bottom, a chart in the middle. You email it to someone. They open it on their computer, in their version of Word, with their installed fonts and screen resolution.

What they see is different from what you see. Maybe their version of Word doesn’t have the font you used, so it substituted Arial — and now everything is slightly larger. Maybe their screen is wider, so the text reflows differently. Maybe the chart you embedded looks broken because it referenced something only on your machine.

This was a real, daily problem in the 1990s. Documents looked different on every machine. Designers, lawyers, and businesses needed a way to say “this document, exactly as I designed it, no matter what computer opens it.”

Adobe invented PDF in 1993 to solve exactly this. A PDF stores not just the text content but the exact appearance — every character at every position, every font embedded inside the file, every image at its precise dimensions. Open a PDF on any device, any OS, any reader, and you see the same thing.

How a PDF differs from a Word document

The key technical difference: PDFs are fixed; Word documents are reflowable.

A Word document (.docx) is essentially structured content. It says: “Here’s a heading. Here’s a paragraph. Here’s a list.” The Word program (or Google Docs, or LibreOffice) figures out how to lay it out on the screen or page when you open it. Different software might lay it out slightly differently.

A PDF is essentially a rendered page. It says: “At position (100, 200), put the letter ‘A’ in Times New Roman 12pt. At position (108, 200), put the letter ‘B’.” There’s no interpretation — every character’s position is locked in.

This is why:

  • You can edit a Word document but it’s hard to edit a PDF
  • Word documents change appearance across software but PDFs don’t
  • Word documents are typically smaller for text but PDFs are smaller for image-heavy content
  • Word documents reflow when printed at a different page size but PDFs stay the same

What’s inside a PDF

If you opened a PDF file in a text editor (don’t — it would look like garbage), you’d see a structured format containing:

  • Page descriptions: detailed instructions for drawing each page (text positioning, lines, shapes, colors)
  • Embedded fonts: copies of all fonts the document uses, so they display the same way on every machine
  • Embedded images: photos, charts, logos compressed and stored inside the file
  • Metadata: title, author, creation date, software used to create the PDF
  • Optional features: bookmarks, hyperlinks, form fields, annotations, digital signatures

The “embedded fonts” part is the magic: even if the recipient’s computer doesn’t have the fonts you used, the PDF includes them. Display works identically.

Why nearly every formal document is a PDF

Six reasons:

1. Consistent appearance. What you see is what they see. No “your version is broken” issues.

2. Universal compatibility. Every operating system has a built-in PDF reader. Every modern browser opens PDFs. PDF support is one of the few things Apple, Microsoft, and Google all agree on.

3. Printable. PDFs were designed around fixed page sizes. Print a PDF and you get exactly what you see on screen.

4. Hard to modify accidentally. A recipient can’t easily edit a PDF and resave it. The document arrives finalized.

5. Searchable when properly created. Text-based PDFs (from Word, etc.) support full-text search. You can Ctrl+F within a PDF the same as in any document.

6. Standardized. PDF/A is a specific subset of PDF used for long-term archival. Many government and legal records are required to be in PDF/A specifically because the format is stable and predictable.

When PDFs are NOT the right choice

PDFs aren’t perfect for every use case:

  • Documents that need to be edited: use Word/Google Docs, not PDF
  • Real-time collaboration: use Google Docs, Microsoft 365 web
  • Content for websites: use HTML, not PDF
  • Short messages: just use email body text
  • Notes you might revisit later: keep them as plain text or in a notes app

PDFs shine when the document is “done” and you want it to stay that way. They’re worse when you’re still working on it.

Things you can do with PDFs

Once you have a PDF, common operations include:

All of these run in your browser without uploading the PDF anywhere.

The two flavors of PDF — text-based vs scanned

This is the most important distinction for working with PDFs:

Text-based PDFs were created from text — Word documents, web pages, code, anything where the source had selectable text. The PDF stores the actual text characters. You can select, copy, and search within these PDFs.

Scanned (image-based) PDFs were created from images — paper scans, phone photos, faxes. The PDF stores pictures of pages. You can’t select text in these PDFs because there’s no text, just images.

To make a scanned PDF text-functional, you need OCR — Optical Character Recognition that reads text from the page images and adds a searchable text layer.

See our guide on Why Your Scanned PDF Won’t Let You Copy Text for the full explanation.

How PDFs get created

Most PDFs come from one of these workflows:

  • From Word/Google Docs/Pages: File → Save As PDF (or Export to PDF). Built into every modern word processor.
  • From web pages: Print → “Save as PDF” in any browser’s print dialog
  • From any application that can print: same print-to-PDF approach
  • From scanners and phone scan apps: physical paper → PDF directly
  • From images: JPG to PDF, PNG to PDF combine images into a multi-page PDF
  • From markdown / plain text: MD to PDF, TXT to PDF for content already in those formats

For most users: “Save as PDF” in your word processor is how you make a PDF. For batch conversions or specific transformations, the browser-based tools above.

File size expectations

How big should a PDF be?

  • Text-only document, 10 pages: 50-200 KB
  • Text with embedded images, 10 pages: 500 KB - 5 MB
  • Scanned document, 10 pages at 150 DPI: 5-20 MB
  • Scanned document, 10 pages at 300 DPI: 15-50 MB
  • PDF with many high-quality photos: 10 MB - 100+ MB

If your PDF is unexpectedly large (a “Word doc as PDF” that’s 20 MB), it usually means embedded images at higher resolution than necessary. Run through PDF Compressor to reduce size.

PDF history (briefly)

  • 1993: Adobe launches PDF 1.0
  • 2008: PDF becomes an open standard (ISO 32000). Anyone can implement PDF readers and writers without Adobe’s permission.
  • 2017: PDF 2.0 standard published
  • Today: PDF is the de facto format for “send this document to someone”

The transition to open standard in 2008 is part of why PDF is so universal now. Anyone can make a PDF reader. Browsers do it natively. Operating systems include built-in support. It’s not a proprietary Adobe format anymore.

TL;DR

  • PDF = Portable Document Format, designed so documents look the same on every device
  • Fixed appearance is the key feature — characters locked at exact positions, fonts embedded
  • Universal compatibility — every device, every OS, every browser opens them
  • Best for: finished documents, formal documents, anything to be printed, anything to be shared
  • Worst for: documents that need editing, real-time collaboration, web content
  • Two flavors: text-based (searchable) vs scanned (images only, needs OCR)
  • Tools: merger, splitter, compressor, rotator, OCR for working with existing PDFs