File Format Wiki

PDF format rules

Portable document format for final-layout documents.

Format rules

  • Binary structure
  • Text extraction is not guaranteed
  • Can contain forms, scripts, and metadata

Valid example

Use a trusted PDF generator and validate output.

Invalid example

Treating PDF as plain text

Common errors

  • Assuming PDF accepts syntax from a similar format
  • Using the wrong encoding or line ending
  • Copying invisible characters from rich text
  • Testing only the happy path and not parser errors

Online validation and conversion

Compared with nearby formats

PDF should be chosen for the parser and ecosystem that will consume it. Prefer strict formats for APIs, human-friendly formats for ops config, and signed formats only when verification is required.