pdfinfo - Portable Document Format (PDF) document information extractor (version 3.03)
Contents
Description
Pdfinfo prints the contents of the ´Info' dictionary (plus some other useful information) from a Portable
Document Format (PDF) file.
If PDF-file is ´-', it reads the PDF file from stdin.
The ´Info' dictionary contains the following values:
title
subject
keywords
author
creator
producer
creation date
modification date
In addition, the following information is printed:
custom metadata (yes/no)
metadata stream (yes/no)
tagged (yes/no)
userproperties (yes/no)
suspects (yes/no)
form (AcroForm / XFA / none)
javascript (yes/no)
page count
encrypted flag (yes/no)
print and copy permissions (if encrypted)
page size
file size
linearized (yes/no)
PDF version
metadata (only if requested)
The options -listenc, -meta, -js, -struct, and -struct-text only print the requested information. The
'Info' dictionary and related data listed above is not printed. At most one of these five options may be
used.
Exit Codes
The Xpdf tools use the following exit codes:
0 No error.
1 Error opening a PDF file.
2 Error opening an output file.
3 Error related to PDF permissions.
99 Other error.
Name
pdfinfo - Portable Document Format (PDF) document information extractor (version 3.03)
Options
-fnumber
Specifies the first page to examine. If multiple pages are requested using the "-f" and "-l"
options, the size of each requested page (and, optionally, the bounding boxes for each requested
page) are printed. Otherwise, only page one is examined.
-lnumber
Specifies the last page to examine.
-box Prints the page box bounding boxes: MediaBox, CropBox, BleedBox, TrimBox, and ArtBox.
-meta Prints document-level metadata. (This is the "Metadata" stream from the PDF file's Catalog
object.)
-custom
Prints custom and standard metadata.
-js Prints all JavaScript in the PDF.
-struct
Prints the logical document structure of a Tagged-PDF file.
-struct-text
Print the textual content along with the document structure of a Tagged-PDF file. Note that
extracting text this way might be slow for big PDF files. (Implies -struct.)
-url Print all URLs in the PDF. Only the URL types supported by Poppler are listed. Currently, this is
limited to Annotations. Note: only URLs referenced by the PDF objects such as Link Annotations are
listed. pdfinfo does not attempt to extract strings matching http://... from the text content.
-isodates
Prints dates in ISO-8601 format (including the time zone).
-rawdates
Prints the raw (undecoded) date strings, directly from the PDF file.
-dests Print a list of all named destinations. If a page range is specified using "-f" and "-l", only
destinations in the page range are listed.
-encencoding-name
Sets the encoding to use for text output. This defaults to "UTF-8".
-listenc
Lits the available encodings
-opwpassword
Specify the owner password for the PDF file. Providing this will bypass all security
restrictions.
-upwpassword
Specify the user password for the PDF file.
-v Print copyright and version information.
-h Print usage information. (-help and --help are equivalent.)
See Also
pdfdetach(1), pdffonts(1), pdfimages(1), pdftocairo(1), pdftohtml(1), pdftoppm(1), pdftops(1), pdftotext(1) pdfseparate(1), pdfsig(1), pdfunite(1) 15 August 2011 pdfinfo(1)
Synopsis
pdfinfo [options] [PDF-file]
