PDFdeconstruct™ decomposes PDF files into XML files. The XML output includes:

PDFdeconstruct can be used for:

The PDFdeconstruct output format is described in the manual.

PDFdeconstruct is a cross-platform command-line tool, suitable for use on servers or for batch-mode processing.

Supported platforms:

See also: For conversion to plain text (instead of XML), try our XpdfText library.

Contact Glyph & Cog for more information including evaluation copies.