PdfParser

PdfParser is a standalone PHP library that provides tools to extract data from PDF files.

Test the API on our demo page.

This project is supported by Actualys.

Features

Features included:

Load/parse objects and headers
Extract metadata (author, description, ...)
Extract text from ordered pages
Support for compressed PDF files
Support for the Mac OS Roman character encoding
Handling of hexadecimal and octal encoding in text sections
PSR-0 compliant (autoloader)
PSR-1 compliant (code styling)

Currently, secured documents are not supported.
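
As an illustration of the metadata and per-page text features listed above, here is a minimal sketch. It assumes the library was installed with Composer, that `document.pdf` (a placeholder name) sits next to the script, and that a failed parse, for example on a secured document, surfaces as an exception.

```php
<?php
// Assumption: Composer's autoloader lives in ./vendor next to this script.
require __DIR__ . '/vendor/autoload.php';

$parser = new \Smalot\PdfParser\Parser();

try {
    // 'document.pdf' is a placeholder path, not a file shipped with the library.
    $pdf = $parser->parseFile(__DIR__ . '/document.pdf');
} catch (\Exception $e) {
    // Unsupported input (such as a secured document) is expected to end up here.
    fwrite(STDERR, 'Could not parse PDF: ' . $e->getMessage() . PHP_EOL);
    exit(1);
}

// Metadata: some values may be arrays, so flatten them before printing.
foreach ($pdf->getDetails() as $property => $value) {
    if (is_array($value)) {
        $value = implode(', ', $value);
    }
    echo $property . ': ' . $value . PHP_EOL;
}

// Text, one page at a time, in document order.
$pageNumber = 0;
foreach ($pdf->getPages() as $page) {
    $pageNumber++;
    echo '--- Page ' . $pageNumber . ' ---' . PHP_EOL;
    echo $page->getText() . PHP_EOL;
}
```

Which metadata keys appear depends on what the PDF itself declares, so the loop prints whatever is present rather than assuming specific fields.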

This library is actively maintained. The original author is not developing new features at the moment, but pull requests that add or extend functionality are welcome!

Documentation

Read the documentation on the website.

The original PDF reference files can be downloaded from this URL: http://www.adobe.com/devnet/pdf/pdf_reference_archive.html

For developers: Please read DEVELOPER.md for more information about local development of the PDFParser library.

Installation

Using Composer

Obtain Composer
Run composer require smalot/pdfparser
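
Once the package is installed, a first script might look like the sketch below. The autoloader path and the `document.pdf` file name are assumptions about the project layout, not something the library prescribes.

```php
<?php
// Assumption: this script sits in the project root, next to Composer's vendor/ directory.
require __DIR__ . '/vendor/autoload.php';

use Smalot\PdfParser\Parser;

$parser = new Parser();

// 'document.pdf' is a placeholder; point this at any unsecured PDF.
$pdf = $parser->parseFile(__DIR__ . '/document.pdf');

// Print the text of the whole document.
echo $pdf->getText();
```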

Use alternate file loader

If you can't use Composer, you can include alt_autoload.php-dist in your project. It loads all required files at once, after which you can use the PDFParser class and others.
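
A minimal sketch of the Composer-free setup follows. It assumes the library sources were unpacked into a `pdfparser/` directory next to the script; both that directory name and `document.pdf` are placeholders.

```php
<?php
// Assumption: the library was unpacked into ./pdfparser, which contains alt_autoload.php-dist.
require_once __DIR__ . '/pdfparser/alt_autoload.php-dist';

$parser = new \Smalot\PdfParser\Parser();

// 'document.pdf' is a placeholder path.
$pdf = $parser->parseFile(__DIR__ . '/document.pdf');

echo $pdf->getText();
```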

License

This library is under the LGPLv3 license.
