Apache PDFBox - Java PDF Library

Apache PDFBox is an open-source Java PDF library for working with PDF
documents. This project allows creation of new PDF documents, manipulation
of existing documents, and extraction of content from documents. Apache
PDFBox also includes a command-line utility.

Features
* PDF to text extraction
* Merge PDF documents
* PDF document encryption/decryption
* Lucene search engine integration
* Fill in form data (FDF and XFDF)
* Create a PDF from a text file
* Create images from PDF pages
* Print a PDF

WWW: http://pdfbox.apache.org/