The Apache PDFBox library is an open source Java tool for working with PDF documents. It lets you create new PDF documents, manipulate existing ones, and extract their content. PDFBox also includes several command line utilities. PDFBox is published under the Apache License, Version 2.0.
Extract Text: Extract Unicode text from PDF files.
Split & Merge: Split a single PDF into many files or merge multiple PDF files.
Fill Forms: Extract data from PDF forms or fill a PDF form.
Preflight: Validate PDF files against the PDF/A-1b standard.
Print: Print a PDF file using the standard Java printing API.
Save as Image: Save PDFs as image files, such as PNG or JPEG.
Create PDFs: Create a PDF from scratch, with embedded fonts and images.
Signing: Digitally sign PDF files.
Apache PDFBox 1.8.14 has been released. It includes the following fixes and improvements:
[PDFBOX-3625] – Use correct file encoding when saving fdf data
[PDFBOX-3724] – Wrong size in rendering of some artifacts
[PDFBOX-3733] – image added to pdf by tool AddImageToPDF is turn-overed
[PDFBOX-3739] – ImageWriterIterator.next() and ImageReaderIterator.next() can
[PDFBOX-3742] – Unknown dir object c='>' cInt=62 peek='>' peekInt=62
[PDFBOX-3813] – PDF Box crash when using PDFToImage or PDFDebug
[PDFBOX-3843] – Rect of Annotation contains indirect elements + Patch
[PDFBOX-3889] – javax.crypto.BadPaddingException: Given final block not properly
[PDFBOX-3909] – End of inline image not detected
[PDFBOX-3919] – Infinite loop while parsing (2)
[PDFBOX-3933] – PDFParser swallows a CR at the end of a stream
[PDFBOX-3942] – ClassCastException in getOptionalContentGroups
[PDFBOX-4005] – Incorrect use of PDNumberTreeNode in PDPageLabels
[PDFBOX-4011] – BBox in signature forms has wrong order
[PDFBOX-4027] – IndexOutOfBoundsException when XObject form matrix has only 5
[PDFBOX-4030] – ClassCastException when matrix array has indirect objects
[PDFBOX-4064] – cm operator has 7 numbers
[PDFBOX-4083] – Line annotation /LL, /LLE and /LLO have wrong default values
[PDFBOX-4088] – Root/StructTreeRoot/K/S must be name, not string (merge)
[PDFBOX-3635] – Remove workaround to avoid SSLHandshakeException
[PDFBOX-3865] – Add OWASP dependency-check to build
[PDFBOX-4055] – Output info when PDFBox JBIG2 ImageIO is released