The trick is using System.Reflection to expose hidden (private) properties of the PDFbox Page object. Program creates 1 image for each page of a PDF, computes word locations (if PDF is OCR'ed) then ...
PDF Compressor made with Tkinter and Ghostscript. Contribute to n4ff4h/pdf-compressor development by creating an account on GitHub.