Return to Snippet

Revision: 72660
at October 12, 2017 21:20 by markt22


Initial Code
/**
 * @param args the command line arguments
 * @throws java.io.IOException
 */
public static void main(String[] args) throws IOException {
    File file = new File("");
 
    PDDocument document = PDDocument.load(file);
    
    PDFTextStripper pdfStripper = new PDFTextStripper();

    String text = pdfStripper.getText(document);
    System.out.println(text);
    
    document.close();
}

Initial URL

                                

Initial Description
Opens an existing PDF and extracts all the text from it

Initial Title
PDFBox: Extract all text from a document

Initial Tags

                                

Initial Language
Java