0 Replies Latest reply: Apr 6, 2010 6:03 AM by 843804 RSS

    Exception while PDF Parsing through PDFBOX jar

    843804
      While I parsing PDF file, I got the following exception.I used PDFBox-0.7.3 jar for pdf parsing.

      java.io.IOException: expected='endobj' firstReadAttempt='' secondReadAttempt='' org.apache.pdfbox.io.PushBackInputStream@1027b4d
           at org.apache.pdfbox.pdfparser.PDFParser.parseObject(PDFParser.java:485)
           at org.apache.pdfbox.pdfparser.PDFParser.parse(PDFParser.java:165)
           at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:847)
           at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:814)
           at org.apache.pdfbox.pdmodel.PDDocument.load(PDDocument.java:785)
           at TestPDFReader.extractPDFContents(TestPDFReader.java:50)
           at TestPDFReader.getFileContents(TestPDFReader.java:29)
           at TestPDFReader.main(TestPDFReader.java:70)

      Can any one suggest me solution for it.

      Thanks in Advance
      Sandeep Verma