Trouble opening file with Java to use with PDFBox

10 次查看(过去 30 天)
I am trying to use the PDFBox library to read the contents of PDF files, but I can't seem to open any of the files in the correct format for PDFBox to use. I'm using the following code to open each document:
javaaddpath('...\pdfParseDemo\pdfbox-2.0.0.jar')
javaaddpath('...\FontBox-0.1.0\FontBox-0.1.0\lib\FontBox-0.1.0.jar')
pdfname = '...\example.pdf';
import java.io.*;
pdfdoc = org.apache.pdfbox.pdmodel.PDDocument; %Define a PDDocument object placeholder
pdfdoc.load(FileInputStream(pdfname)); %Load the PDF file
However, this seems to return an empty object. When I try to query any of the file's properties or contents, it always returns an empty or zero value. I suspect the problem is with how I'm opening the file, because I know PDFBox has been successfully used natively with Java in many cases. Unfortunately the documentation for interfacing with Matlab is very sparse, so I'm not sure what I should be doing differently. Is there some kind of weirdness with how Matlab handles Java file input calls?

采纳的回答

Elias Gule
Elias Gule 2016-5-9
Try wrapping your pdfname variable in a java.lang.String variable. This sometimes works:
pdfname = java.lang.String('...\example.pdf');
  2 个评论
Michael Boeckel
Michael Boeckel 2016-5-9
Good suggestion, but no joy, sadly. I tried this with several different variations on the loading call, but none worked.
Michael Boeckel
Michael Boeckel 2016-5-10
Disregard! I tried your suggestion with the Java "File" constructor instead of the FileInputStream constructor, and with a bit of coaxing, that worked! Many thanks good sir!

请先登录,再进行评论。

更多回答(0 个)

类别

Help CenterFile Exchange 中查找有关 Call Java from MATLAB 的更多信息

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by