Hello,
Please explain your problem in englisch and mention what PDFBox version
you are using. Apparently it's about text extraction, read this first:
https://pdfbox.apache.org/3.0/faq.html#how-come-i-am-getting-gibberish(g38g43g36g51g5)-when-extracting-text%3F
Try extracting your test with
public List> readPdfString(File file ,int pageNum) {
List> result = Collections.synchronizedList(new
ArrayList>());
PDDocument doc = null;
PDDocument originalDocument=null;
try{
// 创建新的 PDF 文档
originalDocument=PDDocument.load(file);
doc = new PDDocument();
// 遍历原始文档的页面并复制到新文档
for
2 matches
Mail list logo