Re: 遇到一个无法解决的bug

2024-02-05 Thread Tilman Hausherr
Hello, Please explain your problem in englisch and mention what PDFBox version you are using. Apparently it's about text extraction, read this first: https://pdfbox.apache.org/3.0/faq.html#how-come-i-am-getting-gibberish(g38g43g36g51g5)-when-extracting-text%3F Try extracting your test with

遇到一个无法解决的bug

2024-02-05 Thread 软件开发岗位夏志强
public List> readPdfString(File file ,int pageNum) { List> result = Collections.synchronizedList(new ArrayList>()); PDDocument doc = null; PDDocument originalDocument=null; try{ // 创建新的 PDF 文档 originalDocument=PDDocument.load(file); doc = new PDDocument(); // 遍历原始文档的页面并复制到新文档 for