Hi, I am using the PDFBox for one of the application. What I am doing is I am extracting the PDF text from the PDF and generating the TOC entries. But I am facing one problem, that is, if the PDF contains these two characters "✠"(✠) and "Ⓔ"(Ⓔ) then the processpage(PDPage, COSStream) gives an IOException "Unknown encoding for 'UniJIS-UCS2-H' ". Can you let us know is there any way as to overcome this problem?
Regards Natraj
