[iText-questions] Can't extract russian text from pdf

2008-09-11 Thread Сергій Карпенко
Hi I wrote a code few month ago. It works well with latin font. But do not extract russian, and seems any not-latin symbols. There is a code:   PdfReader reader = new PdfReader(is, "".getBytes());   PRTokeniser token;   byte[] m = reader.getMetadata();   String sm = n

Re: [iText-questions] iText and PDF/A metadata (Leonard Rosenthol)

2008-09-05 Thread Сергій Карпенко
Thanks a lot, Leonard I found PdfReader.getMetadata() It returns a byte array with XML. Off course, I can parse it (And I already done it). But getInfo() is much more useful - it returns a HashMap of properties. There is a reason make parsing on iText side. Becouse iText-deve

[iText-questions] iText and PDF/A metadata

2008-09-05 Thread Сергій Карпенко
Hi all I use iText for extract PDF text and PDF-file metadata. In last case code is simple PdfReader reader = new PdfReader(is,"".getBytes()); //Read the file metadata HashMap info = reader.getInfo(); reader.close(); String author = (String)info.get("Author");