> Hello!
>
> I'm a newbie on itext and i'd like to know if it's possible to extract
> plain text from an existing PDF document!
> Further i'd like to know if it's possible to extract all hyperlinks (http://,
> https://, ftp://, mailto:)??

What Bruno says is true, but I've had pretty good success extracting links with iText. It doesn't give you much high-level support, so you'll have to read the PDF specification, but the iText primitives are enough to find each page's annotation array (/Annots), check each annotation's subtype, get its action dictionary, and so on.

Brian

_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
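
A rough sketch of the traversal described in the reply above. This is not iText code: to keep it self-contained, PDF dictionaries are modeled as plain java.util.Map objects keyed by PDF name (without the leading slash) and PDF arrays as java.util.List. The class name LinkWalk and the method extractUris are made up for illustration. The key names (Annots, Subtype, Link, A, S, URI) are the ones the PDF specification uses; with iText you would read the same keys from its own dictionary and array objects instead.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper: walks pages -> /Annots -> link annotations -> URI actions.
// Maps and Lists stand in for PDF dictionaries and arrays (an assumption of this sketch).
final class LinkWalk {

    static List<String> extractUris(List<Map<String, Object>> pages) {
        List<String> uris = new ArrayList<>();
        for (Map<String, Object> page : pages) {
            Object annots = page.get("Annots");
            if (!(annots instanceof List)) {
                continue; // this page has no annotation array
            }
            for (Object entry : (List<?>) annots) {
                if (!(entry instanceof Map)) {
                    continue;
                }
                Map<?, ?> annot = (Map<?, ?>) entry;
                // Only link annotations are of interest here.
                if (!"Link".equals(annot.get("Subtype"))) {
                    continue;
                }
                // A link either has an action dictionary (/A) or a destination (/Dest).
                Object action = annot.get("A");
                if (!(action instanceof Map)) {
                    continue;
                }
                Map<?, ?> actionDict = (Map<?, ?>) action;
                // Keep URI actions only; GoTo and other action types are skipped.
                if (!"URI".equals(actionDict.get("S"))) {
                    continue;
                }
                Object uri = actionDict.get("URI");
                if (uri instanceof String) {
                    uris.add((String) uri);
                }
            }
        }
        return uris;
    }
}
```

mailto: targets are stored as URI actions as well, so the same walk should pick them up. Links that point inside the document (a /Dest entry or a GoTo action) are skipped on purpose.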