Re: [iText-questions] get plain text and links

Brian McKeever Thu, 08 Mar 2007 09:32:19 -0800

> Hello!
> I'm a newbie on itext and i'd like to know if it's possible to extract
> plain text from an existing PDF document!
> Further i'd like to know if it's possible to extract all hyperlinks (http://, 
> https://, ftp://, mailto:)??


What Bruno says is true, but I've had pretty good success extracting
links with iText.  It doesn't give much high-level support, so you'll
have to read the Pdf spec, but the iText primitives are enough for you
to find the Annotation array, check its type, get its action
dictionary, etc.

Brian

-------------------------------------------------------------------------
Take Surveys. Earn Cash. Influence the Future of IT
Join SourceForge.net's Techsay panel and you'll get the chance to share your
opinions on IT & business topics through brief surveys-and earn cash
http://www.techsay.com/default.php?page=join.php&p=sourceforge&CID=DEVDEV
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions
Buy the iText book: http://itext.ugent.be/itext-in-action/

Re: [iText-questions] get plain text and links

Reply via email to