Hi, I recently joined the list because I want to make better use of the various US government forms that are offered in PDF format but tend to bury the information under formatting "junk." For example, fillable US tax forms are available, but I have no idea what utilities exist to extract the information (the numbers you enter) from all the formatting. I've become generally disgusted with formats that obscure information this way, but they are turning up everywhere.
After playing with pdftotext and the various tools available on Cygwin, such as pdftk, I thought it might be worthwhile to learn about more general libraries, and iText looks like it will be quite helpful. I was able to check it out and build it from source quite easily. I even managed to write my own code that uses your parsers to extract text and traverse the document, but it will be a while before I understand PDF structures. However, I am curious to know what command-line utilities there are, open source or not, for loading and extracting form data from PDF files.

Thanks.

Mike Marchywka
586 Saint James Walk
Marietta GA 30067-7165
415-264-8477 (w) <- use this
404-788-1216 (C) <- leave message
989-348-4796 (P) <- emergency only
[email protected]

Note: If I am asking for free stuff, I normally use it for hobby/non-profit information but may use it in investment forums, public and private. Please indicate any concerns if applicable.

Note: hotmail is getting cumbersome, try also [email protected]

iText-questions mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/itext-questions
