Hi

I'm try to extract PDF Text content automatically,

  The problem is when I encounter Text in different table structure, I 

 Couldn't differentiate between headers and columns values,

 I'm using Eclipse as JAVA2 IDE and most popular PDF Lib. (JPedal, iText,
PDFOne 

 Java, PDFBox) all these Libraries extract Text as fine but doesn't  Give me
capabilities

To Detect PDF Table in table format (headers and columns).

 

So I will appreciate any help from your side

 

thanks

 

 

------------------------------------------------------------------------------
Download Intel® Parallel Studio Eval
Try the new software tools for yourself. Speed compiling, find bugs
proactively, and fine-tune applications for parallel performance.
See why Intel Parallel Studio got high marks during beta.
http://p.sf.net/sfu/intel-sw-dev
_______________________________________________
iText-questions mailing list
iText-questions@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/itext-questions

Buy the iText book: http://www.1t3xt.com/docs/book.php
Check the site with examples before you ask questions: 
http://www.1t3xt.info/examples/
You can also search the keywords list: http://1t3xt.info/tutorials/keywords/

Reply via email to